Firefox/Input/Reduce the Abuse
From MozillaWiki
Contents
Reduce the abuse
Summary
Input gets a non-trivial amount of hate speech and other abusive responses. Per our community guidelines and our legal notices (both are linked to in the footer of Input, we should be deleting or redacting egregious examples.
Up until now, we had no good way of identifying and dealing with such things. This project will yield the infrastructure to allow us to measure and deal with inappropriate Input content.
Status
- Project owner: Will Kahn-Greene
- Status: In-progress
History
- FIXME: Fill in pre-history
- September 3rd, 2014: Wrote up project page
- September 8th, 2014: Pushed prototype classifier into production to see how well it works. Pulling data now.
Requirements
internship project (2014q3)
Requirements:
- build a Python library that can be used to classify texts as spam/ham/abuse
- integrate it into Fjord for testing/honing and figuring out our options
Anti-requirements:
- we won't remove or otherwise change responses based on classification--this is purely a research phase
Bugs:
Tracker bug: https://bugzilla.mozilla.org/show_bug.cgi?id=1062429
ID | Summary | Priority | Status |
---|---|---|---|
1062436 | classifier flags for responses | -- | RESOLVED |
1062439 | post_save celery task for classifying responses | -- | RESOLVED |
1062444 | generate classifier training data | -- | RESOLVED |
1062453 | create analyzer view for examining classification data | -- | RESOLVED |
1062455 | add spicedham to vendor/ | -- | RESOLVED |
1063825 | implement spicedham backend for fjord | -- | RESOLVED |
6 Total; 0 Open (0%); 6 Resolved (100%); 0 Verified (0%);
v1 (2014q4)
Depends on outcome of internship project.
Future possibilities
- "flag as spam/abuse" buttons on the dashboard allowing users who are authenticated and authorized to flag items as spam/abuse