OpenNews/hackdays/storyandalgorithm/conditionofanonymity: Difference between revisions

From MozillaWiki
Jump to navigation Jump to search
(Project that displays all reasons ("because clauses") that sources were given anonymity in the New York Times from January 1, 2000 to the present.)
(Created description.)
Line 1: Line 1:
<ul><li><b>Project name:</b>  
<ul><li><b>Project name: Condition of Anonymity</b>  
</li></ul>
</li></ul>
<ul><li><b>Your team:</b> Please list project team members
<ul><li><b>Your team:</b> Daniel X. O'Neil and Daniel McLaughlin
</li></ul>
</li></ul>
<ul><li><b>Project URL(s), if applicable:</b>
<ul><li><b>Project URL(s), if applicable:</b>
</li><li><b>Hashtag, if #relevant:</b>  
</li><li><b>Hashtag, if #relevant:</b>  
</li></ul>
</li></ul>
<ul><li><b>What are you building:</b> What will the thing you are creating do, enable or solve, in 1 human-readable paragraph
<ul><li><b>What are you building:</b> We're building a method for consuming all articles published in the New York Times that contain statements from anonymous sources, a website to display the reasons the source was given anonymity ("because clause"), the snippet in which that clause appears, the description of the source, and the information provided by source. The corpus is all NYT articles since January 1, 2000 that contain the phrase, "condition of anonymity" or "anonymity because". We're also streaming all new articles containing those phrases and streaming the clauses to Twitter [https://twitter.com/#!/conditionof @conditionof] , along with links to the full snippet on our site. We also allow users to provide guesses as to  the identity of the sources, blind item-style.
</li></ul>
</li></ul>
<ul><li><b>Who is it for:</b> Describe your target user audience, e.g. "Everyone who uses the internet", "people in high risk environments", "People who still read 'Peanuts'"
<ul><li><b>Who is it for:</b> This site is for New York Times aficionados, people who like blind items, and people who dig getting data from unstructured text.
</li></ul>
</li></ul>
<ul><li><b>Your goal for this weekend:</b> where are you trying to get to by 2:45pm Sunday in terms of features, functionality or other creations
<ul><li><b>Your goal for this weekend:</b> Pull the relevant articles (done), analyze text (nearly done), publish the processed text (with snippet, description of source, anonymity reason, and information provided by source) in some fashion. Later: organize this data into an interface that allows users to provide guesses on the source and stream the because clauses on Twitter.
</li></ul>
</li></ul>
<ul><li><b>Your starting point:</b> Are you writing from scratch? Designing a new concept? Modeling a data space? Extending a codebase or library? Adding a feature to a platform? Combining tools?
<ul><li><b>Your starting point:</b> Using <a href="http://nltk.org/">Natural Language Toolkit</a> in Python and the <a href="http://developer.nytimes.com/docs/article_search_api">New York Times Article Search API</a>.
</li></ul>
</li></ul>
<ul><li><b>Anything else we should know:</b> At most 1 non-long paragraph of useful additional context
<ul><li><b>Anything else we should know:</b> Nope; that's it.
</li></ul>
</li></ul>

Revision as of 01:59, 17 June 2012

  • Project name: Condition of Anonymity
  • Your team: Daniel X. O'Neil and Daniel McLaughlin
  • Project URL(s), if applicable:
  • Hashtag, if #relevant:
  • What are you building: We're building a method for consuming all articles published in the New York Times that contain statements from anonymous sources, a website to display the reasons the source was given anonymity ("because clause"), the snippet in which that clause appears, the description of the source, and the information provided by source. The corpus is all NYT articles since January 1, 2000 that contain the phrase, "condition of anonymity" or "anonymity because". We're also streaming all new articles containing those phrases and streaming the clauses to Twitter @conditionof , along with links to the full snippet on our site. We also allow users to provide guesses as to the identity of the sources, blind item-style.
  • Who is it for: This site is for New York Times aficionados, people who like blind items, and people who dig getting data from unstructured text.
  • Your goal for this weekend: Pull the relevant articles (done), analyze text (nearly done), publish the processed text (with snippet, description of source, anonymity reason, and information provided by source) in some fashion. Later: organize this data into an interface that allows users to provide guesses on the source and stream the because clauses on Twitter.
  • Anything else we should know: Nope; that's it.