OpenNews/hackdays/storyandalgorithm/conditionofanonymity: Difference between revisions

Revision as of 16:19, 17 June 2012

Your team: Daniel X. O'Neil, Daniel McLaughlin, Hong Qu, and Gabriel Floirit

Project URL(s), if applicable: Info will be published to a Heroku project and streamed to our Twitter account: @conditionof

What are you building: We're building a method for consuming all New York Times articles that contain statements from anonymous sources, and a website that displays the reason the source was given anonymity (the "because clause"), the snippet in which that clause appears, the description of the source, and the information the source provided. The corpus is all NYT articles since January 1, 2000 that contain the phrase "condition of anonymity" or "anonymity because". We're also watching for new articles containing those phrases and streaming the clauses to Twitter at @conditionof, along with links to the full snippet on our site.
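As a rough illustration of the extraction step, here is a minimal sketch that finds sentences containing either phrase and pulls out the because clause. It is not the team's code: it uses only Python's `re` module in place of the Natural Language Toolkit, and the names `split_sentences`, `extract_because_clauses`, and both patterns are hypothetical. The sentence splitter is deliberately naive and will break on abbreviations such as "Mr."; a real tokenizer would do better.

```python
import re

# The two phrases the project uses to select articles.
TRIGGER = re.compile(r"condition of anonymity|anonymity because", re.IGNORECASE)

# Assumed pattern: the clause starts at "because" right after "anonymity"
# and runs to the next period or semicolon.
BECAUSE = re.compile(r"anonymity\s+(because\b[^.;]*)", re.IGNORECASE)


def split_sentences(text):
    """Naive sentence splitter (a stand-in for a proper tokenizer)."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [p.strip() for p in parts if p.strip()]


def extract_because_clauses(text):
    """Return one record per sentence that mentions anonymity."""
    records = []
    for sentence in split_sentences(text):
        if not TRIGGER.search(sentence):
            continue
        match = BECAUSE.search(sentence)
        records.append({
            "snippet": sentence,
            # None when anonymity is granted without a stated reason.
            "because_clause": match.group(1).strip() if match else None,
        })
    return records
```

Each record carries the snippet and, when present, the because clause. The description of the source and the information provided would need further parsing that this sketch does not attempt.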

Who is it for: This site is for New York Times aficionados, people who like blind items, and people who dig getting data from unstructured text.

Your goal for this weekend: Pull the relevant articles (done), analyze the text (nearly done), and publish the processed text (with snippet, description of source, anonymity reason, and information provided by the source) in some fashion. Later: organize this data into an interface that lets users guess the source, and stream the because clauses on Twitter.

Your starting point: Using the Natural Language Toolkit in Python and the New York Times Article Search API.
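For the article-pulling step, a query for each phrase might be built along these lines. This is a sketch under assumptions, not the project's code: it targets the current v2 Article Search endpoint, which postdates this page, and `build_search_url` and the placeholder key are hypothetical. Wrapping the phrase in double quotes to request an exact match is also an assumption about the query syntax. The function only constructs the URL and makes no network request.

```python
from urllib.parse import urlencode

# Assumed endpoint: the current v2 Article Search API, not necessarily
# the version available when this page was written.
BASE_URL = "https://api.nytimes.com/svc/search/v2/articlesearch.json"

# The two phrases that define the corpus.
PHRASES = ("condition of anonymity", "anonymity because")


def build_search_url(phrase, api_key, page=0, begin_date="20000101"):
    """Build a search URL for one phrase, starting January 1, 2000."""
    params = {
        "q": f'"{phrase}"',        # quoted for an exact-phrase match (assumption)
        "begin_date": begin_date,  # YYYYMMDD
        "page": page,              # results are paginated
        "api-key": api_key,
    }
    return f"{BASE_URL}?{urlencode(params)}"


# One URL per phrase; "YOUR_KEY" is a placeholder, not a real key.
urls = [build_search_url(p, "YOUR_KEY") for p in PHRASES]
```

Walking the `page` parameter upward for each phrase would cover the full corpus, subject to whatever rate limits the API enforces.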

Anything else we should know: We need people who can help review the "because clauses" and mark interesting ones for display. Here's a document we're using to plan our work.

@@ Line 15: / Line 15: @@
 <ul><li><b>Your starting point:</b> Using [http://nltk.org/ Natural Language Toolkit] in Python and the [http://developer.nytimes.com/docs/article_search_api New York Times Article Search API].
 </li></ul>
-<ul><li><b>Anything else we should know:</b> We need a designer to create the Web site and anyone who can help review the "because clauses" and mark interesting ones for display. Here's [https://docs.google.com/document/d/1c7ohf_JKmvaqvUJYJq9vhjIhhsgVl5KY-t-rknfO0QI/edit a document we're using to plan our work].
+<ul><li><b>Anything else we should know:</b> We need people who can help review the "because clauses" and mark interesting ones for display. Here's [https://docs.google.com/document/d/1c7ohf_JKmvaqvUJYJq9vhjIhhsgVl5KY-t-rknfO0QI/edit a document we're using to plan our work].
 </li></ul>