NewsDiffs: Version Controlling the News
Eric Price Margaret Sullivan
MIT The New York Times
2013-03-11 http://newsdiffs.org/
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 1 / 30
NewsDiffs: Version Controlling the News Eric Price Margaret - - PowerPoint PPT Presentation
NewsDiffs: Version Controlling the News Eric Price Margaret Sullivan MIT The New York Times 2013-03-11 http://newsdiffs.org/ Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 1 / 30 NewsDiffs
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 1 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
◮ Reporter writes a rushed story. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
◮ Reporter writes a rushed story. ◮ Editor makes a pass or two. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
◮ Reporter writes a rushed story. ◮ Editor makes a pass or two. ◮ (Another) reporter rewrites the story. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
◮ Reporter writes a rushed story. ◮ Editor makes a pass or two. ◮ (Another) reporter rewrites the story. ◮ Editor makes another pass or two. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
◮ Reporter writes a rushed story. ◮ Editor makes a pass or two. ◮ (Another) reporter rewrites the story. ◮ Editor makes another pass or two.
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
◮ Print: hard to change, daily deadlines. ◮ Online: easy to change, deadline now.
◮ Reporter writes a rushed story. ◮ Editor makes a pass or two. ◮ (Another) reporter rewrites the story. ◮ Editor makes another pass or two.
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 2 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 3 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 3 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 4 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 5 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 6 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 7 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 7 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 7 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 7 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 8 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 9 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 10 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 10 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 10 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 10 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 10 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 10 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 11 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 12 / 30
◮ Running on AFS, a networked file system ◮ Moved version metadata from git to MySQL. ◮ Optimized queries to both backends
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 13 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 14 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 15 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 16 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 16 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 16 / 30
◮ But resource constraints: running on free MIT servers out of my
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 17 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 18 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 18 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 18 / 30
◮ Received (and merged) patch to parse tagesschau.de. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 18 / 30
◮ 20-30% in opinion, books, fashion sections ◮ 55-60% in sports, NY region, world sections
◮ 11% in world section. Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 19 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 20 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 21 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 22 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 23 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 24 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 25 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 26 / 30
1
2
3
◮ Automated tools to sift through the changes for interesting ones. ◮ Someone to use our data for research Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 27 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 28 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 29 / 30
Eric Price, Margaret Sullivan (MIT, NYT) NewsDiffs: Version Controlling the News 2013-03-11 30 / 30