Stuart Sierra Program on Law & Technology Columbia Law School

Feb 01, 2023

Stuart Sierra
Program on Law & Technology
Columbia Law School
http://altlaw.org/ - the site
http://lawcommons.org/ - wiki & mailing list
http://columbialawtech.org/ - my employer
Talking Points
● AltLaw
  – History, motivation
  – Data sources
  – Back-end
  – Front-end
● Semantic Web
  – What I've done
  – What I want
  – Problems I see
Data Sources – Large Corpora
● Paul Ohm's corpus, http://bulk.altlaw.org/
  – 7 GB, 200,000+ files harvested from court web sites
● Cornell U.S. Code
  – 748 MB of XML
● http://bulk.resource.org/courts.gov/c/
  – 2 GB, 700,000+ federal cases, XHTML
● http://pacer.resource.org/
  – 736 GB, 2.7 million PDFs, 1.8 million HTML files
Data Sources – Court Web Sites
● 20-40 new cases daily
● PDF, WordPerfect, HTML, plain text
www.supremecourtus.gov
www.ca1.uscourts.gov
www.ca2.uscourts.gov
www.ca3.uscourts.gov
www.ca4.uscourts.gov
www.ca5.uscourts.gov
www.ca6.uscourts.gov
. . .
14 appeals courts total
94 district courts
?? state courts
?? local/other courts
Back-end (1)
[Diagram; labels: Large Corpora, Daily Crawls, Big Merge, Common Data Model]
Back-end (2)
[Diagram; labels: Common Data Model, Big Merge, Enhanced Common Data Model, Citation Graph, Ranking, Clustering, Duplicate Detection, Entity Extraction, Semantic Analysis]
Scaling Stuart
● Java
● Ruby
● Clojure
The Grand Unified Data Model
● Key-value pairs? (files, Berkeley DB)
● Documents? (Solr/Lucene, CouchDB)
● Trees? (XML, JSON, Objects)
● Graphs? (RDF)
● Tables? (SQL)
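To make the five candidate shapes concrete, here is a rough sketch of a single record expressed in each of them. The record, its field names, and the table layout are all invented for illustration; none of this is AltLaw's actual schema, and the "graph" form is plain tuples rather than real RDF.

```python
import json
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical case record; every field name and value is made up for this sketch.
case = {"id": "case-1", "title": "Example v. Sample", "court": "ca3",
        "cites": ["case-2", "case-3"]}

# 1. Key-value pair: one opaque value stored under one key (files, Berkeley DB).
key_value = (case["id"], json.dumps(case, sort_keys=True))

# 2. Document: a flat, denormalized set of fields (the Solr/Lucene shape).
document = {"id": case["id"], "title": case["title"], "court": case["court"],
            "cites": " ".join(case["cites"])}

# 3. Tree: nested elements (XML here; JSON or objects look similar).
root = ET.Element("case", id=case["id"])
ET.SubElement(root, "title").text = case["title"]
ET.SubElement(root, "court").text = case["court"]
for target in case["cites"]:
    ET.SubElement(root, "cites", ref=target)
tree_xml = ET.tostring(root, encoding="unicode")

# 4. Graph: subject-predicate-object triples (the RDF shape, without an RDF library).
triples = [(case["id"], "title", case["title"]),
           (case["id"], "court", case["court"])]
triples += [(case["id"], "cites", target) for target in case["cites"]]

# 5. Tables: normalized rows in SQL, with citations in their own table.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE cases (id TEXT PRIMARY KEY, title TEXT, court TEXT)")
db.execute("CREATE TABLE citations (source TEXT, target TEXT)")
db.execute("INSERT INTO cases VALUES (?, ?, ?)",
           (case["id"], case["title"], case["court"]))
db.executemany("INSERT INTO citations VALUES (?, ?)",
               [(case["id"], target) for target in case["cites"]])
cited = [row[0] for row in db.execute(
    "SELECT target FROM citations WHERE source = ? ORDER BY target",
    (case["id"],))]
```

The same four facts appear in every form; what changes is how much structure the storage layer itself understands.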
● “Disk is the new tape.”
  – NO random access
  – NO disk seeks
  – Run at full disk transfer rate, not seek rate
● Data must be splittable
● Process each record in isolation
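The constraints above can be sketched in a few lines: read each split strictly front to back, handle every record without looking at any other, and combine per-split results afterwards. This is only an illustration in plain Python, not Hadoop code; the tab-separated record layout and all function names are assumptions.

```python
import io
from collections import Counter

def split_records(stream):
    """Yield one record per line, reading the stream front to back (no seeks)."""
    for line in stream:
        line = line.rstrip("\n")
        if line:
            yield line

def map_record(record):
    """Handle one record in isolation: emit (court, 1) using only that record."""
    case_id, court = record.split("\t")
    return court, 1

def count_by_court(stream):
    """Tally the mapped records of a single split."""
    counts = Counter()
    for record in split_records(stream):
        court, n = map_record(record)
        counts[court] += n
    return counts

# Two independent splits of a hypothetical "case id <TAB> court" file.
split_a = io.StringIO("case-1\tca3\ncase-2\tca2\n")
split_b = io.StringIO("case-3\tca3\n")

# Splits can be processed separately (even on different machines) and merged later.
total = count_by_court(split_a) + count_by_court(split_b)
```

Because `map_record` never consults another record, the splits could be handed to different workers in any order and the merged counts would come out the same.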
Secret Weapons
● Hadoop – open-source MapReduce
● Amazon EC2 – cluster by the hour
● Clojure – Lisp on the JVM
● Solr – full-text search + document storage; no SQL database!
● Ruby on Rails
The Grand Unified Data Model
● Key-value pairs? (files, Berkeley DB)
● Documents? (Solr/Lucene, CouchDB)
● Trees? (XML, JSON, Objects)
● Graphs? (RDF)
● Tables? (SQL)
Mismatch
● Hadoop
  – Disk is the new tape
  – Flat key/value files
  – Isolated records
● Solr / Lucene
  – Denormalized
  – Flat documents
● RDF
  – Normalized
  – Random access
  – Graph structure
  – Linked records
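One way to see the mismatch is to try turning linked, normalized triples into the flat, denormalized documents a search index wants. The sketch below does that for a handful of invented triples; the predicate names and identifiers are hypothetical, and the in-memory index is exactly the kind of random-access lookup that does not fit the "isolated records" style.

```python
from collections import defaultdict

# Hypothetical triples: two cases that both link to one shared court resource.
triples = [
    ("case-1", "title", "Example v. Sample"),
    ("case-1", "court", "court-ca3"),
    ("case-2", "title", "Another v. Example"),
    ("case-2", "court", "court-ca3"),
    ("court-ca3", "name", "Third Circuit"),
]

def index_by_subject(triples):
    """Group predicate/object pairs under their subject (needs random access)."""
    index = defaultdict(dict)
    for subject, predicate, obj in triples:
        index[subject][predicate] = obj
    return index

def flatten_cases(triples):
    """Follow each case's court link and copy the court name into the case document."""
    index = index_by_subject(triples)
    docs = []
    for subject in sorted(index):
        props = index[subject]
        if "court" not in props:
            continue  # not a case in this toy data set
        court = index.get(props["court"], {})
        docs.append({"id": subject,
                     "title": props["title"],
                     "court_name": court.get("name", "")})
    return docs

docs = flatten_cases(triples)
```

The court name is stored once in the graph but copied into every flat document, which is the normalized versus denormalized trade-off named on the slide.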
Semantic Web – What I Want
● Publish linked data for others
● Accept new data without writing new parsers/scrapers
● Richer internal data model
● Inference over multiple data sources
AltLaw on the Semantic Web
● Persistent URIs for federal courts
  – e.g. http://id.altlaw.org/courts/us/fed/app/3
  – 303 redirects to HTML/RDF
● Beginnings of an ontology
  – http://github.com/lawcommons/altlaw-vocab
  – Extension of Dublin Core & Bibliontology
● Semantic web crawler
  – Output uses “HTTP Vocabulary in RDF”
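The 303 pattern for persistent URIs can be sketched as a small function: the identifier URI never returns a body itself, it answers "303 See Other" and points at an HTML or RDF document depending on the Accept header. The target host (example.org), the `/doc` prefix, and the `.html`/`.rdf` suffixes are assumptions made for this sketch, not AltLaw's real layout, and the Accept handling is deliberately naive (no q-values).

```python
def negotiate(path, accept_header):
    """Return (status, headers) for a request to a persistent identifier URI."""
    # Naive check: serve RDF only when the client names the RDF/XML media type.
    wants_rdf = "application/rdf+xml" in accept_header
    suffix = ".rdf" if wants_rdf else ".html"
    # Hypothetical document location; a real site would choose its own layout.
    location = "http://example.org/doc" + path + suffix
    # 303 See Other: the identifier names a thing, the Location describes it.
    return 303, {"Location": location, "Vary": "Accept"}

browser = negotiate("/courts/us/fed/app/3", "text/html,application/xhtml+xml")
crawler = negotiate("/courts/us/fed/app/3", "application/rdf+xml")
```

A browser and a semantic web crawler asking for the same court URI are both redirected, but to different representations of the same resource.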
Questions
● What's in it for you?
  – How do you want my data?
    ● Bulk RDF/XML downloads
    ● RDFa embedded in HTML
    ● SPARQL endpoint
  – What would you do with it?
● What's in it for me?
  – Universal data model
  – Less data transformation
