@OpenAustin: WE’RE ON A MISSION (PowerPoint presentation)



slide-1
SLIDE 1

@OpenAustin

slide-2
SLIDE 2

WE’RE ON A MISSION

We’re building the most meaningful, collaborative, and abundant data resource in the world by dismantling the barriers between data and people.

slide-3
SLIDE 3

A NEW KIND OF COMPANY

Benefit Corporation

Notable Benefit Corporations

  • Expanded purpose includes public benefit
  • Requires consideration of shareholders and stakeholders
  • Flexibility to weigh public benefit in sale & IPO decisions
slide-4
SLIDE 4

OUR PRODUCT

A data platform that helps people work together to solve problems faster by creating new ways to discover, prep, and collaborate.

slide-5
SLIDE 5

Jonathan Ortiz

September 19, 2016

OPEN DATA WANTS TO BE LINKED DATA

Because data is a social animal, too.

slide-6
SLIDE 6

There are a HUGE NUMBER of OPEN DATA SETS
slide-7
SLIDE 7

TOO MUCH OF DATA’S GROWTH IS HAPPENING IN SILOS.

slide-8
SLIDE 8

Only available as DOWNLOADABLE DOCUMENTS

slide-9
SLIDE 9

OPEN DATA EXISTS IN MANY FORMATS

CSV TSV XLS

XML KML GML

“JSON” “GeoJSON” “TopoJSON”

BINARY Shapefiles NetCDF

slide-10
SLIDE 10

Few formats convey MEANING about the contents in a way that can be SHARED and EXTENDED.

slide-11
SLIDE 11

Some datasets are available via APIs

But those APIs don’t generally have consistent interfaces or patterns…

slide-12
SLIDE 12

THEY LOAD IT IN

JSON XML CSV

YOU PULL IT OUT

slide-13
SLIDE 13

It is GREAT that this open data EXISTS

slide-14
SLIDE 14

OPEN DATA FOR ALTERNATIVE RISK MODELS = $2B IN LOANS ACROSS 700+ INDUSTRIES

slide-15
SLIDE 15

OIL AND MINING DATA IMPROVES REVENUE FORECASTING = 2X SPENT ON EDUCATION AND HEALTH

PIURA

AREQUIPA

https://www.one.org/international/follow-the-money/case-studies/access-to-oil-and-mining-data-leads-to-doubling-of-education-and-health-budgets/

slide-16
SLIDE 16

AN OPEN DATA BLOGGER IN NYC USED PUBLIC DATA TO PROVE THE NYPD ISSUED 1000s OF PARKING TICKETS IN ERROR

http://iquantny.tumblr.com/post/144197004989/the-nypd-was-systematically-ticketing-legally

slide-17
SLIDE 17
“Mr. Wellington’s analysis identified errors the department made in issuing parking summonses. It appears to be a misunderstanding by officers on patrol of a recent, abstruse change in the parking rules. We appreciate Mr. Wellington bringing this anomaly to our attention.

The department’s internal analysis found that patrol officers who are unfamiliar with the change have observed vehicles parked in front of pedestrian ramps and issued a summons in error. When the rule changed in 2009 to allow for certain pedestrian ramps to be blocked by parked vehicles, the department focused training on traffic agents, who write the majority of summonses. Yet, the majority of summonses written for this code violation were written by police officers. As a result, the department sent a training message to all officers clarifying the rule change and has communicated to commanders of precincts with the highest number of summonses, informing them of the issues within their command.

Thanks to this analysis and the availability of this open data, the department is also taking steps to digitally monitor these types of summonses to ensure that they are being issued correctly.”

slide-18
SLIDE 18

“I was speechless. THIS is what the future of government could look like one day. THIS is what Open Data is all about. THIS was coming from the NYPD, who is not generally celebrated for its transparency, and yet it’s the most open and honest response I have received from any New York City agency to date. Imagine a city where all agencies embrace this sort of analysis instead of deflect and hide from it.”

slide-19
SLIDE 19

JUST IMAGINE WHAT PEOPLE ARE GOING TO DO WITH ALL THOSE DATA SETS

slide-20
SLIDE 20

JUST IMAGINE WHAT MACHINES ARE GOING TO DO WITH ALL THOSE DATA SETS

slide-21
SLIDE 21

But FINDING it, UNDERSTANDING it, and USING it can be a challenge

slide-22
SLIDE 22

This process happens OVER AND OVER AND OVER AGAIN as each data user does it individually

slide-23
SLIDE 23

XML Data Science

So much HUMAN EFFORT is wasted on the WORKING & REWORKING of the SAME DATA
slide-24
SLIDE 24

The End

slide-25
SLIDE 25

What is LINKED DATA?

slide-26
SLIDE 26

IMAGINE RELEARNING WEB BROWSING FOR EACH NEW SITE YOU VISIT.

That's what it's like when data isn't linked.

slide-27
SLIDE 27

It’s applying the SAME architecture as the WWW of linked documents to… DATA

slide-28
SLIDE 28

First, break DATA into ATOMIC FACTS

slide-29
SLIDE 29

(SUBJECT, PREDICATE, OBJECT)

(Turkey, "is a", Country)
(Ankara, "is a", City)
(Ankara, "is the capital of", Turkey)
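
As a rough illustration, the three atomic facts above can be held as plain (subject, predicate, object) tuples. This is only a sketch: the helper name `facts_about` is made up for this example and is not from the talk.

```python
# Atomic facts from the slide, stored as (subject, predicate, object) tuples.
triples = [
    ("Turkey", "is a", "Country"),
    ("Ankara", "is a", "City"),
    ("Ankara", "is the capital of", "Turkey"),
]


def facts_about(subject, facts):
    """Return every (predicate, object) pair recorded for one subject.

    Hypothetical helper, named here only for illustration.
    """
    return [(p, o) for s, p, o in facts if s == subject]


ankara_facts = facts_about("Ankara", triples)
```

Because every fact has the same three-part shape, one small function is enough to ask what is known about any subject.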

slide-30
SLIDE 30

(SUBJECT, PREDICATE, OBJECT)

slide-31
SLIDE 31

THE TRIPLE

Turkey "is a" Country
Ankara "is a" City
Ankara "is the capital of" Turkey

slide-32
SLIDE 32

Refer to ENTITIES and RELATIONSHIPS via URIs so their MEANINGS can be discussed

slide-33
SLIDE 33

(SUBJECT, PREDICATE, OBJECT)

(http://subject, http://predicate, http://object)

slide-34
SLIDE 34

PUTTING IT TOGETHER

Turkey "is a" Country
Ankara "is a" City
Ankara "is the capital of" Turkey

(dbpedia:Ankara, rdf:type, dbo:City)
(dbpedia:Turkey, rdf:type, dbo:Country)
(dbpedia:Ankara, dbo:capital, dbpedia:Turkey)
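
The prefixed names above are shorthand for full URIs. A minimal sketch of that expansion follows. The slide does not define its prefixes, so the mapping of `dbpedia:` to DBpedia's resource namespace is an assumption, and the function name `expand` is hypothetical.

```python
# Prefix table. The rdf: and dbo: namespaces are the published ones;
# mapping "dbpedia:" to the resource namespace is an assumption,
# since the slide does not spell out its prefixes.
PREFIXES = {
    "dbpedia": "http://dbpedia.org/resource/",
    "dbo": "http://dbpedia.org/ontology/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
}

# Triples copied from the slide, in the slide's own subject/object order.
triples = [
    ("dbpedia:Ankara", "rdf:type", "dbo:City"),
    ("dbpedia:Turkey", "rdf:type", "dbo:Country"),
    ("dbpedia:Ankara", "dbo:capital", "dbpedia:Turkey"),
]


def expand(name):
    """Turn a prefixed name such as 'dbo:City' into a full URI."""
    prefix, local = name.split(":", 1)
    return PREFIXES[prefix] + local


full_triples = [tuple(expand(term) for term in triple) for triple in triples]
```

Once expanded, every subject, predicate, and object is an address that anyone can look up or reuse.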

slide-35
SLIDE 35

Turkey Turkey

slide-36
SLIDE 36

TURKEY vs TURKEY

(dbpedia:Turkey, rdf:type, dbo:Country)
(dbpedia:Turkey_(bird), rdf:type, dbo:Bird)
(dbpedia:Turkey, foaf:name, "Turkey")
(dbpedia:Turkey_(bird), foaf:name, "Turkey")
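
To see why the identifiers matter, here is a small sketch using the four triples from this slide. The name "Turkey" alone is ambiguous, while each identifier points at exactly one thing. The helper names `entities_named` and `types_of` are invented for this example.

```python
# The four triples from the slide, kept as prefixed names.
triples = [
    ("dbpedia:Turkey", "rdf:type", "dbo:Country"),
    ("dbpedia:Turkey_(bird)", "rdf:type", "dbo:Bird"),
    ("dbpedia:Turkey", "foaf:name", "Turkey"),
    ("dbpedia:Turkey_(bird)", "foaf:name", "Turkey"),
]


def entities_named(name, facts):
    """All identifiers whose foaf:name equals the given string."""
    return sorted(s for s, p, o in facts if p == "foaf:name" and o == name)


def types_of(entity, facts):
    """All rdf:type values recorded for one identifier."""
    return [o for s, p, o in facts if s == entity and p == "rdf:type"]


# The label matches two different things...
matches = entities_named("Turkey", triples)
# ...but each identifier resolves to a single, unambiguous type.
kinds = {entity: types_of(entity, triples) for entity in matches}
```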

slide-37
SLIDE 37

“AAA” Principle

ANYONE can say ANYTHING about ANY TOPIC

slide-38
SLIDE 38

Triples are a universal format for representing facts. Any structured data can be mechanically transformed into triples.

CSV TSV XLS XML KML GML “JSON” “GeoJSON” “TopoJSON” BINARY Shapefiles NetCDF

YEA TRIPLES!
slide-39
SLIDE 39

TABULAR DATA AS A GRAPH
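
One way to picture a table as a graph: each row becomes a subject, each column a predicate, and each cell an object. The sketch below assumes a tiny made-up CSV and a hypothetical `http://example.org/` namespace. It is not a standard mapping, and `rows_to_triples` is a name invented for this example.

```python
import csv
import io

# Illustrative table only; the column names and rows are made up.
CSV_TEXT = """id,name,type
Ankara,Ankara,City
Turkey,Turkey,Country
"""

BASE = "http://example.org/"  # hypothetical namespace, not a real vocabulary


def rows_to_triples(text, key_column, base=BASE):
    """Emit one triple per non-key cell: (row URI, column URI, cell value)."""
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = base + "resource/" + row[key_column]
        for column, value in row.items():
            if column == key_column:
                continue  # the key column names the subject itself
            triples.append((subject, base + "property/" + column, value))
    return triples


triples = rows_to_triples(CSV_TEXT, "id")
```

A two-row table with two non-key columns yields four triples. The same loop would work for any delimited file with a column that identifies each row.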

slide-40
SLIDE 40

Why should you make your open data LINKED?

slide-41
SLIDE 41

To make DISCOVERY of your data easier

To make your data INTEROPERABLE

To help the machines learn FASTER

slide-42
SLIDE 42

The End?

slide-43
SLIDE 43

Data can enjoy a “NETWORK EFFECT”

slide-44
SLIDE 44

Each dataset that is added to the network INCREASES the incremental VALUE of every data set in the network
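
A toy sketch of that effect, reusing triples from the earlier slides and splitting them across two hypothetical datasets. Because both datasets use the same identifier for Turkey, merging them answers a question neither could answer alone. The dataset split and the function names are assumptions made for illustration.

```python
# Two hypothetical datasets that share the identifier dbpedia:Turkey.
# The triples themselves are copied from the earlier slides.
dataset_a = {("dbpedia:Ankara", "dbo:capital", "dbpedia:Turkey")}
dataset_b = {("dbpedia:Turkey", "rdf:type", "dbo:Country")}


def objects(subject, predicate, facts):
    """All objects linked from a subject by a given predicate."""
    return {o for s, p, o in facts if s == subject and p == predicate}


def capital_of_types(city, facts):
    """Types of whatever `city` is the capital of (a two-hop question)."""
    result = set()
    for place in objects(city, "dbo:capital", facts):
        result |= objects(place, "rdf:type", facts)
    return result


# Dataset A alone knows the link but not what kind of thing it points to.
alone = capital_of_types("dbpedia:Ankara", dataset_a)
# Merging is a plain set union; the shared identifier does the joining.
linked = capital_of_types("dbpedia:Ankara", dataset_a | dataset_b)
```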
slide-45
SLIDE 45

NETWORK EFFECT

DATA NETWORK

slide-46
SLIDE 46

LINKED DATA is about publishing data as ATOMIC FACTS and using UNIVERSAL IDENTIFIERS to refer to concepts and relationships, so we can agree upon the SEMANTIC MEANING of data.
slide-47
SLIDE 47

Your OPEN DATA wants to be LINKED DATA

slide-48
SLIDE 48

So the PEOPLE and MACHINES who are using that data to solve HUMANITY’S BIGGEST PROBLEMS can leverage the sum of accumulated knowledge as effectively as possible.

slide-49
SLIDE 49

The time to make your OPEN DATA accessible as LINKED DATA is NOW!

slide-50
SLIDE 50

The End

for real