SMS Platform & Linked Geo Data Ali Khalili, Antonis Loizou, - - PowerPoint PPT Presentation

sms platform linked geo data
SMART_READER_LITE
LIVE PREVIEW

SMS Platform & Linked Geo Data Ali Khalili, Antonis Loizou, - - PowerPoint PPT Presentation

SMS Platform & Linked Geo Data Ali Khalili, Antonis Loizou, Peter van den Besselaar Day 1 Vienna, 10-11 September 2015 Agenda SMS Platform Architecture Geo data available in RISIS datasets Exchanging [Linked] Geo data


slide-1
SLIDE 1

SMS Platform & Linked Geo Data

Day 1 Vienna, 10-11 September 2015

Ali Khalili, Antonis Loizou, Peter van den Besselaar

slide-2
SLIDE 2

Agenda

  • SMS Platform Architecture
  • Geo data available in RISIS datasets
  • Exchanging [Linked] Geo data
  • Geo-Services provided by the SMS platform

○ Basic Geo Services ○ Innovative Geo Services based on the new notion of distance

slide-3
SLIDE 3

SMS Architecture

Application Service Data

slide-4
SLIDE 4

http://sms.risis.eu

slide-5
SLIDE 5

http://datasets.risis.eu

slide-6
SLIDE 6

Geo data available in RISIS datasets

slide-7
SLIDE 7

Entity Types

slide-8
SLIDE 8

Geographical attributes available per entity type in each dataset

Country Name Address Postal Code NUTS Lat/Long CIB Firm Person, Firm ETER Institution Institution Institution Institution Institution EUPRO Person, Organisation JOREP Beneficiary, Funding Programme Funding Body LeidenRank University MORE1 Person Nano Person, Institution Person, Institution Person, Institution Profile Person SIPER VICO Patent Firm Firm

slide-9
SLIDE 9

Exchanging [Linked] Geo data

slide-10
SLIDE 10

Open Geospatial Consortium

Collaboration effort between more than 400 commercial, non-profit, governmental, and research organizations worldwide

DIVERSE LANDSCAPE OF STANDARDS

slide-11
SLIDE 11

Geodata may play a vital role in connecting and aggregating data from multiple (possibly external) sources based on geospatial references that they have in common. Semantic Web enables the use of implicit geodata on top of explicit geodata : ➢ Explicit Geodata; Direct geospatial references [ e.g. coordinates, surfaces, and distances ] ➢ Implicit Geodata; Indirect geospatial references requiring lookup [ e.g. place name -> (lookup) -> coordinates ]

GEODATA AS A SPINE

slide-12
SLIDE 12

Basic (RDF) Geo Vocabulary

WGS84 – Latitude, Longitude & Altitude W3 Recommendation

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"> <geo:Point> <geo:lat>55.701</geo:lat> <geo:long>12.552</geo:long> </geo:Point> </rdf:RDF>

slide-13
SLIDE 13
  • <country>
  • <countryCode>NL</countryCode>
  • <countryName>Netherlands</countryName>
  • <isoNumeric>528</isoNumeric>
  • <isoAlpha3>NLD</isoAlpha3>
  • <fipsCode>NL</fipsCode>
  • <continent>EU</continent>
  • <continentName>Europe</continentName>
  • ….
  • <west>3.362556</west>
  • <north>53.512196</north>
  • <east>7.227944</east>
  • <south>50.753918</south>
  • <postalCodeFormat>#### @@</postalCodeFormat>
  • </country>

GeoNames [ Bounding box ]

cont.

slide-14
SLIDE 14

Flickr [ESRI Shapefile]

  • <place place_id="4hLQygSaBJ92" woeid="3534"
  • latitude="45.512" longitude="-73.554"
  • place_url="/The_Netherlands" place_type="locality"
  • name="The Netherlands"
  • has_shapedata="1">
  • <!-- all the usual places hierarchy elements -->
  • <shapedata created="1223513357" alpha="0.012359619140625"
  • count_points="34778" count_edges="52">
  • <polylines>
  • <polyline>

40.702243804932,19.945163726807 40.70276260376,19.940700531006 ,...

</polyline>

  • </polylines>
  • </shapedata>
  • </place>
slide-15
SLIDE 15

<http://sws.geonames.org/3186084/> skos:exactMatch <http://www.flicker.com/place/.AkiwWpVVLNcL1o> ; geo:geometry "POLYGON((19.945163726807 40.702243804932, 19.940700531006 40.70276260376, etc...))

"^^virtrdf:Geometry .

Virtuoso [data type virtrdf:Geometry ]

slide-16
SLIDE 16

@prefix geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix ex: <http://example.org/> . @prefix ngeo: <http://geovocab.org/geometry#> . _:polygon rdf:type ngeo:Polygon ; ngeo:exterior [ rdf:type ngeo:LinearRing ; ngeo:posList ( [ geo:lat "-29"; geo:long "16" ] [ geo:lat "-28"; geo:long "33" ] [ geo:lat "-34"; geo:long "27" ] [ geo:lat "-35"; geo:long "19" ] [ geo:lat "-29"; geo:long "16" ] ) ] ; ngeo:interior [ rdf:type ngeo:LinearRing ; ngeo:posList ( [ geo:lat "-29.5"; geo:long "27" ] [ geo:lat "-28.5"; geo:long "28.5" ] [ geo:lat "-29.5"; geo:long "29.5" ] [ geo:lat "-31"; geo:long "28" ] [ geo:lat "-29.5"; geo:long "27" ] ) ] .

Geo vocab

slide-17
SLIDE 17

GeoSPARQL

Prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> Prefix ogc: <http://www.opengis.net/ont/geosparql#> Prefix geom: <http://geovocab.org/geometry#> Prefix lgdo: <http://linkedgeodata.org/ontology/> Select ?school ?schoolLabel ?coffeeShop ?coffeeShopLabel From <http://linkedgeodata.org> { ?school a lgdo:School ; rdfs:label ?schoolLabel ; geom:geometry [

  • gc:asWKT ?schoolGeo

] .

?coffeeShop a lgdo:CoffeeShop ; rdfs:label ?coffeeShopLabel ; geom:geometry [

  • gc:asWKT ?coffeeShopGeo

] .

Filter ( bif:st_intersects (?schoolGeo, bif: st_point (4.892222, 52.373056), 5) && bif:st_intersects (?coffeeShopGeo, ?schoolGeo, 1) ) . }

Example Find all schools within a 5km radius around a specific location, and for each school find coffeeshops that are closer than 1km.

Offers

➢ A small ontology ➢ An extension to SPARQL

slide-18
SLIDE 18

SMS Platform & Linked Geo Data

Day 2 Vienna, 10-11 September 2015

Ali Khalili, Antonis Loizou, Peter van den Besselaar

slide-19
SLIDE 19

Agenda

  • Geo-Services provided by the SMS platform

○ Basic Geo Services

■ Geocoding ■ Conversion between various Geodata classifications ■ Geographical boundary-related services

○ Innovative Geo Services based on the new notion of distance

slide-20
SLIDE 20
slide-21
SLIDE 21

Geo data representation on the SMS Platform

  • Uses Virtuoso representation internally

○ performance ○ convenience

  • Provides a service that can ingest multiple

formats into Virtuoso.

  • Provides an export service from Virtuoso to

user-specified formats.

slide-22
SLIDE 22

Geocoding

Basic Geo Services

slide-23
SLIDE 23

Geo-coding

  • Converts an address (or place name)

to geo coordinates

○ can be a point or a polygon

  • Most RISIS datasets do not store

geo coordinates

○ augments RISIS datasets with useful information ○ enables calculating distance between RISIS entities ○ enables validation of NUTS regions (i.e. does the point lie inside the polygon)

Lat/Long CIB ETER Institution EUPRO JOREP LeidenRank MORE1 Nano Firm Profile SIPER VICO

slide-24
SLIDE 24

Geo-coding SMS services

  • Input: List of (partial) addresses
  • Output: [Address, lat/long, Postal

code, Country, NUTS, ...]

  • Implemented by wrapping 3rd party

services ○ e.g. Google Geocoding API, Bing, MapQuest, Yahoo! Place Finder, etc.

Lat/Long CIB ETER Institution EUPRO JOREP LeidenRank MORE1 Nano Firm Profile SIPER VICO

slide-25
SLIDE 25

Geo-coding

http://lda-apps.risis.ops.few.vu.nl/geocode/{address} For example: http://lda-apps.risis.ops.few.vu.nl/geocode/VU%20University%20Amsterdam

slide-26
SLIDE 26

Reverse Geo-coding

  • Converts geo coordinates to an address

○ can be a point or a polygon

  • Provides additional validation opportunities.

SMS services

  • Input: List of geo-coordinates
  • Output: [Address, Postal code, Country, NUTS, …]
slide-27
SLIDE 27

Conversions between various Geo representations and classifications

Basic Geo Services

slide-28
SLIDE 28

NUTS Regions

  • Hierarchy of NUTS regions

○ Available for download ○ Facilitates aggregation ■ e.g. Return all entities in a specified NUTS region, including all sub- regions

  • Polygons

○ generated by EnAKTing project in 2011 ○ e.g. Netherlands ○ Facilitates integration ■ Return all entities whose geo coordinates fall inside a NUTS region ■ Return the NUTS region for a list of coordinates/addresses.

slide-29
SLIDE 29

ISO 3166-2

  • 2 letter country codes
  • Can be mapped to NUTS easily

○ First 2 NUTS characters are the ISO 3166-2 ○ UK is the exception (NUTS: UK , ISO: GB)

  • Polygons for ISO 3162-2 are also available
  • nline

○ http://www.gadm.org/

slide-30
SLIDE 30

LAU Regions

  • Local Administrative Unit
  • LAU-1 and LAU-2, previously called NUTS-4 and NUTS-

5

  • low level administrative division of a country, ranked

below a province, region, or state.

  • Important when analyzing OECD FUAs.
  • LAU-2, consists of municipalities or equivalent units in the 28 EU

Member States

  • We are still negotiating to acquire shapefiles for LAU-1 and LAU-2
slide-31
SLIDE 31

OECD Urban Functional Areas

  • OECD.stat provides comprehensive

statistics for each area

  • ETL required to obtain geo polygons for

these areas

○ (to the best of our knowledge)

slide-32
SLIDE 32
  • Named Entity Recognition

○ e.g. DBPedia Spotlight

  • DBPedia provides the ISO 3166-2 code for

each country

Country Names

slide-33
SLIDE 33

Point to NUTS

  • For a given point, returns the associated NUTS region(s)
  • Example: http://api.risis.ops.few.vu.nl/PointToNUTS?long=4.865720&lat=52.333757
slide-34
SLIDE 34

Point to NUTS

http://lda-apps.risis.ops.few.vu.nl/PointToNUTS/4.8650/52.3339

slide-35
SLIDE 35

Geographical boundary- related services

Basic Geo Services

slide-36
SLIDE 36

Boundary-related services

For a given region, returns the list of entities of interest (e.g. Municipalities or PROs)

  • NUTS to Municipality
  • http://api.risis.ops.few.vu.nl/NUTStoMunicipality/{NUTS code} e.g. http://api.risis.ops.few.vu.

nl/NUTStoMunicipality/NL326

slide-37
SLIDE 37

NUTS to Polygon

  • For a given NUTS region, returns the bounding polygon (i.e. a set of ordered coordinates)
  • Example: http://api.risis.ops.few.vu.nl/NUTStoPolygon/NL326
slide-38
SLIDE 38

NUTS to Polygon

http://lda-apps.risis.ops.few.vu.nl/NUTS/NL326

slide-39
SLIDE 39

Combining API calls

Basic Geo Services

slide-40
SLIDE 40

Address to Municipality

http://lda-apps.risis.ops.few.vu.nl/addressToMunicipality

slide-41
SLIDE 41

New notion of distance driven by Linked Open Data

Innovative Geo Services

slide-42
SLIDE 42

Space is more than geography!

Distance is not uni-dimensional as there can be many facets of distance.

*For example, culturally and economically similar countries may lie far away in geographical terms.

*SOURCE: Bureau of Economic Analysis, CIA World Factbook

slide-43
SLIDE 43

Transportation Time and Cost

slide-44
SLIDE 44

Cultural Factors (e.g. Languages, religious

beliefs, traditions, laws, etc. )

slide-45
SLIDE 45

Climate Factors & Time Zones

slide-46
SLIDE 46

Map of scientific collaboration

http://olihb.com/2014/08/11/map-of-scientific-collaboration-redux/

slide-47
SLIDE 47

Semantic Web & (Linked) Open Data

  • pportunities in defining a multidimensional

notion of distance

slide-48
SLIDE 48

Wikipedia ->

Example: List of countries ranked by ethnic and cultural diversity level

https://en.wikipedia.org/wiki/List_of_countries_ranked_by_ethnic_and_cultural_diversity_level

slide-49
SLIDE 49

OpenFlights Data

  • Airports, Airlines, Routes, Schedules
slide-50
SLIDE 50

World Bank Linked Data

  • World Bank Indicators (GDP, Educational stat.,

Gender stat., etc.)

  • World Bank Finances
  • World Bank Projects and Operations
  • World Bank Climate Change
slide-51
SLIDE 51

New notion of distance: Summary

Input:

  • list of places
  • distance metrics

Output:

  • distance matrices

Process:

  • query open data on metrics
slide-52
SLIDE 52

SMS Architecture

slide-53
SLIDE 53

Time for

Feedback

Linked Data API Documentation:

  • https://github.com/risis-eu/RISIS_LinkedDataAPI
  • https://github.com/risis-eu/LDA-apps