Proving the Viability of RDF as a Universal Healthcare Exchange - - PowerPoint PPT Presentation

proving the viability of rdf as a universal healthcare
SMART_READER_LITE
LIVE PREVIEW

Proving the Viability of RDF as a Universal Healthcare Exchange - - PowerPoint PPT Presentation

- DRAFT - Proving the Viability of RDF as a Universal Healthcare Exchange Language David Booth, Ph.D. Latest version of these slides: http://dbooth.org/2014/proving/ See also associated paper Imagine a world in which all healthcare systems


slide-1
SLIDE 1
  • DRAFT -

Proving the Viability of RDF as a Universal Healthcare Exchange Language

David Booth, Ph.D. Latest version of these slides: http://dbooth.org/2014/proving/ See also associated paper

slide-2
SLIDE 2

Imagine a world

in which all healthcare systems speak the same language with the same meanings covering all healthcare.

slide-3
SLIDE 3

Semantic interoperability

Definition: The ability of computer systems to exchange data with unambiguous, shared meaning. – Wikipedia

slide-4
SLIDE 4

Healthcare today

Tower of Babel, Abel Grimmer (1570-1619)

slide-5
SLIDE 5
slide-6
SLIDE 6

PCAST report: "It is crucial that the Federal Government facilitate the nationwide adoption of a

universal exchange language

for healthcare information"

slide-7
SLIDE 7

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 1. RDF is the best available candidate for a universal healthcare exchange language.
  • 2. Electronic healthcare information should be exchanged in a format that either: (a) is an

RDF format directly; or (b) has a standard mapping to RDF.

  • 3. Existing standard healthcare vocabularies, data models and exchange languages should

be leveraged by defining standard mappings to RDF, and any new standards should have RDF representations.

  • 4. Government agencies should mandate or incentivize the use of RDF as a universal

healthcare exchange language.

  • 5. Exchanged healthcare information should be self-describing, using Linked Data principles,

so that each concept URI is de-referenceable to its free and open definition

See http://YosemiteManifesto.org/

slide-8
SLIDE 8

What is RDF?

  • "Resource Description Framework"

– But think "Reusable Data Framework"

  • Language for representing information
  • Vendor-neutral international standard by W3C
  • Mature – 10+ years
  • Used in many domains, including biomedical and

pharma Why?

slide-9
SLIDE 9

Key things you need to know about RDF

#1: RDF is unambiguously self describing #2: RDF data is easy to meaningfully merge #3: RDF is easy to create from other data formats #4: RDF captures data meaning – not syntax #5: RDF enables multiple data models and vocabularies to be readily combined and interrelated #6: RDF facilitates smarter queries #7: RDF facilitates transformations between data models and vocabularies See: Key Things You Need to Know About RDF http://dbooth.org/2014/key/

slide-10
SLIDE 10

See: Key Things You Need to Know About RDF http://dbooth.org/2014/key/

slide-11
SLIDE 11

Why RDF as a Universal Healthcare Exchange Language?

All the preceding reasons, plus:

  • Emphasis on meaning and precise semantics – not syntax
  • Easy to map existing formats to RDF
  • Excellent for semantically connecting diverse vocabularies and

data models – Multi-schema friendly Non-technical:

  • Vendor-neutral international standard (W3C)
  • Mature – 10+ years
  • Supports standards and innovation
slide-12
SLIDE 12

Why is this important?

  • Healthcare involves many thousands of

concepts

– Over 120,000 in SNOMED alone – Nearly 3 million in UMLS

  • New concepts continually being defined and

re-defined

  • Critical to easily find authoritative, shared

definitions

slide-13
SLIDE 13

Why is this important?

  • Healthcare information exists in many

locations, formats, data models and vocabularies

  • Need to integrate information for:

– Better patient care – Better quality measurement – Better research

slide-14
SLIDE 14

The solution is standards! (Isn't it?) And yet, somehow standards do not seem to solve the problem . . . .

slide-15
SLIDE 15

See: Why Standards are Not Enough to Solve Healthcare's Interoperability Problem (And How RDF Can Help) http://dbooth.org/2014/standards/

slide-16
SLIDE 16

Why standards and innovation?

  • Dilemma: Standards are necessary for semantic

interoperability, BUT standards are a moving target

– Medical science and technology are continually changing – Medicine is too big and diverse to suddenly adopt

  • ne monster standard

– UMLS lists over 100 vocabularies!

  • Change is the norm!
slide-17
SLIDE 17

Embracing standards and innovation

A universal healthcare exchange language MUST:

  • Leverage existing and future standards – both de facto and

de jure

  • Support decentralized innovation – new terms, vocabularies

and data models

  • Allow continual incorporation of new standards
  • Support a graceful transition from innovation to standard –

without re-tooling

  • RDF is the best available candidate!
slide-18
SLIDE 18

Semantic alignment

  • @@ Illustrate four cases in transforming model A to

B: @@

  • A and B hold the same information – Lossless
  • A holds more information than B – Lossy
  • B holds more information than A – Cannot transform
  • A and B hold overlapping information – Go broader,

then transform lossy

slide-19
SLIDE 19

Pre-coordinated vs. post-coordinated

  • @@ TODO: Get SNOMED example @@
slide-20
SLIDE 20

RDF example

  • Same information

RDF (Turtle)

@prefix ex: <http://example/ex/> . @prefix loinc: <http://loinc.org/> . @prefix v: <http://example/v/> . ex:obs_001 a v:Observation ; v:code loinc:3727-0 ; v:display "BPsystolic, sitting" ; v:value 120 ; v:units v:mmHg .

RDF drawn as a graph RDF (N-Triples)

<http://example/ex/obs_001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://example/v/Observation> . <http://example/ex/obs_001> <http://example/v/code> <http://loinc.org/3727-0> . <http://example/ex/obs_001> <http://example/v/display> "BPsystolic, sitting" . <http://example/ex/obs_001> <http://example/v/value> "120"^^<http://www.w3.org/2001/XMLSchema#integer> . <http://example/ex/obs_001> <http://example/v/units> <http://example/v/mmHg> .

slide-21
SLIDE 21

Same information, different formats

  • Can be represented as the same RDF!

OBX|1|CE|3727-0^BPsystolic, sitting||120||mmHg| <Observation xmlns="http://hl7.org/fhir"> <system value="http://loinc.org"/> <code value="3727-0"/> <display value="BPsystolic, sitting"/> <value value="120"/> <units value="mmHg"/> </Observation>

HL7 v2.x FHIR As RDF graph

slide-22
SLIDE 22

Why is this important?

  • Multiple data models and vocabularies can be:

– dynamically added – used together harmoniously

  • This is critical in domains that involve many or changing data

models/vocabularies

  • Even standards are are not static!

– Standards are continually revised or they become obsolete

  • Example: Unified Medical Language System (UMLS) includes
  • ver 100 standard vocabularies and millions of concepts!
slide-23
SLIDE 23

What would it mean to use RDF?

  • Healthcare information is exchanged as

RDF, using:

– Standard models and vocabularies whenever possible – Other models and vocabularies when necessary

  • Existing representations can still be used

internally

slide-24
SLIDE 24

How would it work? (Naive view)

If Sender and Receiver use the same data model and vocabularies:

  • Sender converts to RDF
  • Receiver converts from RDF

Sender

HL7 v2.x

Receiver

FHIR

RDF

To RDF From RDF

slide-25
SLIDE 25

Semantic alignment

  • Often semantic alignment is required:
  • RDF acts as a common substrate

language

Sender

HL7 v2.x

To RDF From RDF

Receiver

FHIR

Syntactic Syntactic RDF to RDF m1 m1 Semantic m2

slide-26
SLIDE 26

Same format, same information, but different vocabularies and data models

  • Transformation or inference is needed to

get from one to the other!

slide-27
SLIDE 27

How can we achieve semantic interoperability?

slide-28
SLIDE 28

Data transformation

  • Sometimes transformation cannot be

avoided

Sender

HL7 v2.x

Receiver

FHIR Transform

slide-29
SLIDE 29

Current interface engines

Sender

HL7 v2.x

Receiver

FHIR Acme Black Box Transforms

$$$

slide-30
SLIDE 30

Inside the Black Box . . .

Syntactic Transformation (proprietary) Syntactic Transformation (proprietary) Semantic Transformation (proprietary)

  • Converts to/from required formats on output/input
  • Typically uses proprietary schema-and-format-independent

information representation for semantic transformations

HL7 v2.x FHIR

slide-31
SLIDE 31

Data transformation

Sender

HL7 v2.x

Receiver

FHIR Transform

OBX|1|CE|3727-0^BPsystolic, sitting||120||mmHg| <Observation xmlns="http://hl7.org/fhir"> <system value="http://loinc.org"/> <code value="8580-6"/> <display value="Systolic BP"/> <value value="120"/> <units value="mm[Hg]"/> </Observation>

slide-32
SLIDE 32

Syntactic and Semantic Transformations

  • RDF acts as a common substrate

language

  • Based on URIs as universal identifiers

Sender

HL7 v2.x

To RDF From RDF

Receiver

FHIR

Syntactic Syntactic RDF to RDF m1 m1 Semantic m2

slide-33
SLIDE 33

Syntactic transformation to RDF

d1:obs042 a m1:PatientObservation ; m1:code "3727-0" ; m1:description "BPsystolic, sitting" ; m1:value 120 ; m1:units "mmHg" . OBX|1|CE|3727-0^BPsystolic, sitting||120||mmHg|

RDF

Sender

HL7 v2.x

To RDF m1 m1

slide-34
SLIDE 34

Semantic transformation RDF-to-RDF

CONSTRUCT { ?observation a m2:Observation ; a m2:BP_systolic ; m2:value ?value ; m2:units m3:mmHg ; m2:position m3:sitting . } WHERE { ?observation a m1:PatientObservation ; m1:code "3727-0" ; m1:value ?value ; m1:units "mmHg" . }

RDF to RDF m1 m1 m2

slide-35
SLIDE 35

Syntactic transformation from RDF

RDF

d2:obs-091 a m2:Observation ; m2:system "http://loinc.org/" ; m2:code "8580-6" ; m2:display "Systolic BP" ; m2:value 107 ; m2:units "mm[Hg]" . <Observation xmlns="http://hl7.org/fhir"> <system value="http://loinc.org/"/> <code value="8580-6"/> <display value="Systolic BP"/> <value value="107"/> <units value="mm[Hg]"/> </Observation>

Receiver

FHIR

From RDF m2

slide-36
SLIDE 36

Recipe for semantic interoperability

  • 1. Capture structured information, to enable machine processing.
  • 2. Use standard vocabularies whenever possible.
  • 3. Continually expand and update the set of acceptable standards.
  • 4. RDF-enable exchanged data.
  • 5. Include all relevant data – even data that has not yet been standardized.
  • 6. Map existing and new healthcare information standards to RDF.
  • 7. Make all RDF data be self-describing (as Linked Data), using URIs that can

be dereferenced to their definitions.

  • 8. Use free and open vocabularies for data exchange.
  • 9. Enact incentives for semantic interoperability.
slide-37
SLIDE 37

How can we represent these transformations?

slide-38
SLIDE 38

Many ways . . .

  • Transformations can be any kind of rules or functions
  • Declarative style

– Ontologies

v:AorticValve rdfs:subClassOf v:HeartValve .

  • Procedural style

– Rules { ?x a v:AorticValve . } => { ?x a v:HeartValve . } – Programs, e.g., Python, Java, C, etc.

slide-39
SLIDE 39

Where can we get these transformations?

slide-40
SLIDE 40

Transformation Definition Repository

  • Transformations (rules & functions) can be upload & downloaded
  • Collaborative – can be crowd sourced
  • Repository keeps versions and metadata
  • Could be used to lookup appropriate transformation both

manually and automatically

slide-41
SLIDE 41

Using Transformation Definitions

Map/ Translate Input Format RDF Output Format Mapping/ Translation Definitions To RDF From RDF

slide-42
SLIDE 42

Transformation Definition Repository

  • Instance data is transmitted peer-to-peer
  • Recipient downloads transformations from hub for

unknown data models and vocabularies

  • 2. Download

transformations

  • 1. Receive instance data
  • 3. Apply transforms

Transformation Definition Repository Sender Receiver

slide-43
SLIDE 43

Example scenario

  • Sender:

– Transforms internal format to RDF – Provides instance data in RDF – Class and property URIs indicate the vocabularies/data models used – Class and property URIs MUST be dereferenceable to definitions, i.e., as Linked Data

  • Receiver:

– Receives RDF data, and uses the wiki to lookup transformations for vocabularies / data models it does not understand – Downloads the desired transformations – Applies the transformations to the instance data

  • Instance data is now semantically aligned with receiver's ontology

– Transforms from RDF to internal format

slide-44
SLIDE 44

Transformation metadata

  • Transformation identified by URIs
  • Indicates:

– Source vocabularies/data models – Target vocabularies/data models

  • Includes usage measure/ratings, e.g.:

– Objective: Number of downloads, Author, Date, etc. – Subjective: Who/how many like it, reviews, etc.

  • License information? – TBD

– E.g., allow commercial transformations?

slide-45
SLIDE 45

Next steps

  • RDF is the "Best available candidate":

– Lots of uses, including in healthcare – Lots of believers: http://YosemiteManifesto.org/

  • It is time to move forward quickly.
slide-46
SLIDE 46

Questions?

slide-47
SLIDE 47

BACKUP SLIDES

slide-48
SLIDE 48

Why RDF? - Technical

  • Semantics, not syntax
  • RDF is syntax independent
  • RDF captures the information content
  • Multi-schema friendly
  • Multiple models, granularities and vocabularies can co-exist,

semantically interrelated

  • Designed for web-scale data integration
  • Self describing
  • Uses URIs as unique term and model identifiers
  • Term and model URIs can be dereferenceable to authoritative

definitions

slide-49
SLIDE 49

Why RDF? - Non-technical

  • Supports standards and innovation
  • Leverage existing & future standards
  • Accommodate new models and vocabularies,

with a graceful path toward standardization

  • Vendor-neutral international standard (W3C)
  • Mature
  • 10+ years
  • Used in a wide range of domains
slide-50
SLIDE 50

Why not XML?

  • XML places too much emphasis on syntax

– But it's the information that matters

  • Meaning is implicit

– E.g., what does nesting mean?

  • XML is schema centric – not multi-schema friendly:

– Different schemas compete in XML – they do not co-exist well

  • Thought experiment: Integrate 5 different XML models.

Good luck! :)

slide-51
SLIDE 51

Why not HL7?

  • Meaning is implicit
  • Too much emphasis on data transport and

syntax HOWEVER:

  • HL7 can be leveraged by mapping to RDF
slide-52
SLIDE 52

Why not JSON?

  • Meaning is implicit
  • JSON is not self-describing

– The same term may have different meaning in different contexts – (Compare RDF's use of unambiguous URIs)

  • JSON is schema-centric (not multi-schema friendly)

HOWEVER:

  • JSON is a very convenient syntax, and can be used as an RDF

serialization (JSON-LD)

  • Thought experiment: Integrate 5 JSON data models. It's easier

than in XML, but still harder than in RDF.

slide-53
SLIDE 53

Why is it so difficult to standardize?

  • Healthcare information is complex
  • Lack of incentive
  • Standardization takes time

– Progress goes toward zero as committee size grows

  • Moving target: medical science and

technology continually changing

slide-54
SLIDE 54

Issues

  • How to incent contributions of

transformations?

  • How to provide objective measures of

quality? E.g., number of downloads, who is using which transforms, etc.

  • Licensing: Allow commercial

transformations too?

slide-55
SLIDE 55

Modeling steps

1.Model existing data – as it is

  • Start with the data you know you need

2.Model desired data or queries – as they are

  • Start with what you know you need

3.Choose mappings or bridge models

  • Rules, hub ontologies, etc.

4.Iterate

slide-56
SLIDE 56

Issue: How to know if unrecognized data is needed?

  • Party B receives data from party A. Part of that

data is in an unknown model

– Solution: Metadata?

  • Party A needs to indicate what data is available

– Solution: Data summary? – E.g. # triples of each predicate, class, MB, etc. – MB might be helpful for images

slide-57
SLIDE 57
slide-58
SLIDE 58
slide-59
SLIDE 59
slide-60
SLIDE 60

Negotiating natural language

I speak:

  • English
  • French
  • German

I understand:

  • English
  • French
  • German
  • Mandarin
slide-61
SLIDE 61

Negotiating healthcare language

I speak:

  • http://...SNOMED
  • http://...LOINC
  • http://...ICD9
  • http://...3MHDD
  • http://...ACME7

I understand:

  • http://...SNOMED
  • http://...ACME7
  • Identified by URI
  • Represented in RDF
slide-62
SLIDE 62

Standardization

Standard

  • PROS: Most efficient; desirable whenever possible

– Only need n transformations instead of (n-1)*(n-1)

  • CONS: Infeasible when committee/standard gets too big
slide-63
SLIDE 63

Standards and diversity

Std 1

  • Cannot stop the world to wait for

standardization!

Std 2 Std 3

slide-64
SLIDE 64

Key requirements

  • Continually incorporate new vocabularies

and data models

  • Support existing and future healthcare

standards

  • Support decentralized innovation
slide-65
SLIDE 65

Why include non-standard concepts?

  • Important to send all requested

information in machine-processable form

  • Receiver may be able to use it
  • Helps bootstrap standardization
slide-66
SLIDE 66

Additional requirements for graceful adoption of new concepts

  • Enable new concepts to be semantically

linked to existing ones

  • Enable authoritative definitions of new

concepts to be obtained automatically Best available candidate: RDF

slide-67
SLIDE 67

Why RDF?

slide-68
SLIDE 68

Why RDF?

1.Semantics, not syntax

slide-69
SLIDE 69

Why RDF?

1.Semantics, not syntax 2.Self describing – derefenceable URIs

slide-70
SLIDE 70

Why RDF?

1.Semantics, not syntax 2.Self describing 3.Schema promiscuous

slide-71
SLIDE 71

Why RDF?

Schema promiscuous

  • Blue App has model

Country Address FirstName LastName Email City ZipCode Blue Model

slide-72
SLIDE 72

Why RDF?

Schema promiscuous

  • Red App has model

HomePhone Town ZipPlus4 FullName Country Red Model

slide-73
SLIDE 73

Why RDF?

Schema promiscuous

  • Merge RDF data
  • Same nodes (URIs) join automatically

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Country

slide-74
SLIDE 74

Why RDF?

Schema promiscuous

  • Add relationships and rules
  • (Relationships are also RDF)

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Country subClassOf sameAs hasLast hasFirst

slide-75
SLIDE 75

Why RDF?

Schema promiscuous

  • Later add Green model

(Using Red & Blue models)

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country subClassOf sameAs hasLast hasFirst

Multiple models peacefully coexist

slide-76
SLIDE 76

Why RDF?

Schema promiscuous

  • What the Blue app sees:

– No difference!

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country Country Address FirstName LastName Email City ZipCode Blue Model Country

slide-77
SLIDE 77

Why RDF?

Schema promiscuous

  • What the Red app sees

– No difference!

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country HomePhone Town ZipPlus4 FullName Country Red Model

slide-78
SLIDE 78

Why RDF?

Schema promiscuous

  • What the Green app sees

– No difference!

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country HomePhone Town ZipPlus4 Country FirstName LastName Email Green Model Country

slide-79
SLIDE 79

Why RDF?

1.Semantics, not syntax 2.Self describing 3.Schema promiscuous 4.Neutral, mature, international standard

slide-80
SLIDE 80

Why RDF?

1.Semantics, not syntax 2.Self describing 3.Schema promiscuous 4.Neutral, mature, international standard Best available candidate for a universal healthcare exchange language!

slide-81
SLIDE 81

How?

slide-82
SLIDE 82

Semantic interoperability involves data transformations

How?

Sender1 Sender2

HL7 v2.x FHIR

Receiver

CSV Universal Healthcare Exchange Language

slide-83
SLIDE 83

Syntactic and Semantic Transformations

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

Syntactic Syntactic RDF to RDF RDF to RDF m1 m2 m1 m3 Semantic

RDF

slide-84
SLIDE 84

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Sender1 data: HL7 v2.x

(Fictitious examples for illustration) OBX|1|CE|3727-0^BPsystolic, sitting||120||mmHg|

Sender1

HL7 v2.x

slide-85
SLIDE 85

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Sender2 data: FHIR

(Fictitious example for illustration)

Sender2

FHIR

<Observation xmlns="http://hl7.org/fhir"> <system value="http://loinc.org"/> <code value="8580-6"/> <display value="Systolic BP"/> <value value="107"/> <units value="mm[Hg]"/> </Observation>

slide-86
SLIDE 86

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Receiver data expected: RDF

d1:obs042 a m3:Observation ; a m3:BP_systolic ; m3:value 120 ; m3:units m3:mmHg ; m3:position m3:sitting . d2:obs-091 a m3:Observation ; a m3:BP_systolic ; m3:value 107 ; m3:units m3:mmHg .

Receiver

CSV

slide-87
SLIDE 87

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Step 1: Syntactic transformation

  • Transform from source format to substrate model (RDF)
  • Allows data to be merged
  • Data may not join semantically due to differing vocabularies

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF Syntactic m1 m2

RDF

slide-88
SLIDE 88

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Sender1 syntactic transformation

d1:obs042 a m1:PatientObservation ; m1:code "3727-0" ; m1:description "BPsystolic, sitting" ; m1:value 120 ; m1:units "mmHg" . OBX|1|CE|3727-0^BPsystolic, sitting||120||mmHg|

RDF

Sender1

HL7 v2.x

To RDF Syntactic m1 m1

slide-89
SLIDE 89

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Sender2 syntactic transformation

RDF

d2:obs-091 a m2:Observation ; m2:system "http://loinc.org/" ; m2:code "8580-6" ; m2:display "Systolic BP" ; m2:value 107 ; m2:units "mm[Hg]" . <Observation xmlns="http://hl7.org/fhir"> <system value="http://loinc.org/"/> <code value="8580-6"/> <display value="Systolic BP"/> <value value="107"/> <units value="mm[Hg]"/> </Observation>

Sender2

FHIR

To RDF m2

slide-90
SLIDE 90

Step 2: Semantic Transformations

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

Syntactic Syntactic RDF to RDF RDF to RDF m1 m2 m1 m3 Semantic

RDF

slide-91
SLIDE 91

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Sender1 semantic transformation

CONSTRUCT { ?observation a m3:Observation ; a m3:BP_systolic ; m3:value ?value ; m3:units m3:mmHg ; m3:position m3:sitting . } WHERE { ?observation a m1:PatientObservation ; m1:code "3727-0" ; m1:value ?value ; m1:units "mmHg" . }

RDF to RDF m1 m1 m3

slide-92
SLIDE 92

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Sender2 semantic transformation

CONSTRUCT { ?observation a m3:Observation ; a m3:BP_systolic ; m3:value ?value ; m3:units m3:mmHg . } WHERE { ?observation a m2:Observation ; m2:system "http://loinc.org/" ; m2:code "8580-6" ; m2:value ?value ; m2:units "mm[Hg]" . }

RDF to RDF m2 m3

slide-93
SLIDE 93

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

Merged RDF

d1:obs042 a m3:Observation ; a m3:BP_systolic ; m3:value 120 ; m3:units m3:mmHg ; m3:position m3:sitting . d2:obs-091 a m3:Observation ; a m3:BP_systolic ; m3:value 107 ; m3:units m3:mmHg .

  • m3 can be understood by Receiver
  • Ready for syntactic transform to CSV

To CSV

Receiver

CSV

Syntactic m3

slide-94
SLIDE 94

Summary of transformations

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

Syntactic Syntactic RDF to RDF RDF to RDF m1 m2 m1 m3 Semantic

RDF Ideally, transformations should be standardized

slide-95
SLIDE 95

Proprietary vocabularies

  • Impede semantic interoperability
  • Exchanged healthcare information should

be based on free and open vocabularies

– But proprietary can be used internally

slide-96
SLIDE 96

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 1. RDF is the best available candidate for a universal healthcare exchange

language.

  • 2. Electronic healthcare information should be exchanged in a format that

either: (a) is an RDF format directly; or (b) has a standard mapping to RDF.

  • 3. Existing standard healthcare vocabularies, data models and exchange

languages should be leveraged by defining standard mappings to RDF, and any new standards should have RDF representations.

  • 4. Government agencies should mandate or incentivize the use of RDF as a

universal healthcare exchange language.

  • 5. Exchanged healthcare information should be self-describing, using Linked

Data principles, so that each concept URI is de-referenceable to its free and

  • pen definition.

Sign at http://YosemiteManifesto.org/

slide-97
SLIDE 97

Research needed to prove feasibility

  • Build and demonstrate a reference implementation

– At least two senders and one receiver

  • Demonstrate all important features:

– Syntactic & semantic transformations – Selecting and applying transformations – Incorporate new vocabularies & deprecate old – Privacy & security – Hosting concept definitions

  • Run stress tests to simulate scaling to nationwide adoption
  • Recommend conventions
slide-98
SLIDE 98

Data Transformation Wiki

WIKITRANSFORMIA

For Health Data Languages Upload Upload Lookup / Download Lookup / Download

slide-99
SLIDE 99

What would it be like?

  • Better treatment
  • Better research
  • Lower cost

Goal: True semantic interoperability

slide-100
SLIDE 100

What does semantic interoperability involve?

  • Machine processable information
  • Common vocabularies
  • Unambiguous concepts
slide-101
SLIDE 101

Why is this so difficult to standardize?

  • Healthcare is complex: thousands of

interrelated concepts, many domains

  • Standardization progress diminishes

toward zero as committee size grows

  • Moving target: medical science and

technology continually changing

slide-102
SLIDE 102

Unambiguous concepts

slide-103
SLIDE 103

Semantic interoperability

  • Requires standardization

Sender1 Sender2 Receiver

slide-104
SLIDE 104

Assumption: Not standardizing internal systems

  • Not politically feasible
  • Too costly
  • Unwise: would inhibit innovation
slide-105
SLIDE 105

Semantic interoperability

Sender1 Sender2 Receiver

slide-106
SLIDE 106

Semantic interoperability-2

Sender1 Sender2

HL7 v2.x FHIR

Receiver

CSV Universal Healthcare Exchange Language

slide-107
SLIDE 107

Step 2: Semantic transformation

Sender1 Sender2

HL7 v2.x FHIR

m1 to m3 m2 to m3 HL7 v2.x to RDF FHIR to RDF RDF to CSV

RDF

Receiver

CSV

slide-108
SLIDE 108

Data transformations

Sender1 Receiver Sender2

HL7 v2.x FHIR CSV

RDF to RDF RDF to CSV

Semantic Transformations Syntactic Transformations Syntactic Transformations

RDF

slide-109
SLIDE 109

Data transformations

RDF to RDF

Semantic Transformations Syntactic Transformations Syntactic Transformations

RDF

slide-110
SLIDE 110

Step 2: Semantic transformation

RDF

Semantic Transformations Semantic Transformations Semantic Transformations

slide-111
SLIDE 111

Syntactic and Semantic Transforms

Sender1 Sender2

HL7 v2.x FHIR

HL7 v2.x to RDF FHIR to RDF RDF to CSV

Receiver

CSV

Syntactic Semantic (RDF) Syntactic RDF to RDF RDF to RDF

slide-112
SLIDE 112

Syntactic and Semantic Transforms

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

Syntactic Syntactic RDF to RDF RDF to RDF m1 m2 m1 m3 Semantic

slide-113
SLIDE 113

Syntactic and Semantic Transforms

Sender1 Sender2

HL7 v2.x FHIR

To RDF To RDF To CSV

Receiver

CSV

RDF to RDF RDF to RDF m1 m2 m1 m3

slide-114
SLIDE 114

Step 2: Semantic transformation

Sender1 Sender2

HL7 v2.x FHIR

RDF to RDF RDF to RDF HL7 v2.x to RDF FHIR to RDF RDF to CSV Receiver

CSV Semantic Transformations Syntactic Transformations Syntactic Transformations

slide-115
SLIDE 115

Why RDF?

Schema promiscuous

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country subClassOf sameAs hasLast hasFirst

slide-116
SLIDE 116

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Country subClassOf sameAs hasLast hasFirst

slide-117
SLIDE 117

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country

slide-118
SLIDE 118

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country

slide-119
SLIDE 119

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country

slide-120
SLIDE 120

Green model

HomePhone Town ZipPlus4 Country FirstName LastName Email Green Model

slide-121
SLIDE 121

Blue view

Country Address FirstName LastName Email City ZipCode Blue Model

slide-122
SLIDE 122

Blue view

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country Country Address FirstName LastName Email City ZipCode Blue Model Country

slide-123
SLIDE 123

Red view

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country HomePhone Town ZipPlus4 FullName Country Red Model

slide-124
SLIDE 124

Green view

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country HomePhone Town ZipPlus4 Country FirstName LastName Email Green Model Country

slide-125
SLIDE 125

HomePhone Town ZipPlus4 FullName Country Red Model

slide-126
SLIDE 126

Why RDF?

Schema promiscuous

HomePhone Town ZipPlus4 FullName Country Address FirstName LastName Email City ZipCode Red Model Blue Model Green Model Country subClassOf sameAs hasLast hasFirst

slide-127
SLIDE 127

Role of a common language

Universal Healthcare Exchange Language

slide-128
SLIDE 128

Role of a common language

Universal Healthcare Exchange Language Sender1 Sender2 Receiver

slide-129
SLIDE 129

Semantic relevance is relative

  • Blood Pressure measurement:

– Sitting versus Standing

  • Is the difference semantically relevant?
  • Depends on the application!
  • v:BP_Machine rdfs:subClassOf v:BP .
  • v:BP_Manual rdfs:subClassOf v:BP .
  • { ?bp a v:BP_Machine . }

=> { ?bp a v:BP . } .

slide-130
SLIDE 130

Sender1 Receiver Sender2

HL7 v2.x FHIR

RDF to RDF RDF to RDF HL7 v2.x to RDF FHIR to RDF

Merged RDF

slide-131
SLIDE 131

Sender1 Receiver Sender2

HL7 v2.x FHIR

HL7 v2.x to RDF FHIR to RDF

?

Step 1: Syntactic transformation

  • Transform from source format to substrate model
  • Allows data to be merged
  • Data may not link semantically due to differing vocabularies
slide-132
SLIDE 132

Sender1 Receiver Sender2

HL7 v2.x FHIR

?

Sender1 data: HL7 v2.x

(Fictitious examples for illustration)

slide-133
SLIDE 133

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 1. RDF is the best available

candidate for a universal healthcare exchange language.

slide-134
SLIDE 134

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 2. Electronic healthcare information

should be exchanged in a format that either: (a) is an RDF format directly; or (b) has a standard mapping to RDF.

slide-135
SLIDE 135

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 3. Existing standard healthcare

vocabularies, data models and exchange languages should be leveraged by defining standard mappings to RDF, and any new standards should have RDF representations.

slide-136
SLIDE 136

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 4. Government agencies should

mandate or incentivize the use of RDF as a universal healthcare exchange language.

slide-137
SLIDE 137

Yosemite Manifesto

  • n RDF as a Universal Healthcare Exchange Language
  • 5. Exchanged healthcare

information should be self- describing, using Linked Data principles, so that each concept URI is de-referenceable to its free and

  • pen definition.
slide-138
SLIDE 138

Using Transformation Definitions

Map/ Translate Input Format RDF Output Format Mapping/ Translation Definitions To RDF From RDF