BioSignalML Putting biosignals onto the Semantic Web David Brooks - - PowerPoint PPT Presentation

biosignalml
SMART_READER_LITE
LIVE PREVIEW

BioSignalML Putting biosignals onto the Semantic Web David Brooks - - PowerPoint PPT Presentation

BioSignalML Putting biosignals onto the Semantic Web David Brooks Background Biosignal -- time series data resulting from a biological process. Background Biosignal -- time series data resulting from a biological process.


slide-1
SLIDE 1

BioSignalML

Putting biosignals onto the Semantic Web David Brooks

slide-2
SLIDE 2

Background

  • Biosignal -- time series data

resulting from a biological process.

slide-3
SLIDE 3

Background

  • Biosignal -- time series data

resulting from a biological process.

  • Sampled, usually at a

regular rate, which is usually much greater than the highest frequency of interest.

slide-4
SLIDE 4

Background

  • Biosignal -- time series data

resulting from a biological process.

  • Sampled, usually at a

regular rate, which is usually much greater than the highest frequency of interest.

  • Electrical, pressure,

concentration, …

slide-5
SLIDE 5

Background

  • Biosignal -- time series data

resulting from a biological process.

  • Sampled, usually at a

regular rate, which is usually much greater than the highest frequency of interest.

  • Electrical, pressure,

concentration, …

  • Simulation time series data.
slide-6
SLIDE 6

Background

  • A lot of file formats:

– manufacturer; research; regulatory; …

WaveForm DataBase WFDB Standard Communication Protocol for ECG (CEN) SCP Medical waveform Format Encoding Rules (ISO) MFER General Data Format (an EDF derivative) GDF FDA standard for ECG FDAXML European Data Format plus EDF+ European Data Format EDF 24 bit version of EDF BDF

slide-7
SLIDE 7

Background

  • A lot of file formats:

– manufacturer; research; regulatory; …

  • Often developed for a

specific problem domain.

WaveForm DataBase WFDB Standard Communication Protocol for ECG (CEN) SCP Medical waveform Format Encoding Rules (ISO) MFER General Data Format (an EDF derivative) GDF FDA standard for ECG FDAXML European Data Format plus EDF+ European Data Format EDF 24 bit version of EDF BDF

slide-8
SLIDE 8

Background

  • A lot of file formats:

– manufacturer; research; regulatory; …

  • Often developed for a

specific problem domain.

  • All generally good at

storing time series data.

WaveForm DataBase WFDB Standard Communication Protocol for ECG (CEN) SCP Medical waveform Format Encoding Rules (ISO) MFER General Data Format (an EDF derivative) GDF FDA standard for ECG FDAXML European Data Format plus EDF+ European Data Format EDF 24 bit version of EDF BDF

slide-9
SLIDE 9

Background

  • A lot of file formats:

– manufacturer; research; regulatory; …

  • Often developed for a

specific problem domain.

  • All generally good at

storing time series data.

  • Metadata format is file

specific.

WaveForm DataBase WFDB Standard Communication Protocol for ECG (CEN) SCP Medical waveform Format Encoding Rules (ISO) MFER General Data Format (an EDF derivative) GDF FDA standard for ECG FDAXML European Data Format plus EDF+ European Data Format EDF 24 bit version of EDF BDF

slide-10
SLIDE 10

Background

  • A lot of file formats:

– manufacturer; research; regulatory; …

  • Often developed for a

specific problem domain.

  • All generally good at

storing time series data.

  • Metadata format is file

specific.

  • Metadata content tends to

be domain specific.

WaveForm DataBase WFDB Standard Communication Protocol for ECG (CEN) SCP Medical waveform Format Encoding Rules (ISO) MFER General Data Format (an EDF derivative) GDF FDA standard for ECG FDAXML European Data Format plus EDF+ European Data Format EDF 24 bit version of EDF BDF

slide-11
SLIDE 11

Difficulties

  • Polysomnography:

– “Currently, digital data from most PSG systems can

  • nly be viewed if one utilizes the system with which it

was collected.” [1] – “Unfortunately, not much has happened since … no consensus for data sharing has taken root.” [2]

[1] D. Rapoport, I. Ayappa, R. Norman, and S. Herman, “NPSG data interchange-dealing with the Tower of Babel.” Sleep,

  • vol. 29, no. 5, p. 599, 2006.

[2] D. M. Rapoport, email correspondence, November 2011.

slide-12
SLIDE 12

Difficulties

  • Polysomnography:

– “Currently, digital data from most PSG systems can

  • nly be viewed if one utilizes the system with which it

was collected.” [1] – “Unfortunately, not much has happened since … no consensus for data sharing has taken root.” [2]

  • Metadata terms:

– Different groups may have different meanings for a term.

[1] D. Rapoport, I. Ayappa, R. Norman, and S. Herman, “NPSG data interchange-dealing with the Tower of Babel.” Sleep,

  • vol. 29, no. 5, p. 599, 2006.

[2] D. M. Rapoport, email correspondence, November 2011.

slide-13
SLIDE 13

Difficulties

  • Polysomnography:

– “Currently, digital data from most PSG systems can

  • nly be viewed if one utilizes the system with which it

was collected.” [1] – “Unfortunately, not much has happened since … no consensus for data sharing has taken root.” [2]

  • Metadata terms:

– Different groups may have different meanings for a term. – Units: µV, uV, V×10-6 ??

[1] D. Rapoport, I. Ayappa, R. Norman, and S. Herman, “NPSG data interchange-dealing with the Tower of Babel.” Sleep,

  • vol. 29, no. 5, p. 599, 2006.

[2] D. M. Rapoport, email correspondence, November 2011.

slide-14
SLIDE 14
  • Web content that is meaningful to computers.

– Knowledge representation, ontologies, reasoning, intelligent agents, …

Semantic Web

slide-15
SLIDE 15
  • Web content that is meaningful to computers.

– Knowledge representation, ontologies, reasoning, intelligent agents, …

  • http://www.w3.org/standards/semanticweb/

– Resource Description Framework (RDF) – RDFS, OWL, SPARQL, …

Semantic Web

slide-16
SLIDE 16
  • Web content that is meaningful to computers.

– Knowledge representation, ontologies, reasoning, intelligent agents, …

  • http://www.w3.org/standards/semanticweb/

– Resource Description Framework (RDF) – RDFS, OWL, SPARQL, …

  • Linking Open Data

Semantic Web

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

slide-17
SLIDE 17

BioSignalML

  • Abstract common elements of storage formats.
slide-18
SLIDE 18

BioSignalML

  • Abstract common elements of storage formats.
  • Use Semantic Web standards/technologies.

– Objects have web identifiers. – Ontologies define terms, properties, relationships.

slide-19
SLIDE 19

http://repository.biosignal.org/recording3/signal/4

BioSignalML

  • Abstract common elements of storage formats.
  • Use Semantic Web standards/technologies.

– Objects have web identifiers. – Ontologies define terms, properties, relationships.

  • Time series data is in native format; everything

else is available as RDF metadata.

slide-20
SLIDE 20

BioSignalML as RDF

  • Core concepts:

– Recordings – Signals – Events and Annotations.

slide-21
SLIDE 21

BioSignalML as RDF

  • Core concepts:

– Recordings – Signals – Events and Annotations.

  • RDF graph:
slide-22
SLIDE 22

BioSignalML as an ontology

  • Classes, terms, properties, relationships:
slide-23
SLIDE 23

BioSignalML as an ontology

  • Classes, terms, properties, relationships:
slide-24
SLIDE 24

Metadata Signal Recordings Abstraction Layer Python API EDF WFDB HDF5 Triple Store

BioSignalML implementation

  • Biosignal repository:
slide-25
SLIDE 25

Internet

Metadata Signal Recordings Abstraction Layer Python API EDF WFDB HDF5 Triple Store

SPARQL Query

RESTful Web Services

RDF HTML Raw Files Stream Web Browsers

BioSignalML implementation

  • Biosignal repository:
  • Web based with HTTP

endpoints: – File import/export – RDF metadata – Data streamed via web-sockets.

slide-26
SLIDE 26

Internet

Metadata Signal Recordings Abstraction Layer Python API EDF WFDB HDF5 Triple Store

SPARQL Query

RESTful Web Services

RDF HTML Raw Files Stream

Applications

and Tools Web Browsers

BioSignalML implementation

  • Biosignal repository:
  • Web based with HTTP

endpoints: – File import/export – RDF metadata – Data streamed via web-sockets.

  • C client (plus Python,

Javascript, …)

slide-27
SLIDE 27

BioSignalML clients

  • Web browser:
slide-28
SLIDE 28

BioSignalML clients

  • Web browser:
  • RDF browser:
slide-29
SLIDE 29

BioSignalML clients

import import bi biosignalml import import bi biosignalml.u .units as units repo repo = = bi biosignalml.R .Repository(‘ht http://demo.bi biosignalml.o .org’) rec rec = = re repo.ne new_recording(‘ht http://ex example.org/recording/test’) sig sig = = re rec.ne new_signal(i (id=‘a1 a1’, , units=un units.mi millivolt) for

  • r data in da

datasource: si sig.a .append(data) rec rec.c .close() sig sig = = re repo.ge get_signal(‘ht http://ex example.org/recording/test/signal/a1’) print print si sig.ur uri, , si sig.l .label, si sig.u .units start = 0.0 start = 0.0 end = 10.0 end = 10.0 duration = 1.0 duration = 1.0 while start < end: while start < end: interval = interval = si sig.r .recording.interval(start, duration) for data in for data in si sig.r .read(interval): print data # print data # Si SignalSegment st start += duration

  • Python code:
slide-30
SLIDE 30

BioSignalML clients

  • CellML modelling:
slide-31
SLIDE 31

$ ./bwfilter http://devel.biosignalml.org/recording/physiobank/nifecgdb/ecgca102/signal/3

BioSignalML clients

  • CellML modelling:
slide-32
SLIDE 32

Ongoing work

  • Interfacing with simulation tools (OpenCOR,

SED/ML) – real world applications.

  • Adding a Semantic Web layer to PhysioBank.
  • Integrate Units of Measurement Expressions:

– http://www.sbpax.org/uome/index.html – Ontology to derive units from other units. – An extensible way to automate units validation and conversion.

slide-33
SLIDE 33

Thank you