Genome Biology Ontology + Gatekeeper Jasper Koehorst Laboratory of - - PowerPoint PPT Presentation

genome biology ontology gatekeeper
SMART_READER_LITE
LIVE PREVIEW

Genome Biology Ontology + Gatekeeper Jasper Koehorst Laboratory of - - PowerPoint PPT Presentation

Genome Biology Ontology + Gatekeeper Jasper Koehorst Laboratory of Systems and Synthetic Biology Current formats Not designed To store computational annotation meta-data For semantic data mining To query / ask questions


slide-1
SLIDE 1

Genome Biology Ontology + Gatekeeper

Jasper Koehorst Laboratory of Systems and Synthetic Biology

slide-2
SLIDE 2
  • Not designed

– To store computational annotation meta-data – For semantic data mining – To query / ask questions

  • Therefore

– No database system like query interface – No data provenance of predictions is included

Current formats

2

slide-3
SLIDE 3

Overview of the types in GBOL

3

Positions Sequence / Features Provenance Procedures Articles Sample

slide-4
SLIDE 4

Code generation: EMPUSA

  • Linked data graph is free format: Ontology defines

structure but does not enforce it.

  • NEED TO MANTAIN CONSISTENCY
  • From Ontology (protégé file)
  • OWL + ShEx
  • API: Java + R
  • Instance validation included
  • > 80.000 lines of code generated
  • HTML documentation (website)
  • OWL compatible file
slide-5
SLIDE 5

Semantic Annotation Platform with Provenance

Functional annotation

  • BLAST
  • Enzyme predictions
  • Domain annotation
  • Signal peptides
  • Transmembrane
  • Localization

Genetic elements

  • Gene prediction
  • tRNA/rRNA
  • Crispr

Conversion types

  • EMBL / GenBank
  • FASTA
  • GFF
  • QTL
  • VCF