OntologicalModelandApproaches intheIntegratedBiomedical - - PowerPoint PPT Presentation
OntologicalModelandApproaches intheIntegratedBiomedical - - PowerPoint PPT Presentation
OntologicalModelandApproaches intheIntegratedBiomedical DatabaseProject (IBMD) HiroshiTanaka TokyoMedicalandDentalUniversity Whatis theIntegratedBiomedicalDB project? project?
Whatis theIntegratedBiomedicalDB project? project?
IntegratedBiomedicalDBProjects
- Governmentcommissionedproject
– Startedat2007inTokyoMedical&DentalUniv.
- byMEXT(MinistryofEducation,Culture,Sports,
ScienceandTechnology) ScienceandTechnology)
- AsubprojectoftheNationalProjectsforDBs
integrationinLifescience fields
- Maingoalofthisprojectis
integrationofdiseasedatabasesinJapan
BiomedicalOntologyProjects inJapan
- Two governmentcommissionedmedical
- ntologyprojectsinJapan
- Ours:ScientificDBintegrationbyMEXT
- ClinicalInformationSystembyMHLW
(MinistryofHealth,LabourandWelfare) “JapanMedicalOntologyDevelopmentProject forAdvancedClinicalInformationSystem”
– Dr.Imai’stalk
- Goodcollaborationbetweenbothprojects
NationalProjectforDBintegrationin LifeScienceField(MEXT)
- IntegratedDBProjectinLifeScience
– Startedfrom2006
Background
- DBsinLSinJapanwerescatteredinvariousinstitutes
– DNAsequence(DDBJ)inNIGinMishima – DNAsequence(DDBJ)inNIGinMishima – ProteinDB(PDBJ)inOsakaUniversity – PathwayDB(KEGG)inKyotoUniversity – UnlikeNCBIinUSandEBIinEurope
Goal
- EstablishNationalCenterforDBintegrationinLS
(DBCLS)
– developintegratedDBservice – commonportalsforDBsinLS
AllocationOrganizationfor IntegratedDBProjects
- TokyoMedicalandDentalUniversity(TMDU),
InformationCenterforMedicalScience
– commissionedbyMEXT – tointegratescattereddatabasesonvariousdiseaseinJapan – tobeaNationalCenterforDiseaseDatabaseIntegration – tobeaNationalCenterforDiseaseDatabaseIntegration – from2007,IntegratedBiomedical(clinical)DB
- KyotoUniversityBioinformaticsCenter
– IntegrationofDrugInformation
- TokyoUniversityDept.HumanGenetics
– IntegrationofGeneticPolymorphism
Challengesfor IntegratedBiomedicalDBprojects
- Developmentof OntologybasedIntegrative
InterfaceforDiseaseDBsearch
- IntegrationoftheCoreClinicalDBsinJapan
– ParkinsonDiseaseDBinOsakaUniv. – ParkinsonDiseaseDBinOsakaUniv. – GEMDBJinNationalCancerCenter – Soon
- EstablishEthicalCodeforPublicizingClinical
CaseDB
Then,howcanwedescribe Disease? Isitphysicalentity,or Isitphysicalentity,or justconceptual ?
DiseaseModelinginDatabase
- Whatistheontologicalcharacteristics
- fdisease?
- Complexityof“Disease”
- Diseaseisamultifaceted,
DiagnosticTherapeutic Prognosis Diseaseprogression epidemiological populational beha social
Dise
multilayeredentity
– moleculargenetic – tissueorganic – individual – diagnostictherapeuticprognostic – behavioral(medicalpractice) – populationalepidemiological
moleculargenetic Organsystems Individualobservationa DiagnosticTherapeutic biological behavioral
Disease
tissueorganic
DiseaseModelinginDatabase
- Complexityofthe“Disease”
– Multifaceted,multilayeredentity – Incompletenessofdiseaseknowledge – Multiplicityofrelatedsciences – Multiplicityofrelatedsciences
- biological,psycological,behavioral,conceptual,
socialscience
- Mainoppositestandpoints
– PhysicalvsConceptual – CausativevsObservational
Howdowedescribedisease
- FormalDescriptionofDisease
- DiseaseViewarenowchangingsince
RevolutionofMolecularmedicine RevolutionofMolecularmedicine
- ConventionalView
– MultilayeredPhenotypicalDescription – EssentiallyObservational – (Place,Organ)X(Pathomorphology)
- myocardial(place)infarction(pathology)
RecentChangesofDiseaseView
- AdvancesinMolecularMedicine
– diseasegenetics
- diseasecausative(related)gene
- geneticpolymorphism(SNP,ms)
SNPs
- geneticpolymorphism(SNP,ms)
– diseaseomics
- geneticexpressionprofile
- proteome,metabolome
– diseasemolecularpathway
- distortedsignalpathwayor
regulatorynetwork
DNAmicroarray SNPs
IndividualLevel IndividualLevel
Top&dow Bottom
- Diseaseishierarchicallyorganized
“distortedmolecularnetwork”
Tissue Tissue& &OrganLevel OrganLevel CellularnetworkLevel CellularnetworkLevel
downCausality
- ttom&upCausality
!
- !"
- !"
Ontology
- Formalrepresentationofasetofconcepts
withinadomainandtherelationshipsbetween thoseconcepts
– usedtoreasonaboutthepropertiesofthatdomain, andtodefinethedomain andtodefinethedomain – “Formal,explicitspecificationofashared conceptualisation“(Gruber,klsstanford)
- Controlled(Formal)Vocabulary
– usedtomodeladomainforknowledgesharingand reuse – thetypeofobjects/conceptsthatexist,andtheir propertiesandrelations
ChallengesinBiomedicalOntology
Ontologymismatchbetween clinicalthinkingandOmicsmedicine
- Eachinformationsystem
hasitsontologyasa basis
Information system
Information system
basis
- Mismatchbetween
Clinicalontologyand Bio&ontology
terminology
- ntology
terminology
- ntology
Clinical Ontology Bio&ontology
MismatchofThinking
- Clinicalthinking
– Organsanddiseasesareunitsofconcepts – Clinicalphenotypical;diseaseisdefinedonpathological, morphological(changes)base – Essentiallygoal&oriented:diseasecaredirected – Essentiallygoal&oriented:diseasecaredirected – Topdown
- Molecular(Omics)medicinethinking
– Molecularfunctionandtheirfunctionalrelationtoother molecules – Productsofgeneexpressionareunitsofconcepts – Bottomup
! Clinicalthinkingand“Omicspace”
symptomatology symptomatology
Clinicalinformation Clinicalinformation
Pathway Pathway signalome signalome networkome networkome
Omicspace Omicspace
#
- Pathophsiology
Pathophsiology Etiology Etiology Genome Genome SNP,Haplotype SNP,Haplotype Transcriptome Transcriptome Proteome Proteome metabolome metabolome
ExistingClinicalandBio&Ontology
- Clinicalontology
– Semanticnetwork,UMLS,Galen(SNOMED)
- GeneOntology
– Molecularfunction,process,cellularlocationof geneproducts – Nowonly,Eukaryotes
SeveralprojectsforIntegrationof Bio/ClinicalOntology
- OpenBiomedicalOntologies(OBO)
– NationalCenterforBiomedicalOntology(Mussen) – OBOFoundry(Smith)
- OtherProjects
- OtherProjects
– OntologyforBiomedicalInvestigations – ULMSplanstoinvolve GeneOntology – DiseaseOntology – Soon
IntegrativeClinicoOmicOntology (possibletransitform)
- Globalstructurefollows
theframeworkof clinicalontology
- Withintheframework
$ %!"
- Withintheframework
bio&ontologyis employedtoprovide bottom&uprelationof themeaningof phenotypicalentities
molecular manifes& tations (makers) therapy (drugs) molecular
PracticalSolution
- ClinicalNosologicalOntology
– CoventionalTextbookknowledgeOntology withpatientmedicalinformationontology – usedforOntologyforDiseaseDBs – usedforOntologyforDiseaseDBs integration
- MultilayeredClinicalOmicsOntology
– Stillunderdevelopment – Butwithlinkedmultilayereddataschema – integratedClinicalOmicsDatabase(iCOD)
SeveralResultsofIBMDBproject
ResultsforOngiongIBMDBproject
- DevelopedthefirstversionofNosological
IBMBDOntology
- MutlilayeredIntegrativeClinicalOmicsDB
- MutlilayeredIntegrativeClinicalOmicsDB
- DevelopedOntologybasedInterDB
SearchSystem
- TrialsystemforintegrativeDBsearch
betweenParkinsonDiseaseDBandiCOD
IBMDBOntology
Disease Clincial
Classification
Systems Omics Pathology
Ontol
Concep Concep IntegrationLevel IntegrationLevel
EstablishedKnowledge (textbook) EHR CENen13606
Archetype
2linesofintegrationapproach
Nosology3based integration Systems3pathobiology3 basedintegration
MetaClassification
Nosologicalclassification SystemsPathoBiological classification
Integrationbasedon establishedconcept classification
&'() 3
3 3 3 Levelmodelfordatabaseintegration
Disease Clinical
Cancer Data DataStructure Terminology NeurologicalDisorder
Systems Omics Pathology
Ontology Info.Model
- nceptLevel
- nceptLevel
Info.ModelLevel Info.ModelLevel Concept Library Reference Model
interlock
HL7
SNOMEDCT
2Level BIMS CommonTemplate Data DataStructure Data DataStructure
DiseaseTerminolologyandClassification
CondensedCross ClinicalOntology DiseaseTerminology DiseaseClassification
Nakaya,J.,Sasaki,K.,andTanaka,H.(2006)CondensedCrossClinicalKnowledge,ComputerScience,IJCSNS.6(7A).6&11.
Thisisthediseaseterminologyandclassification. Diseasesareclassifiedwithcombinatedanatomicalhierarchyandetiologicalhierarchy. AconceptualunitofaDiseasseisdescribedwiththe3rdnormalizedskeletontemplatewhichcanbe calledasacontentmodel.
#* + +*
, ,)
- ./"
, #
- /
# 0 ,
- $
+ +
- /
, " * ',* %* +0%
- 1
- 2#
, , $
- /
- ,
- /
, /( , $ ,
3 1( '!(
2# 1/ 3 $ /% % %
- 45
,, , )
,*
NosologicalOntology
(anexampleofCompositeindexofanatomyandetiology)
AnexampleofLiverdiseaseindex
ApartofParkinsonDisease
integratedClinialOmicsDB (iCOD) (iCOD)
- Integratethemolecularomicsinformationand
clinicalandpathodological,lifestyleinformation
- Comprehensivedatabasebasedontheconceptof
“omics&basedsystemsmedicine”
$ $
Purpose
$ $
- $
$
- Data
Data Mining Mining $ $ ', ',
Cases
cases
- mics information
transcriptome CNV specimen normal specimen
- rmal
Hepatocellular carcinome 193 193 193 193 152 152 152 152 96 96 96 96 102 102 102 102 35 35 35 35 stored 41 29 0 34 0 fresh 134 105 81 66 35 metastasis 18 18 15 0 0 metastasis 18 18 15 0 0 colon cancer 184 184 184 184 131 131 131 131 28 28 28 28 39 39 39 39 40 40 40 40 colon 128 102 28 36 36 rect 37 29 0 3 4
- ral tumor
148 148 148 148 20 20 20 20 0 0 0 0 64 64 64 64 2 2 2 2 stored 64 0 0 64 2 fresh 84 20 0 0 0 total 525 525 525 525 303 303 303 303 124 124 124 124 205 205 205 205 77 77 77 77
iCOD:integratedClinicalOmicsDatabase
Toppage CaseArchvie CaseList
//
CaseDetails
(!
Toppage
CaseList
//
CaseDetails(1)
4$5 6!
CaseDetails(2)
ClinicalOmicsAnalysis
Clinical3&layeredOmicsMap
Omics dataanalysis
'
Pathwaymap OmicDownload PosMed Legenda
- TranscriptomemappedonKEGG
SemanticNavigationSystem
Weshowthesemanticnavigationsystem whichwecansayasacontentinterfacebasedonthecontentmodel.
IntegratedDBGuidingSystem
- forCoreDB (2DBs+
+ + +α( ( ( (10DBs) ) ) ) : Nowininvestigation) WearedevelopingSearchingSystembasedontheontology
- UserI/FManagerconvertuserinputtostandardizedwordwithterminology
andthesaurus.
- Navigatingenginenavigatesuserstotargeteddatabasessemantically.
- DataformatmapperabsorbsformatdifferencesofDBs.
DBsearch
DataFormat1 DataFormatn
Thecontentmodelisthebasic templateofbothontologyanddata formatinthesystem ユーザー User I/F Manager
DB Manager Navigating Engine
Nosological Ontology Terminology Thesaurus DataFormat Mapper
DataFormat2 DataFormat3
Standardized DataFormat
Data Formatn Data Formatn Data Formatn Data Formatn DataFormatn
ContentModel
PrototypeDemo
47
PPTversion
IBMDTopPage
http://ibmd.tmd.ac.jp
48
InputBoxforSearch
DBMapofIBMD
- nDiseaseClassification
SearchWindow
49
ResultsofSearching
Example
– Inquiry“Symptom Symptom: : : : : : : :Depressivestate” Depressivestate”
- CancerDB inTMD
– <D> 0 cases – <S> byThesaurus,semantictransform “Depressivestate→Anorexia” Depressivestate→Anorexia” ⇒ 2 2 2 2 2 2 2 2 cases cases – <S> “Depressivestate→Lassitude” Depressivestate→Lassitude” ⇒ 4 4 4 4 4 4 4 4 Cases Cases » onecaseisdoubled5cases
- ParkinsonDiseaseDB inOsakaUniv.
– <D>0 cases – <S>Ontology、” ” ” ” ” ” ” ”Depressivestate→slowmovement” Depressivestate→slowmovement” – – ⇒ 6 6 6 6 6 6 6 6 cases cases
Input
1.Category:”Symtoms Symtoms”、SearchInputWord:”Depressivestate Depressivestate”
51
2.Push SearchButton
Results
52
1.FromTMD TMDDB DB DB DB DB DB DB DB <Direct>Ocases
<Semantic>「Symptom:Depressivestate→Anorexia Anorexia」 2 2cases <Semantic>「Symptom:Depressivestate→Lassitude Lassitude」 4 4cases Duplicatecase1case→ (Total5cases Total5cases)
2.FromOsakaUniv. OsakaUniv.DB DB DB DB DB DB DB DB <Direct>0cases
<Semantic>Symptom:Depressive→SlowMovement Symptom:Depressive→SlowMovement 6cases
DetailedData
Clickherefordetaileddata
53
DetailedDatasubWindow
HittedPatientInformation isdisplayed accordingpatientcontent Thispanelisbasedon thearchetypetemplate(thecontentmodel)ofdisease
54
accordingpatientcontent model(template)
PatientContent
Ex:PatientHistory subwindow
55
ICD11andHIM3TAG
TAG3HIM presentstatus
- 1.
ContentModelGroup – TreattheoriginalusecaseofICD11(Stefany)
- 2.Informationgroup
– AlanRector(Univ.ofManchester,UK),John,Chris,SCT
- 3.
ContentModelfrontendforeachTAG, – CategorialStructureforRaredisease
- JeanMariewithRareDiseaseTAG
– IBMDBmodelismodifiedforinternalmedicine
- JunNakayaandHiroshiTanakawithInternalMedicineTAG
– Robertwilltakecareofthesethings
- 4.
SCT(SNOMED)Coordination – IHTSDOharmonizationpanelwilltakethisissue.
- (Kent,AlanRector(Univ.ofManchester,UK),Chris,Olivier)
– GOandotherontologieswillcovertheremainedarea.
DataExchangeFormat (asanInfo.Model)
/76"' /76"' 4/6'5 4/6'5 /$89:;<:=Passed/
US,UK,Canada,Korea,Italy,Israel,Australia,Japan
LedbyJunNakaya
Nakaya,J.,Hiroi,K.,Yang,W.,Ido,K.,Kimura,M.(2006)"GenomicSequenceVariationMarkupLanguage (GSVML)forGlobalInteroperabilityofClinicalGenomicsData(#!)".AsiaPacificAssociationfor MedicalInformatics2006Proceedings.A01.1&8. ; JunNakaya,MichioKimura,HiroshiTanaka
OutlinedStructureofGSVML
TheGSVMLhashierarchicalstructure. TheentrypointofGSVMLisgenomicsequencevariation.
Nakaya,J.,Hiroi,K.,Yang,W.,Ido,K.,Kimura,M. (2006)(BestPaperAward)APAMI2006A01:1&8.
TheGSVMLhas3Datacriteriaasvariationdata,directannotation,andindirectannotation. Thesecriteriahavetheinternalrelationsmainlybasedonthestatitics.
- HiroshiTanaka