when and why to use a classifier when and why to use a
play

When and Why to use a Classifier? When and Why to use a Classifier? - PowerPoint PPT Presentation

When and Why to use a Classifier? When and Why to use a Classifier? Alan Rector Alan Rector with acknowledgement to with acknowledgement to Jeremy Rogers, Pieter Zanstra Zanstra, & the GALEN Consortium , & the GALEN Consortium


  1. When and Why to use a Classifier? When and Why to use a Classifier? Alan Rector Alan Rector with acknowledgement to with acknowledgement to Jeremy Rogers, Pieter Zanstra Zanstra, & the GALEN Consortium , & the GALEN Consortium Jeremy Rogers, Pieter Nick Drummond, Matthew Horridge, Hai Wang in CO- -ODE/ ODE/HyOntUSE HyOntUSE Nick Drummond, Matthew Horridge, Hai Wang in CO Information Management Group Dept of Computer Science, U Manchester ter Information Management Group Dept of Computer Science, U Manches Holger Knublauch, Ray Fergerson Fergerson, … and the Protégé , … and the Protégé- -Owl Team Owl Team Holger Knublauch, Ray rector@cs.man.ac.uk rector@cs.man.ac.uk co- -ode ode- -admin@cs.man.ac.uk admin@cs.man.ac.uk co www.co- -ode.org ode.org www.co protege.stanford.org protege.stanford.org www.opengalen.org www.opengalen.org Open GALEN 1

  2. When to use a classifier When to use a classifier 1. At author time: As a compiler – Ontologies will be delivered as “pre-coordinated” ontologies to be used without a reasoner – To make extensions and additions quick, easy, and responsive, distri but developments, empower users to make changes Part of an ontology life cycle – 2. At delivery time: As a service: Many fixed ontologies are too big and too small – • Too big to find things; too small to contain what you need – Create them on the fly – Part of an ontology service 3. At application time: as a reasoner Decision support, query optimisation, schema integration, …, …, … – – Part of a reasoning service Open GALEN 2

  3. When to use a classifier 1: When to use a classifier 1: Pre- -coordinated delivery coordinated delivery Pre classifier as compiler classifier as compiler • The life cycle – Gather requirements, sketch, experiment – Establish patterns – design a “language” • Criteria for success: What a subject domain expert can learn in a few days – Bulk authoring Development & evolution – Classification – Quality assurance – Commit classifier results to a pre-coordinated ontology & deliver • Polyhierarchies (Protégé, DAG-Edit, OWL-Lite, RDF(S), Topic Maps, … – Query and use with you favourite tool Open GALEN 3

  4. Commit Results to a Pre- -Coordinated Coordinated Commit Results to a Pre Ontology Ontology Assert (“Commit”) changes inferred by classifier Open GALEN 4

  5. When to use a classifier 2: When to use a classifier 2: Post Coordination Post Coordination Classifier as an inference engine Classifier as an inference engine • When the ontology too big – “Lazy classification” on demand API kernel model Externally available resource Big on the outside: small kernel on the inside Big on the outside: small kernel on the inside Open GALEN 5

  6. Often combined with other services: Often combined with other services: Example - - the GALEN Server the GALEN Server Example Ontology Services Client Multilingual Module Multilingual Lexicons Client Service API Ontology Applications Module Users Ontologies & Classifie r External Resources & “ Coding ” Module Resource References Run time classifier Run time classifier Open GALEN 6

  7. …but… Why use a Classifier? but… Why use a Classifier? … • To compose concepts – Allow conceptual lego • To manage polyhierarchies – Adding abstractions (“axes”) as needed – Normalisation • Untangling – labelling of “kinds of is-a” • To avoid combinatorial explosions – Keep bicycles from exploding • To manage context – Cross species, Cross disciplines, Cross studies • To check consistency and help users find errors Open GALEN 7

  8. Logic- -based Ontologies: based Ontologies: Logic Conceptual Lego Conceptual Lego gene hand protein cell extremity expression body Lung chronic inflammation acute infection bacterial abnormal deletion normal polymorphism ischaemic Open GALEN 8

  9. Logic- -based Ontologies: based Ontologies: Logic Conceptual Lego Conceptual Lego “ SNPolymorphism of CFTRGene causing Defect in MembraneTransport of ChlorideIon causing Increase in Viscosity of Mucus in CysticFibrosis …” “Hand which is anatomically normal” Open GALEN 9

  10. Linking taxonomies: Linking taxonomies: Species Genes Conceptual Lego Conceptual Lego Normalisation Normalisation Protein Function CFTRGene in humans Disease Protein coded by (CFTRgene & in humans) Membrane transport mediated by (Protein coded by (CFTRgene in humans)) Disease caused by (abnormality in (Membrane transport mediated by (Protein coded by (CTFR gene & in humans)))) Open GALEN 10

  11. Logic Based Ontologies: The basics Logic Based Ontologies: The basics Validating Primitives Descriptions Definitions Reasoning (constraining cross products) Thing Feature Structure Thing pathological red + feature : pathological Heart MitralValve MitralValve Encrustation Encrustation Structure * ALWAYS partOf : Heart * ALWAYS feature : pathological + feature : pathological + involves : Heart red red Encrustation + partOf : Heart + partOf : Heart + involves : MitralValve + ( feature : pathological) Open GALEN 11

  12. Example demonstrations: Example demonstrations: Take a Few Simple Concepts & Properties Take a Few Simple Concepts & Properties Open GALEN 12

  13. Combine them in Descriptions Combine them in Descriptions which can be simple…. which can be simple…. Sickle cell disease is a disease caused Sickle cell disease is a disease caused some sickling sickling haemoglobin haemoglobin some Open GALEN 13

  14. or which can be as complex as you like or which can be as complex as you like Cytstic fibrosisis fibrosisis is caused by some non is caused by some non- - Cytstic normal ion transport that is the function of normal ion transport that is the function of a protein coded for by a CFTR gene a protein coded for by a CFTR gene Open GALEN 14

  15. Add some definitions Add some definitions “Diseases linked to CFTR Genes” Diseases linked to CFTR Genes” “ Open GALEN 15

  16. We have built a simple tree We have built a simple tree easy to maintain easy to maintain Open GALEN 16

  17. 17 Let the classifier organise it Let the classifier organise it Open GALEN

  18. If you want more abstractions, If you want more abstractions, just add new definitions just add new definitions (re- -use existing data) use existing data) (re “Diseases linked to abnormal proteins” Open GALEN 18

  19. 19 And let the classifier work again And let the classifier work again Open GALEN

  20. And again – – even for a quite different even for a quite different And again category category “Diseases linked genes described in the mouse” Open GALEN 20

  21. And let classifier check consistency And let classifier check consistency (My first try wasn’t) (My first try wasn’t) Open GALEN 21

  22. Represent context and views by Represent context and views by variant properties variant properties is_part_of OrganPart Organ Heart CardiacValve Pericardium is_structurally_part_of is_clinically_part_ of Disease of (Heart or part-of-heart) Disease of Pericardium Open GALEN 22

  23. Integration of Contexts in Protégé- -OWL OWL Integration of Contexts in Protégé Generic part-of FMA Functional Clinical ‘Internal’ FMA Functional Clinical Open GALEN 23

  24. Protégé- -OWL alternative views OWL alternative views Protégé Disorder of “Clinical heart” “Disorder of heart of any part of the heart” (including clinical and functional parts) Disorder of “FMA heart” “Disorder of heart or any structural part of the heart” Open GALEN 24

  25. Consequences for classification of Consequences for classification of diseases diseases Disorders of the things Doctors consider it a heart anatomists recognise as parts disease, even though of the heart developmentally/structurally it is not Open GALEN 25

  26. Summary: Why Classify? Summary: Why Classify? • To compose concepts • To untangle polyhierarchies – to Normalise • To avoid combinatorial explosions • To manage context • To check consistency and help users find errors Open GALEN 26

  27. Summary: When to Classify? Summary: When to Classify? Applications do not need a classifier Applications do not need a classifier to benefit from classification to benefit from classification • Pre-coordination – If concepts/terms can be predicted – When classifier is not available at run time – When we must fit with legacy applications • Post-coordination – When a a few concepts are needed from a large potential set – When a classifier is available • and time cost is acceptable – When applications can be built or adapted to take advantage Open GALEN 27

  28. Remember: Remember: Think about the Life Cycle Think about the Life Cycle • The life cycle for pre-coordinated ontologies – Gather requirements, sketch, experiment – Establish patterns – design a “language” • What a subject domain expert can learn in a few days – Bulk authoring – Classification Development & evolution – Quality assurance – Commit classifier results to a pre-coordinated ontology & deliver • Taxonomies (Protégé, DAG-Edit, OWL-Lite, RDF(S), Topic Maps, … – Query and use with you favourite tool www.co- -ode.org ode.org www.co Open GALEN 28

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend