Generating Knowledge Networks from Phenotypic Descriptions Fagner - PowerPoint PPT Presentation

Generating Knowledge Networks from Phenotypic Descriptions Fagner Leal Patr´ ıcia Cavoto, Julio dos Reis, Andr´ e Santanch` e pantoja.ti@gmail.com Laboratory of Information Systems University of Campinas Campinas - S˜ ao Paulo - Brazil October 24, 2016

Research Scenario Phenotype Descriptions 1 / 16

Research Scenario Phenotype Descriptions ◮ Morphological structures ◮ Behavior traits ◮ Life cycles; etc. 1 / 16

Research Scenario Phenotype Descriptions ◮ Morphological structures ◮ Behavior traits ◮ Life cycles; etc. Examples: 1. No dark longitudinal stripes on head and body. 1 / 16

Research Scenario Phenotype Descriptions ◮ Morphological structures ◮ Behavior traits ◮ Life cycles; etc. Examples: 1. No dark longitudinal stripes on head and body. 2. Scattered breast melanophores (Fuiman et al., 1983). Pteronotropis hubbsi can also be distinguished from Notropis chalybaeus by the presence of two caudal spots, one large spot centered at the base of the caudal fin below the flexed notochord and a smaller spot located dorsally above it, and by the presence of 9 dorsal rays in late metalarvae. Notropis chalybaeus has a single caudal spot in which no part extends above the notochord and 8 dorsal rays (Marshall, 1947). 1 / 16

Research Scenario ◮ Biology Knowledge Bases 1 http://www.fishbase.org 2 / 16

Research Scenario ◮ Biology Knowledge Bases e.g. , FishBase: knowledge base about fishes 1 1 http://www.fishbase.org 2 / 16

Research Scenario ◮ Biology Knowledge Bases e.g. , FishBase: knowledge base about fishes 1 ◮ Identification Keys (IK)s 1 http://www.fishbase.org 2 / 16

Research Scenario ◮ Biology Knowledge Bases e.g. , FishBase: knowledge base about fishes 1 ◮ Identification Keys (IK)s ◮ Artifacts to identify specimens ◮ Observable characteristics 1 http://www.fishbase.org 2 / 16

Research Scenario Identification Keys Example of IK to Teleostean families 3 / 16

Research Scenario Identification Keys Example of IK to Teleostean families Drawbacks: ◮ Need previous knowledge ◮ Need to follow the flow 3 / 16

Goal To recognize and explicit phenotype elements locked in the Identification Keys. Using the Entity-Quality (EQ) representation: ◮ Entity : morphological structure ◮ Quality : qualifier state of the Entity 4 / 16

Related Work Information Extraction Reference Context Approach Ciaramita et al. , 2005 Interactions in molecular biology Unsupervised Learning and Rules over Dependency Trees Song et al. , 2015 Biomedical anatomic entities Dictionary-based Pyysalo and Ananiadou, 2014 Biomedical Anatomic entities Supervised learning Ramakrishnan et al. , 2008 Biomedical Anatomical entities Dictionary-based, Rules over Dependencies Trees and Statis- tical Learning Fundel et al. , 2007 Gene and Protein Interaction Rules over Dependency Trees Cui, 2012 Morphological structures of or- Unsupervised Learning ganisms 5 / 16

Method General View Step 1 : It explores isolated sentences Step 2 : It explores the sentence correlations 6 / 16

Method Step 1 - General View Assumption : The typical way in which phenotype descriptions are written can guide the extraction of EQ elements. 7 / 16

Method Step 1 - Match Algorithm Identifying Entities and Qualities : 8 / 16

Method Step 1 - Output 9 / 16

Method Step 2 - General View Assumption : The structure of Identification Keys holds correlations that can be exploited to improve the extraction of EQ statements. 10 / 16

Method Step 2 - General View Assumption : The structure of Identification Keys holds correlations that can be exploited to improve the extraction of EQ statements. Generally, in phenotype descriptions: 1. Alternative sentences refer to the same Entities . 2. Alternative sentences assign complementary Qualities to Entity . 10 / 16

Method Step 2 - Algorithm 11 / 16

Method Step 2 - Algorithm Compare the two relations, based on: (a) Existence of antonymy between the quality parts (b) Relation Type (c) Grammatical classes of quality parts (d) Relation Directions 11 / 16

Method Step 2 - Algorithm Compare the two relations, based on: (a) Existence of antonymy between the quality parts (b) Relation Type (c) Grammatical classes of quality parts (d) Relation Directions Similarity = � d i = a v i 11 / 16

Method Step 2 - Output 12 / 16

Evaluation - Numerical Assessment Gold Standard-based Assessment Gold standard set: 100 phenotype descriptions (randomly selected) were manually annotated ❳❳❳❳❳❳❳❳❳❳❳ Elements EQ pair Entity Measures Recall 0,45 0,76 Precision 0,87 0,94 F-measure 0,59 0,84 13 / 16

Evaluation - Application Experiments EQ sharing through taxons 14 / 16 Figure 1: Bipartite network of Species and EQs

Evaluation - Application Experiments EQ sharing through taxons Figure 2: Projection of bipartide network 15 / 16

Conclusion Original approach to automatically recognize Entities and Qualities , exploring : ◮ Writing characteristics of phenotype descriptions ◮ Organizational structure of IKs Future Work ◮ To compare against other approaches ◮ To recognize complete EQs in Step 2 (not only the quality part) ◮ To calibrate the parameters and thresholds 16 / 16

Thank you!

Classical Measures TP Recall = (1) TP + FN TP Precision = (2) TP + FP F - measure = 2 ∗ Precision ∗ Recall (3) Precision + Recall Examples of: ◮ True Positive: ◮ expected: E [ lips ] Q [ notfringed ] ◮ recognized: E [ lips ] Q [ notfringed ] ◮ False Positive: ◮ expected E [ vertebrae ] Q [ 119 to 132 ] ◮ recognized: E [ vertebrae ] Q [132] ◮ False Negative: ◮ recognized E [ breastmelanophores ] Q [ Scattered ]

Considering Partial Matches ◮ Complete Miss (CM): false negative ◮ Wrong Hit (WH) : false Positive ◮ Full Match (FM): true Positive Partial Match Partial Precision = Full Match + Partial Match + Wrong Hit (4) Full Match Full Precision = (5) Full Match + Partial Match + Wrong Hit PartialMatch Partial Recall = Full Match + Partial Match + Complete Miss (6) FullMatch Full Recall = (7) Full Match + Partial Match + Complete Miss

Considering Partial Matches Total Precision = Partial Precision + Full Precision Total Recall = Partial Recall + Full Recall ❳❳❳❳❳❳❳❳❳❳❳ Elements EQ pair Entity Measures Partial-Recall 0.05 0.08 Full-Recall 0.39 0.67 Partial-Precision 0.11 0.1 Full-Precision 0.75 0.84 Table 1: Results concerning Perfect and also Partial Matches ❳❳❳❳❳❳❳❳❳❳❳ Elements EQ pair Entity Measures Total Recall 0,45 0,76 Total Precision 0,87 0,94 Total F-measure 0,59 0,84 Table 2: Total Results

Generating Knowledge Networks from Phenotypic Descriptions Fagner - PowerPoint PPT Presentation

Generating Knowledge Networks from Phenotypic Descriptions Fagner Leal Patr cia Cavoto, Julio dos Reis, Andr e Santanch` e pantoja.ti@gmail.com Laboratory of Information Systems University of Campinas Campinas - S ao Paulo -

Phenotypic HCV-NS3-PI-Resistance Test Arevir Meeting, 11./12.04.2013 Daniel Rupp, MD Phenotypic

H2 F2009 H2 F2009 GENERATING GENERATING GENERATING GENERATING FREE CASH FLOW FREE CASH FLOW

Chemspace Phenotypic Screening Set Description Phenotypic screening (PS) is one of the two major

Modelling phenotypic variation Modelling phenotypic variation in monthly weights of in monthly

Deep Image Description Rui-Wei Zhao rw.du.zhao@gmail.com 1 Outline Generating descriptions

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

Advanced Electric Generating Advanced Electric Generating Advanced Electric Generating

Ratchaburi Electricity Generating Holding PCL. Ratchaburi Electricity Generating Holding PCL.

Recursive Definitions Generating Functions Lecture 18 Generating Functions A generating

Phenotypic assays for HBV resistance tests by Maria Fraune AREVIR-GenaFor-Meeting 10.-11.04.2008

Clinical and Phenotypic Data Capture Co-chairs: David Hansen & Melissa Haendel

BMBF consortium: Hepatitis B therapies optimized by phenotypic evaluation (HOPE) Project Aims 1.

Statistical Analysis of Genetic and Phenotypic Data for Breeders: Hands on Practical Sessions

Learning Household Task Knowledge from WikiHow Descriptions Yilun Zhou, Julie A. Shah, Steven

Generating Fine-Grained Open Vocabulary Entity Type Descriptions Rajarshi Bhowmik and Gerard de

Generating Subfields Mark van Hoeij June 15, 2017 Mark van Hoeij Generating Subfields Overview

PMI Honolulu Hawaii Chapter 2020 Sponsorship Value Proposition Contact Sponsorship Director

The Atacama Cosmology Telescope: ACTPol Suzanne Staggs CMB2013 Okinawa, 11 June The Atacama

Handling astrophysical uncertainties on direct detection experiments Anne Green University of

Closing in on mass-degenerate dark matter scenarios with antiprotons and direct detection On the

Patriarchy A system of society or government in which

Asset ownership and female empowerment in Pakistan: Evidence from a natural experiment Sarah Khan,

Racism = Prejudice + Power A Town Hall Discussion on Racism in Astronomy Town Hall Axiom: people

Leading with Innovation NIC Virtual Conference November 9, 2016 2 When Gender-Neutral is Not

Sambuz

Useful Links

Newsletter

Mail Us

Generating Knowledge Networks from Phenotypic Descriptions Fagner - PowerPoint PPT Presentation

Generating Knowledge Networks from Phenotypic Descriptions Fagner Leal Patr cia Cavoto, Julio dos Reis, Andr e Santanch` e pantoja.ti@gmail.com Laboratory of Information Systems University of Campinas Campinas - S ao Paulo -

Phenotypic HCV-NS3-PI-Resistance Test Arevir Meeting, 11./12.04.2013 Daniel Rupp, MD Phenotypic

H2 F2009 H2 F2009 GENERATING GENERATING GENERATING GENERATING FREE CASH FLOW FREE CASH FLOW

Chemspace Phenotypic Screening Set Description Phenotypic screening (PS) is one of the two major

Modelling phenotypic variation Modelling phenotypic variation in monthly weights of in monthly

Deep Image Description Rui-Wei Zhao rw.du.zhao@gmail.com 1 Outline Generating descriptions

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

Advanced Electric Generating Advanced Electric Generating Advanced Electric Generating

Ratchaburi Electricity Generating Holding PCL. Ratchaburi Electricity Generating Holding PCL.

Recursive Definitions Generating Functions Lecture 18 Generating Functions A generating

Phenotypic assays for HBV resistance tests by Maria Fraune AREVIR-GenaFor-Meeting 10.-11.04.2008

Clinical and Phenotypic Data Capture Co-chairs: David Hansen &amp; Melissa Haendel

BMBF consortium: Hepatitis B therapies optimized by phenotypic evaluation (HOPE) Project Aims 1.

Statistical Analysis of Genetic and Phenotypic Data for Breeders: Hands on Practical Sessions

Learning Household Task Knowledge from WikiHow Descriptions Yilun Zhou, Julie A. Shah, Steven

Generating Fine-Grained Open Vocabulary Entity Type Descriptions Rajarshi Bhowmik and Gerard de

Generating Subfields Mark van Hoeij June 15, 2017 Mark van Hoeij Generating Subfields Overview

PMI Honolulu Hawaii Chapter 2020 Sponsorship Value Proposition Contact Sponsorship Director

The Atacama Cosmology Telescope: ACTPol Suzanne Staggs CMB2013 Okinawa, 11 June The Atacama

Handling astrophysical uncertainties on direct detection experiments Anne Green University of

Closing in on mass-degenerate dark matter scenarios with antiprotons and direct detection On the

Patriarchy A system of society or government in which

Asset ownership and female empowerment in Pakistan: Evidence from a natural experiment Sarah Khan,

Racism = Prejudice + Power A Town Hall Discussion on Racism in Astronomy Town Hall Axiom: people

Leading with Innovation NIC Virtual Conference November 9, 2016 2 When Gender-Neutral is Not

Sambuz

Useful Links

Newsletter

Mail Us

Clinical and Phenotypic Data Capture Co-chairs: David Hansen & Melissa Haendel