annotation analytics for gene and protein functions
play

Annotation Analytics for Gene and Protein functions Nigam Shah, - PowerPoint PPT Presentation

Annotation Analytics for Gene and Protein functions Nigam Shah, MBBS, PhD nigam@stanford.edu Annotation service Process textual metadata to automatically tag text with as many ontology terms as possible. 107 million calls, ~1000 GB data


  1. Annotation Analytics for Gene and Protein functions Nigam Shah, MBBS, PhD nigam@stanford.edu

  2. Annotation service Process textual metadata to automatically tag text with as many ontology terms as possible. 107 million calls, ~1000 GB data

  3. Resource index Won 1 st prize at the 2010 Semantic Web Challenge @ ISWC Pubmed Abstracts Adverse Events (AERS) GEO : Clinical Trials Drug Bank

  4. Understanding the genome • Units of study range in length from ‘whole chromosome’ to ‘singe nucleotide’ • E.g. three copies of Chr. 21  Down’s syndrome • The focus in on finding the functional associations of strings in the genome

  5. Generic GO based analysis routine Genome Study Set • Get annotations for each gene in a set • Count the occurrence of each annotation term in the study set • Count the occurrence of that term in some reference set (whole genome?) • P-value for how surprising their overlap is. Reference set

  6. Annotation Analytics Landscape SNOMED-CT NCIT ICD-9 ? MeSH Genes2MSH : Drugs, Chemicals Cell Type Human Disease Gene Ontology GOPubMed Grant Drug Health Indicator Warehouse Gene Patient Paper Sets datasets Sets Sets Sets Sets

  7. Mutation enrichment

  8. Profiling a set of Aging genes 261 Age-related genes Genome Disease Ontology ~ 30% of genome

  9. Annotation Analytics Landscape Mutations SNOMED-CT 1. Units of study range in length from ‘whole chromosome’ to NCIT ‘singe nucleotide’ ICD-9 What else 2. The focus in on finding the functional associations of strings MeSH Genes2MSH in the genome can we do? : 3. For each type of “string”, there Drugs, Chemicals will be some textual descriptions Cell Type that you can process computationally . Aging Human Disease Gene Ontology GOPubMed Drug Health Indicator Warehouse Gene Paper Patient Grant Sets datasets Sets Sets Sets Sets

  10. The team @ www.bioontology.org/project-team NIH Roadmap grant U54 HG004028 10

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend