Chemical Names: Terminological Resources and Corpora Annotation
Corinna Kol´ aˇ rik, Roman Klinger,
- C. M. Friedrich, M. Hofmann-Apitius, J. Fluck
Workshop BERBTM ’08 at LREC ’08 Marrakech, Morocco
26 May 2007
Chemical Names: Terminological Resources and Corpora Annotation - - PowerPoint PPT Presentation
Chemical Names: Terminological Resources and Corpora Annotation Corinna Kol a rik, Roman Klinger, C. M. Friedrich, M. Hofmann-Apitius, J. Fluck Workshop BERBTM 08 at LREC 08 Marrakech, Morocco 26 May 2007 Outline Introduction
26 May 2007
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 2/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 3/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Novel nonnarcotic analgesics with an improved therapeutic ratio. Structure-activity relationships of 8-(methylthio)- and 8-(acylthio)-1,2,3,4,5,6-hexahydro-2,6-methano-3- benzazocines. Conversion of the 8-phenolic 1,2,3,4,5,6-hexahydro-2,6-methano-3-benzazocines to the corresponding 8-thiophenolic analogues was achieved by three different routes. Diazo- tization of 8-amino-2,6-methano-3-benzazocine (2) followed by the reaction with CH3SNa afforded 8-(methylthio)-1,2,3,4,5,6-hexahydro-2,6-methano-3-benzazocine (3). Another route using Grewe cyclization was also examined for the synthesis of 3. As the most ef- fective route, Newman-Kwart rearrangement of benzazocines was selected and closely
dimethylcarbamoyl)thio derivatives (7a-e) in good yields. Reductive cleavage of 7a-e and subsequent methylation or acylations gave the title compounds (3, 8-24). Although anal- gesic activities of sulfur-containing benzazocines decreased compared to the correspond- ing hydroxy compounds , the N-methyl derivative (S-metazocine, 8) showed potent anal- gesic activity.
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 4/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
InChI=1/C9H8O4/c1-6(10)13-8-5-3-2-4- 7(8)9(11)12/h2-5H,1H3,(H,11,12)/f/h11H
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 5/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
g mol
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 6/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 7/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 8/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
(referred to as MeSH C)
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 9/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 10/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
1000 10000 100000 1e+06 1e+07 Pubchem MeSH_T ChEBI DrugBank HMDB MeSH_C KEGG Number
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 11/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
20 40 60 80 100 Pubchem MeSH_T ChEBI DrugBank HMDB MeSH_C KEGG Percentage of overlap with PubChem
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 12/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 13/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 14/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 15/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 16/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 17/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
50 100 150 200 250 300 350 400 450 IUPAC PART TRIVIAL ABB. SUM FAMILY Number of Entities in Test Corpus
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 18/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 19/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
0.2 0.4 0.6 0.8 1 IUPAC PART SUM TRIV ABB FAM All Recall PubChem ChEBI MeSH_C MeSH_T HMDB KEGG_C KEGG_D DrugBank Combined
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 20/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
2e-05 4e-05 6e-05 8e-05 0.0001 0.00012 0.00014 IUPAC PART SUM TRIV ABB FAM All Normalized Recall PubChem ChEBI MeSH_C MeSH_T HMDB KEGG_C KEGG_D DrugBank Combined
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 21/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 22/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 23/25
Outline Introduction Terminological Resources Test Corpus ML-based Recognition Conclusion & Summary
Roman Klinger – Chemical Names: Terminological Resources and Corpora Annotation 24/25