Ontology Matching for Patent Classification
Christoph Quix, Sandra Geisler, Rihan Hai, Sanchit Alekh Ontology Matching Workshop@ISWC, October 21, 2017
Ontology Matching for Patent Classification Christoph Quix, Sandra - - PowerPoint PPT Presentation
Ontology Matching for Patent Classification Christoph Quix, Sandra Geisler, Rihan Hai, Sanchit Alekh Ontology Matching Workshop@ISWC, October 21, 2017 Agenda Motivation Ontology Modeling Overview of the approaches Evaluation
Christoph Quix, Sandra Geisler, Rihan Hai, Sanchit Alekh Ontology Matching Workshop@ISWC, October 21, 2017
Motivation Ontology Modeling Overview of the approaches Evaluation Results Conclusion
OM 2017 2
A significant amount of information about technological innovations are available only in patents Identification of new trends is important for industry & research, short innovation cycles Patents can be helpful to find partners for research projects, especially in interdisciplinary research fields, such as medical engineering (ME)
recommender system for projects in medical engineering
OM 2017 3
www.iem.rwth-aachen.de Wikipedia http://dbis.rwth-aachen.de/mi-Mappa
Patents have a special language and terminology Patent classification scheme IPC is not detailed enough to cover specific areas within a research field
OM 2017 4
The computer program is stored on a computer-readable medium comprising software code adapted to perform the steps of the method 100 according some embodiments when executed on a data-processing apparatus.
Mapping of patents and their inventors to competence fields in medical engineering
Based on
OM 2017 5
Defined by an expert board to identify the innovative areas in medical engineering
Motivation Ontology Modeling Overview of the approaches Evaluation Results Conclusion
OM 2017 6
Coverage of the ME domain in existing ontologies is low Creation of a new ontology according to NeON methodology Modeled as an extension of existing ontologies (refers to existing classes, i.e., equivalence or subclass relationships)
OM 2017 7
OM 2017 8
Motivation Ontology Modeling Overview of the approaches Evaluation Results Conclusion
OM 2017 9
Topic Modeling using LDA Extraction of references to scientific publications from patent data Lookup of publications in PubMed, retrieval of MeSH terms Mapping to CFO by using alignment to CFO
OM 2017 10
CFO: 535 classes MeSH: 281.776 classes Alignment computed by AgreementMaker Light
cardinality filter had best results
OM 2017 11
OM 2017 12
extracted topics to CFO
publications are mapped to CFO according to computed alignment
MeSH, then alignment to CFO is used
Motivation Ontology Modeling Overview of the approaches Evaluation Results Conclusion
OM 2017 13
59 patents have been assigned to CFs individually by experts Multiple CFs can be assigned to a patent (random precision <10%) Approaches 2+3 clearly outperform baseline approach 1 Combined approach has best performance Approach 2+3 are complementary to each other Classification approach with SVM achieves about 80%
OM 2017 14
Quality of mapping to CFO still low (f-measure about 50%, after some recent minor improvements and bug fixing >60%) Publications are annotated with very general MeSH terms (e.g., human, animal) Computed similarities are very low (because of combination by multiplication
normalization on [0,1] Expert mappings highly subjective
OM 2017 15
Motivation Ontology Modeling Overview of the approaches Evaluation Results Conclusion
OM 2017 16
Patent analysis and classification can be an interesting field for ontology engineering & ontology matching Mapping along semantically rich ontology (MeSH) significantly better than direct matching (approach 1 vs. 2) Use of semantic annotations (MeSH terms of publications) can provide additional information Next steps
OM 2017 17