making sense of massive amounts of scientific
play

Making Sense of Massive Amounts of Scientific Publications: The - PowerPoint PPT Presentation

Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner Project Francesco Ronzano, Ana Freire, Diego Saez-Trumper, Horacio Saggion 20 seconds 1 paper The Rise of Open Access Science 04 Oct 2013 Vol. 342,


  1. Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner Project Francesco Ronzano, Ana Freire, Diego Saez-Trumper, Horacio Saggion

  2. 20 seconds… 1 paper The Rise of Open Access Science 04 Oct 2013 Vol. 342, Issue 6154, pp. 58-59 The Scientific Knowledge Miner Project

  3. Information Overload (scientific repositories) The Scientific Knowledge Miner Project

  4. Information Overload (scientific repositories) 90M 24,6M 57M 1M The Scientific Knowledge Miner Project

  5. Sometimes between 2017 and 2021, more than half of the papers available globally are expected to be published as Open Access articles. Lewis, David W. " The inevitability of open access ." College & Research Libraries 73.5 (2012): 493-506. The Scientific Knowledge Miner Project

  6. The peculiarities of research publications TITLE CAPTION ABSTRACT BIBLIOGRAPHIC ENTRY (SUB)SECTION The Scientific Knowledge Miner Project

  7. Scientific publications: claims In order to take full advantage of the knowledge present in scientific publications proper semantic indexing , search and content aggregation approaches, are required. Benefits: § Search of new information on specific scientific problems § Semi-automatic assessment of papers and research proposals § Hypothesis formulation § Tracking of scientific and technological advances § Scientific intelligence § Assisted report and review writing § Question answering § … The Scientific Knowledge Miner Project

  8. The Scientific Knowledge Miner Project (SKM) Facilitate the extraction of knowledge from scientific publications across many disciplines. Improve a variety of use cases such as: - Citation Characterization - Citation Recommendation - Summarization - … Ø KEY: Papers are enriched with structural , linguistic and semantic information Datasets Scientific Better Semantic Scientific Information Publications Software Knowledge applications SKM The Scientific Knowledge Miner Project

  9. The Scientific Knowledge Miner Project (SKM) The SKM approach to the analysis of scientific literature: • Relies on a finer-grained analysis of the contents of publications • Is grounded on the automated characterization of a varied set of semantic aspects of papers, including the rhetorical structure or the purpose of citations. The Scientific Knowledge Miner Project

  10. The Scientific Knowledge Miner Project (SKM) Online Scientific Publications METADATA + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  11. The Scientific Knowledge Miner Project (SKM) CRAWLING Online Scientific Publications METADATA + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  12. Crawling + METADATA Title, author, conference, year, etc. Data Base The Scientific Knowledge Miner Project

  13. The Scientific Knowledge Miner Project (SKM) Online Scientific Publications METADATA + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  14. The Scientific Knowledge Miner Project (SKM) Online Scientific TEXT ANALYSIS Publications METADATA + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  15. Dr. Inventor Text Mining Framework • Integrate and customize text mining tools and on-line services to enable and ease a wide range of scientificpublicationanalyses • Papers are enriched with structural , linguistic and semantic information http://backingdata.org/dri/library/ • Self-contained librarymanaged by • Focused on textual content • Relying on a shared data model (java classes) to representa paper • Exposinga convenient API to access the mined information • Based on to manage textual annotations The Scientific Knowledge Miner Project

  16. Dr. Inventor Text Mining Framework PDF to text converter Text Mining Framework Inline citation spotter Sentence splitter Dr. Inventor Web based reference parser Citation-aware dep. parser Rhetorical annotator Babelfy WSD and Entity Linker Citation Classifier Extractive summarizer VIZ The Scientific Knowledge Miner Project

  17. The Scientific Knowledge Miner Project (SKM) Online Scientific Publications METADATA + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  18. The Scientific Knowledge Miner Project (SKM) Online Scientific CONTENT Publications AGGREGATION METADATA AND INDEXING + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  19. Indexing The Scientific Knowledge Miner Project

  20. The Scientific Knowledge Miner Project (SKM) Online Scientific Publications METADATA + SEMANTIC INFORMATION Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  21. The Scientific Knowledge Miner Project (SKM) Online Scientific Publications METADATA + EXPLORATORY SEMANTIC VISUAL INFORMATION ANALYTICS Indexing Storage Analysis Crawler + METADATA The Scientific Knowledge Miner Project

  22. Analysis http://backingdata.org/dri/viz/ The Scientific Knowledge Miner Project

  23. Use Case 1: Citation Characterization Experiment new metrics: what do others say about one paper? Enrich citation CITATION PURPOSE counts with Criticism semantics Comparison Use Substantiation Basis Neutral + 17 sub-purposes The Scientific Knowledge Miner Project

  24. Use Case 2: Citation Recommendation Recommend similar papers / authors SENTENCE RHETORICAL CATEGORY Background Approach Challenge Outcome Future Work + 3 sub-categories The Scientific Knowledge Miner Project

  25. Use Case 3: Scientific Document Summarization Extractive summarization SENTENCE SUMMARY RELEVANCE (1 to 5 ratings) and HAND-WRITTEN SUMMARY The Scientific Knowledge Miner Project

  26. Conclusions and future work Scientific Knowledge Miner (SKM) aims at facilitating the extraction, aggregation and navigation of knowledge from scientific publications. • Consolidate the SKM publication mining infrastructure • Exploit the semantics of papers to perform large scale investigations of: o Alternative metrics to evaluate a paper based on citation semantics o Semantically motivated recommendation of scientific publications o Summarization of scientific literature The Scientific Knowledge Miner Project

  27. Acknowledgements The Scientific Knowledge Miner Project

  28. Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner Project {francesco.ronzano, ana.freire, diego.saez, horacio.saggion}@upf.edu

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend