scientific papers in social sciences
play

Scientific Papers in Social Sciences Alexander Garcia / Philipp Mayr - PowerPoint PPT Presentation

Simple Semantic Enrichment of Scientific Papers in Social Sciences Alexander Garcia / Philipp Mayr / Leyla Jael Garcia Florida State University / GESIS / biotea.ws Outline Motivation What data do we have? Why we are doing this?


  1. Simple Semantic Enrichment of Scientific Papers in Social Sciences Alexander Garcia / Philipp Mayr / Leyla Jael Garcia Florida State University / GESIS / biotea.ws

  2. Outline  Motivation  What data do we have?  Why we are doing this?  What are we doing? What do we aim to achieve?  RDF generation  Metadata and Content  Content enrichment  Consuming and delivering the data  A first approach SWIB 2012, Köln 2 12/4/2012

  3. Motivation What data do we have?  GESIS  Leibniz Institute for the Social Sciences  Support for the research cycle  Journals: ISI, MDA  MDA – Methods, Data, Analysis  Journal for Empirical Social Science Research  Focus on  Survey methodologies  Methods in empirical social research  Open-access, full-text SWIB 2012, Köln 3 12/4/2012

  4. Motivation Why we are doing this?  The World Wide Web  Dissemination infrastructure: Scientific and non- scientific contributions  Information:  Still locked up in discrete documents  Not interconnected, not machine-processable  RDF technology:  Connectivity tissue  But how does it impact to the scientific communication? SWIB 2012, Köln 4 12/4/2012

  5. Motivation What are we doing? What do we aim to achieve?  Question: How can scientific publications be delivered into the Semantic Web?  Our approach  RDF for research articles  Entry point to the Web of Data  Part of the Linked Open Data  Semantic enrichment  Interoperable with online data  Richer user interface  A different read experience  Interconnected with external related elements  Collaborative environment SWIB 2012, Köln 5 12/4/2012

  6. RDF Generation Metadata and Content http://pdfx.cs.man.ac.uk/ MDA PDF MDA XML RDF BIBO Generation Reference Enrichment Metadata+ Content + References SWIB 2012, Köln 6 12/4/2012

  7. RDF Generation Content enrichment Metadata+ Content + References Automatic Manual Annotation Annotation Automatically Annotated RDF Manually Annotated RDF SWIB 2012, Köln 7 12/4/2012

  8. Lessons learnt  Biotea, a similar project on the biomedical domain  XML to RDF works well  RDF annotation works well but … annotators are not perfect  Format is not translated  bold, italics  Modeling tables is not easy  Dictionary – based entity recognition tools works better  This project  PDF to XML is not perfect SWIB 2012, Köln 8 12/4/2012

  9. Consuming and delivering the data  What does it make possible ?  How similar are two articles?  based on concepts  semantic similarity  What articles use this reference in a section with title “Results”?  Which annotation co-occurs more with this “X” annotation?  Which articles include term “A” but not term “B”? SWIB 2012, Köln 9 12/4/2012

  10. Consuming and delivering the data A first approach SWIB 2012, Köln 10 12/4/2012

  11. Consuming and delivering the data A first approach SWIB 2012, Köln 11 12/4/2012

  12.  Contact  Alex García, alexgarciac@gmail.com  Philipp Mayr, philipp.mayr@gesis.org SWIB 2012, Köln 12 12/4/2012

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend