The application of Semantic Publishing technologies in the Science of Science research domain for the Humanities field



SLIDE 1

The application of Semantic Publishing technologies in the Science of Science research domain for the Humanities field

Ivan Heibi

Digital Humanities Advanced Research Centre (DHARC), Department of Classical Philology and Italian Studies, University of Bologna, Bologna (Italy)

ivan.heibi2@unibo.it

The 6th ISSI Doctoral Forum, September 2

ISSI2019, September 2-5, 2019, Sapienza University, Rome, Italy

SLIDE 2

About me

…-2017: Bachelor's and Master's degrees in Computer Science at the University of Bologna, Italy

2017-18: Research fellow under the supervision of Silvio Peroni on the “OpenCitations Enhancement Project”, funded by the Alfred P. Sloan Foundation

2018-...: Ph.D. student at the Department of Classical Philology and Italian Studies, University of Bologna, Bologna, Italy. Research topic: The application of Semantic Publishing technologies in the Science of Science research domain

SLIDE 3

Background

Science of Science

Quantify and predict scientific research and its resulting outcomes

Social science, network analysis, large-scale data analysis

Semantic Publishing

The application of Semantic Web technologies in the scholarly publishing domain.

for Humanities

SLIDE 4

Background: Semantic Web technologies essentials

  • What is it?

An extension of the World Wide Web: a web of data that can be processed by machines.

  • How is that possible?

Resources on the web are described using the RDF data model: each statement is a triple (subject, predicate, object), and a set of triples forms a graph.

[Figure: an RDF triple drawn as a graph. Subject: Document A; predicate: cites; object: Document B.]
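The triple model above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the document IRIs are hypothetical placeholders on example.org, the `match` helper is an invented name, and a real project would more likely rely on an RDF library than on bare tuples. The predicate IRI is the `cites` property of CiTO.

```python
from typing import Iterator, Optional, Set, Tuple

# A triple is (subject, predicate, object); a graph is a set of triples.
Triple = Tuple[str, str, str]

# The CiTO "cites" property. The document IRIs below are hypothetical.
CITES = "http://purl.org/spar/cito/cites"
DOC_A = "https://example.org/document-a"
DOC_B = "https://example.org/document-b"
DOC_C = "https://example.org/document-c"

graph: Set[Triple] = {
    (DOC_A, CITES, DOC_B),  # "Document A cites Document B"
    (DOC_C, CITES, DOC_B),
}


def match(
    triples: Set[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> Iterator[Triple]:
    """Yield triples matching the pattern; None acts as a wildcard."""
    for triple in sorted(triples):  # sorted only to make the order stable
        if (
            (s is None or triple[0] == s)
            and (p is None or triple[1] == p)
            and (o is None or triple[2] == o)
        ):
            yield triple


# Which documents cite Document B?
citing_b = [subject for subject, _, _ in match(graph, p=CITES, o=DOC_B)]
print(citing_b)
```

Because every statement has the same three-part shape, a machine can answer "who cites B?" with a single pattern over the graph, which is the point of describing resources this way.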

Semantic publishing: general definition

Shotton, D. (2009). Semantic publishing: the coming revolution in scientific journal publishing. DOI: https://doi.org/10.1087/2009202

Notable examples: OpenCitations, Wikidata/WikiCite, Springer Nature SciGraph, and Microsoft Academic Knowledge Graph

SLIDE 5

Analysis strategy: Citations in Humanities

  • What are the common formalisms and constructional patterns adopted for citations inside Humanities documents (e.g. a classification according to the document sections)?

  • What are the reasons to cite other works, i.e. the citation function [1], and which are the most important ones?

Purpose

[1] Teufel, S., Siddharthan, A., & Tidhar, D. (2006). An annotation scheme for citation function.

SLIDE 6
  • Resources

Focus on journal article-oriented study fields. Books seem to be the most cited document type in humanities fields, yet they are less available. Literature and History are two reasonable study fields to take into consideration.

  • Limitations

The lower proportion of journal articles cited in the humanities compared with science-oriented domains; the local relevance; the division between publications directed toward researchers and writings directed to a public audience; etc.

  • Methodologies

Semantic Publishing methods, e.g. using CiTO (the Citation Typing Ontology) to semantically represent bibliographic citations, or the DataCite Ontology for resource identification, etc.

  • Desirable outcomes

Broadening the knowledge of citations and their usage will help future researchers improve their work and effectively address their research questions; developing new applications that help the community make practical use of the discoveries made.

The Research
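As a rough illustration of the methodology mentioned above, the sketch below writes a citation with an explicit function as N-Triples, using a CiTO property for the citation and the DataCite Ontology pattern for a DOI identifier. It is a sketch under assumptions, not the project's implementation: the resource IRIs and the DOI are made up, the helper names are invented, and the identifier pattern (hasIdentifier, usesIdentifierScheme, hasLiteralValue) is assumed from the SPAR ontologies rather than taken from these slides.

```python
from typing import List

# Namespaces of the SPAR ontologies assumed in this sketch.
CITO = "http://purl.org/spar/cito/"
DATACITE = "http://purl.org/spar/datacite/"
LITERAL = "http://www.essepuntato.it/2010/06/literalreification/"


def iri(value: str) -> str:
    """Wrap an IRI in angle brackets, as N-Triples requires."""
    return "<" + value + ">"


def literal(value: str) -> str:
    """Quote a plain string literal, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'


def statement(subject: str, predicate: str, obj: str) -> str:
    """Join three already-formatted terms into one N-Triples line."""
    return f"{subject} {predicate} {obj} ."


def typed_citation(citing: str, cited: str, function: str) -> str:
    """One triple whose CiTO predicate states why `citing` cites `cited`."""
    return statement(iri(citing), iri(CITO + function), iri(cited))


def doi_identifier(resource: str, identifier_node: str, doi: str) -> List[str]:
    """Attach a DOI to a resource through a separate identifier node."""
    return [
        statement(iri(resource), iri(DATACITE + "hasIdentifier"), iri(identifier_node)),
        statement(iri(identifier_node), iri(DATACITE + "usesIdentifierScheme"), iri(DATACITE + "doi")),
        statement(iri(identifier_node), iri(LITERAL + "hasLiteralValue"), literal(doi)),
    ]


# Hypothetical article IRIs and DOI, for illustration only.
ARTICLE_A = "https://example.org/article-a"
ARTICLE_B = "https://example.org/article-b"

lines = [typed_citation(ARTICLE_A, ARTICLE_B, "usesMethodIn")]
lines += doi_identifier(ARTICLE_A, "https://example.org/id/1", "10.1234/example-a")
print("\n".join(lines))
```

Replacing `usesMethodIn` with another CiTO property such as `supports` or `critiques` changes the recorded citation function without changing the structure of the data.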

SLIDE 7

Project workflow

1. Learning from previous approaches

Bibliometric and scientometric techniques from the related communities; study the past approaches and methods adopted to overcome the limitations of this research.

2. Defining the datasets, resources and tools to use

  • Dataset: OpenCitations Indexes, mainly COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations.

  • Tool: DIPAM, a Dashboard Interface for Python-based Applications Mashup.

  • Tool: OSCAR, the OpenCitations RDF Search Application.

3. Answering the research questions

4. Building applications to highlight the discoveries made
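To give a feel for what DOI-to-DOI citation data allows, the sketch below counts incoming citations per cited DOI from a list of (citing, cited) pairs. The DOIs are made up, the function names are invented, and the in-memory list only stands in for data that would in practice come from an index such as COCI; nothing here reflects the actual COCI file layout or API.

```python
from collections import Counter
from typing import Iterable, Tuple

# Hypothetical (citing DOI, cited DOI) pairs, standing in for index data.
citations = [
    ("10.1234/a", "10.1234/c"),
    ("10.1234/b", "10.1234/c"),
    ("10.1234/a", "10.1234/b"),
    ("10.1234/A", "10.1234/b"),  # same link as above, different letter case
]


def normalize_doi(doi: str) -> str:
    """DOIs are case-insensitive, so compare them in lowercase."""
    return doi.strip().lower()


def citation_counts(pairs: Iterable[Tuple[str, str]]) -> Counter:
    """Count distinct incoming citations for each cited DOI."""
    unique_links = {
        (normalize_doi(citing), normalize_doi(cited)) for citing, cited in pairs
    }
    return Counter(cited for _, cited in unique_links)


counts = citation_counts(citations)
for doi, received in counts.most_common():
    print(doi, received)
```

Deduplicating the links before counting keeps a citation recorded twice, or with different letter case, from inflating the totals.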

Heibi, I., Peroni, S., Ferri, P., & Pareschi, L. (2019, June 27). catarsi/mitao: MITAO first release (Version v1.1-beta). Zenodo. http://doi.org/10.5281/zenodo.3258328

Heibi, I., Peroni, S., & Shotton, D. Enabling text search on SPARQL endpoints through OSCAR. Data Science, (Preprint), 1-23. DOI: https://doi.org/10.3233/DS-190016

Heibi, I., Peroni, S., & Shotton, D. (2019). COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations. arXiv:1904.06052

SLIDE 8

Thank you for your attention

The application of Semantic Publishing technologies in the Science of Science research domain for the Humanities field

Ivan Heibi

Digital Humanities Advanced Research Centre (DHARC), Department of Classical Philology and Italian Studies, University of Bologna, Bologna (Italy) ivan.heibi2@unibo.it – @ivanheib – https://ivanhb.github.io