Doctoral theses research data and metadata documentation ETD 2013 - - PowerPoint PPT Presentation

doctoral theses research data and metadata documentation
SMART_READER_LITE
LIVE PREVIEW

Doctoral theses research data and metadata documentation ETD 2013 - - PowerPoint PPT Presentation

Doctoral theses research data Doctoral theses research data and metadata documentation ETD 2013 Hong Kong 16th International Symposium on Electronic Theses and Dissertations 25.09.2013 Maxi Kindling Berlin School of Library and


slide-1
SLIDE 1

Doctoral theses’ research data

Doctoral theses’ research data and metadata documentation

ETD 2013 Hong Kong 16th International Symposium on Electronic Theses and Dissertations 25.09.2013

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

1

25.09.2013

slide-2
SLIDE 2

Doctoral theses’ research data

Agenda

  • Motivation
  • Research data
  • Context
  • Examples
  • Survey results
  • Aspects

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

2

25.09.2013

slide-3
SLIDE 3

Doctoral theses’ research data

Motivation

  • Research data curation and sharing in times of Open Science
  • Supplementary material & enhanced publications
  • Less examples
  • HU Berlin concept to archive and publish research data

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

3

25.09.2013

slide-4
SLIDE 4

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

  • 6 weeks in early 2013
  • Target group was academic staff at HU Berlin (~2000 persons)
  • Overall response rate of ~24 % (499)
  • 117 PhD students from most disciplines (departments and institutes)

– Chemistry (13) – Psychology (11) – Social sciences (9) – History (8) – Biology (7)

  • Important: Participants were not asked to rely answer to a specific type of

research data, e.g. only research data based on ETDs

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

4

25.09.2013

slide-5
SLIDE 5

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

a) Where does your research data derive from? Please indicate your main sources.

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

5

25.09.2013

10 20 30 40 50 60 Text documents Experiments Surveys and interviews Observations Statistics and reference data Simulations Other Images Logfiles and usage data

slide-6
SLIDE 6

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

b) Please indicate data types more specifically.

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

6

25.09.2013

10 20 30 40 50 60 70 Other Video recordings Multi-dimensional visualisations and models Audio recordings Data specific for your field or instrument Images Programmes and applications Databases Spreadsheets Texts

slide-7
SLIDE 7

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

c) Please indicate specific data types you work with.

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

7

25.09.2013

2 4 6 8 10 12 14 16 Climate modelling Text-corpora / annotations Topographic data Satellite imagery Remote sensing GIS data Patient data Other Surveys Spectra Statistic analysis Measurement series

slide-8
SLIDE 8

Doctoral theses’ research data

Research data definition approach

“[…] digital data being a (descriptive) part or the result of a research process. This process covers all stages of research, ranging from research data generation, which may be in an experiment in the sciences, an empirical study in the social sciences or observations of cultural phenomena, to the publication of research results. Digital research data occur in different data types, levels of aggregation and data formats, informed by the research disciplines and their methods. With regards to the purpose of access for use and re-use of research data, digital research data are of no value without their metadata and proper documentation describing their context and the tools used to create, store, adapt, and analyze them.” (Kindling & Schirmbacher, 2013)

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

8

25.09.2013

slide-9
SLIDE 9

Doctoral theses’ research data

Relevance and context: Policitical Strategies

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

9

25.09.2013

http://oa.mpg.de/lang/de/berlin-prozess/berliner-erklarung/ http://www.consilium.europa.eu/uedocs/cms_Data/docs/pressdata/en/intm/138118.pdf

slide-10
SLIDE 10

Doctoral theses’ research data

Relevance and context: „Academic Fraud“

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

10

25.09.2013

http://royalsociety.org/uploadedFiles/Royal_Society_Content/policy/projects/sape/ 2012-06-20-SAOE.pdf http://retractionwatch.wordpress.com/2013/06/12/glaxo-asks-nature-medicine-to- retract-paper-by-fired-company-scientist/

slide-11
SLIDE 11

Doctoral theses’ research data

Relevance and context: Research Integrity

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

11

25.09.2013

http://www.esf.org/index.php?eID=tx_nawsecuredl&u=0&file=fileadmin/be_user/C EO_Unit/MO_FORA/MOFORUM_ResearchIntegrity /Code_Conduct_ResearchIntegrity.pdf&t=1367499587&hash=ac6e154c2fed65fa0d6 54b467ffafb0c0d9ef44d http://www.dfg.de/download/pdf/dfg_im_profil/reden_stellungnahmen/download/ empfehlung_wiss_praxis_0198.pdf

slide-12
SLIDE 12

Doctoral theses’ research data

DFG (1998) Guidelines on safeguarding good scientific practice

12

25.09.2013

http://www.dfg.de/download/pdf/dfg_im_profil/reden_stellungnahmen/download/empfehlung_wiss_praxis_0198.pdf

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

slide-13
SLIDE 13

Doctoral theses’ research data

Relevance and context: Journal Policies

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

13

25.09.2013

http://www.nature.com/scientificdata/for-authors/data-deposition-policies/ http://www.plosone.org/static/policies.action#sharing

slide-14
SLIDE 14

Doctoral theses’ research data

Relevance and context: Impact on universities

  • Increasing impact on universities worldwide
  • Research data management services and support are ahead in UK, USA

and Australia (evaluation in HU seminar)

  • Since some years also a „hot topic“ in German LIS domain on university

level

  • Some German universities already have research data archiving included in

their policy

  • HU Berlin did the first extensive survey on research data management at

German universities

  • Interviews showed that some departments at HU are planning to mandate

research data archiving for ETDs

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

14

25.09.2013

slide-15
SLIDE 15

Doctoral theses’ research data

Research data features

  • Research object
  • Research result
  • Scholarly communication (published data)
  • Proof
  • Retracability/Verifiability
  • Impact
  • ETDs: Review and evaluation (exam)
  • Re-use
  • Innovation
  • Financial benefits

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

15

25.09.2013

slide-16
SLIDE 16

Doctoral theses’ research data

Research data metadata and documentation

General aspects

  • Interpretation
  • Reproducability
  • Re-Use
  • Less motivation to data

documentation

  • Metadata standards for

most disciplines ETDs

  • Research Integrity (exam)
  • Citability (PID)
  • Findability
  • Extensive data description

(part of ETDs), but diverse

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

16

25.09.2013

slide-17
SLIDE 17

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

e) Who is currently responsible for storing, back up or archiving your research data?

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

17

25.09.2013

20 40 60 80 100 120 Library staff Project manager PhD student External service provider Other My assistant CMS staff Special staff Myself

slide-18
SLIDE 18

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

g) Have you ever deposited your research data in a data archive or repository?

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

18

25.09.2013

10 20 30 40 50 60 No answer Yes. No, but I intend doing so. No, I do not intend doing so in the near future. No, I was not aware of such option.

slide-19
SLIDE 19

Doctoral theses’ research data

HU Berlin Phd candidates storing data in…

A sample of journals and data repositories

  • www.plosone.org/
  • http://www.myexperiment.org/
  • Dalton Transaction
  • Organometallics
  • SHARE European Social Survey
  • Scifinder, web of knowledge
  • Angewandt chemie, chemical communications, chemistry-an European Journal, Journal
  • f the American chemical Society, Journal of Organic Chemistry"
  • DRYAD
  • Dropbox (3)
  • prl.aps.org
  • Polylog - Zeitschrift für Interkulturelle Philosophie, Deutsche Zeitschrift für Philosophie
  • http://sfb649.wiwi.hu-berlin.de/fedc/data.php
  • gesis
  • SESS

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

19

25.09.2013

slide-20
SLIDE 20

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

f) Would you be generally willing to deposit particular research data to an archive or to share it?

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

20

25.09.2013

5 10 15 20 25 30 35 40 Most likely. More likely. I have to consider this option more carefully. Less likely. Not likely.

slide-21
SLIDE 21

Doctoral theses’ research data

Survey on Research Data Management at HU Berlin

h) What support or services do you wish to have at HU?

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

21

25.09.2013

10 20 30 40 50 60 70 80 Other I have no need for support or services. Support on compiling a data management plan if requested by a research funder. Advice & guidance on general research data management issues. Advice & guidance on specific issues (e.g. when submitting your research data to a journal along with a manuscript). Advice & guidance on citing and publishing (your own) data. Advice & guidance on technical issues (e.g. metadata, standards, long-term archiving/preservation). Advice & guidance on legal issues (z.B. access restrictions, sensible data, licensing). Secured and backed-up storage for my research data.

slide-22
SLIDE 22

Doctoral theses’ research data

edoc Server HU Berlin

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

22

25.09.2013

http://edoc.hu-berlin.de/docviews/abstract.php?lang=&id=38578 http://edoc.hu-berlin.de/dissertationen/hoefler-carolin-2009-09- 28/PDF/hoefler_anhang.pdf

slide-23
SLIDE 23

Doctoral theses’ research data

edoc Server HU Berlin

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

23

25.09.2013

http://edoc.hu-berlin.de/docviews/abstract.php?lang=&id=38578 http://edoc.hu-berlin.de/dissertationen/hoefler-carolin-2009-09- 28/PDF/hoefler_anhang.pdf

slide-24
SLIDE 24

Doctoral theses’ research data

SULB

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

24

25.09.2013

http://scidok.sulb.uni-saarland.de/volltexte/2009/2416/

slide-25
SLIDE 25

Doctoral theses’ research data

Quocosa (Saxony)

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

25

25.09.2013

slide-26
SLIDE 26

Doctoral theses’ research data

University Bremen & PANGAEA

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

26

25.09.2013

slide-27
SLIDE 27

Doctoral theses’ research data

University Bremen & PANGAEA

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

27

25.09.2013

http://nbn-resolving.de/urn:nbn:de:gbv:46-ep000102570

slide-28
SLIDE 28

Doctoral theses’ research data

University Bremen & PANGAEA

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

28

25.09.2013

slide-29
SLIDE 29

Doctoral theses’ research data

ePIC & PANGAEA

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

29

25.09.2013

slide-30
SLIDE 30

Doctoral theses’ research data

ePIC & PANGAEA

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

30

25.09.2013

slide-31
SLIDE 31

Doctoral theses’ research data

University of Bielefeld

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

31

25.09.2013

http://pub.uni-bielefeld.de/publication/2486950

slide-32
SLIDE 32

Doctoral theses’ research data

University of Bielefeld

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

32

25.09.2013

http://pub.uni-bielefeld.de/publication/2486950

slide-33
SLIDE 33

Doctoral theses’ research data

Examples: Summary

  • ETD full text including graphs, pictures, tables etc.
  • ETD full text file + pdf attachments mostly including graphs, pictures and

tables)

  • ETD full text + attachment (zip-files)
  • ETD full text + linked research data, metadata on ETD
  • ETD full text + linked research data in institutional repository, metadata on ETD

and research data

  • ETD full text + linked research data in (multi-)disciplinary repository (with PID

and metadata/data documentation), metadata on ETD and research data

However, at least in Germany less examples showing ETDs interlinked with research data.

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

33

25.09.2013

slide-34
SLIDE 34

Doctoral theses’ research data

Possible university strategies

  • Institutional research data repository for archiving

– With possibility to publish research data or not – Mandating research data deposit or offering optional deposit service – Zipped data packages – Mostly possible, but not often used

  • (Multi-)Disciplinary repositories recommended by university

– But still for a lot of ‘small sciences’ there is no such infrastructure; very likely an institutional infrastructure is needed – Cross-linking workflow for enhanced publications – Not yet common

  • Challenges

– Disciplinary differences and heterogeneity of research data and research data infrastructures

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

34

25.09.2013

slide-35
SLIDE 35

Doctoral theses’ research data

re3data.org

Maxi Kindling Institut für Bibliotheks- und Informationswissenschaft Humboldt-Universität zu Berlin

35

18.09.2013

re3data.org

Registry of Research Data Repositories

slide-36
SLIDE 36

Doctoral theses’ research data

Research Data Alliance (RDA)

Maxi Kindling Institut für Bibliotheks- und Informationswissenschaft Humboldt-Universität zu Berlin

36

18.09.2013

slide-37
SLIDE 37

Doctoral theses’ research data

Digital Curation Center (DCC)

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

37

25.09.2013

slide-38
SLIDE 38

Doctoral theses’ research data

Some aspects…

  • Research data definition depending on disciplinary characteristics!
  • Workflows considering universities‘ settings

– Mandating Policy – Documented process in Phd guidelines – Participating committees – Self-deposit? Delivery workflow – Open Access, Embargo, Access restrictions

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

38

25.09.2013

slide-39
SLIDE 39

Doctoral theses’ research data

Some aspects…

  • Metadata and data documentation

– Cross-linking between ETD and research data (minimum dc:relations,

  • ptimal: „semantic“ linking and exposure via e.g. OAI-ORE)

– ETDs data set provided with PIDs – ETDs data description parts – DataCite Metadata: Core Element Set (Identifier, identifierType, Creator, creatorName) – Visibility via NDLTD Union Catalogue, ProQuest etc.

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

39

25.09.2013

slide-40
SLIDE 40

Doctoral theses’ research data

Some aspects…

  • Repository infrastructure

– Institutional or disciplinary/multidisciplinary? – Evaluation of repositories as support service from the library? – Long term preservation – Trust and sustainability – Restrictions of file formats, space, … – Legal aspects – Upload/Ingest via data package file (OAIS SIP)?

  • Training

– Awareness – Workshops and training tutorials in Graduate Schools and Phd courses

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

40

25.09.2013

slide-41
SLIDE 41

Doctoral theses’ research data

Thank you very much for your attention! (maxi.kindling@hu-berlin.de, twitter: maxi_ki)

Thanks to my colleagues from Electronic Publishing Working Group at Humboldt-Universität zu Berlin: Niels Fromm, Sabine Henneberger Peter Schirmbacher, Elena Simukovic, Paul Vierkant, Dennis Zielke German repository community: Götz Hatop, Ulrich Herb, Marten Hoogerwerf, Najko Jahn, Jens Klump, Angela Schäfer, Jochen Schirrwagen, Michaela Voigt, Jan Weiland, Karin Zwiesler

Maxi Kindling Berlin School of Library and Information Science Humboldt-Universität zu Berlin

41

25.09.2013

slide-42
SLIDE 42

Doctoral theses’ research data

References

  • Collie, W. Aaron & Witt, Michael (2011) A practice and value proposal for

doctoral dissertation data curation. In: The International Journal of Digital Curation 2 (6), 165-175.

  • Kindling, Maxi, Schirmbacher, Peter & Simukovic, Elena (2013)

Forschungsdatenmanagement an der Humboldt-Universität zu Berlin. In:

  • LIBREAS. Library Ideas 23. = http://www.libreas.eu. Will be published soon.
  • Kindling, Maxi & Schirmbacher, Peter (2013) Die „digitale Forschungswelt“ als

Gegenstand der Forschung. In: Information : Wissenschaft und Praxis, 64(2/3): 127-136. = 10.1515/iwp-2013-0017

  • NISO (2013) Recommended Practices for Online Supplemental Journal Article

Materials- = http://www.niso.org/apps/group_public/download.php/10055/RP-15- 2013_Supplemental_Materials.pdf

  • Pampel, Heinz et al. (2013) Making research data repositories visible: the

re3data.org registry. In: PeerJ Preprints 1:e21v1 = http://dx.doi.org/10.7287/peerj.preprints.21v1

Maxi Kindling Institut für Bibliotheks- und Informationswissenschaft Humboldt-Universität zu Berlin

42

18.09.2013