Make it easy - integration of data description in the research - - PowerPoint PPT Presentation

make it easy integration of data description in the
SMART_READER_LITE
LIVE PREVIEW

Make it easy - integration of data description in the research - - PowerPoint PPT Presentation

Sibylle Hermann, Dorothea Iglezakis, Anett Seeland Make it easy - integration of data description in the research process 11. June 2019 University of Stuttgart Einfhrung Anforderungen Umsetzung Zusammenfassung Beispiel:


slide-1
SLIDE 1

Make it easy - integration of data description in the research process

  • 11. June 2019

University of Stuttgart

Sibylle Hermann, Dorothea Iglezakis, Anett Seeland

slide-2
SLIDE 2

11 Universität Stuttgart 29.03.2019 Art Gitter Randbedingung Integration Filterung Dämpfu x y z char. fest extrap. period. Verfahren ∆t Ordnung Breite St grob mittel fein grob mittel fein grob mittel fein DNS

x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 x x x

  • RK4

0,001 10 12 … … … x x x

  • RK4

0,001 10 12 … … … x x x

  • RK4

0,001 10 12 … … … x x x

  • RK4

0,001 10 12 … … … x x x

  • RK4

0,001 10 12 … … … x x x

  • RK4

0,001 10 24

Einführung – Anforderungen – Umsetzung – Zusammenfassung Beispiel: Direkte numerische Simulation einer turbulenten Grenzschichtströmung Vorbereitung Vorbereitung

  • Projekt
  • Bestimmung
  • Gitter (N=33)
  • Randbedingung (N=33∙3)
  • Numer. Parameter (N=33∙3∙23)

→ O(103) Simulationen Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 1 / 13

slide-3
SLIDE 3

What the User doesn’t like to do

  • Publish data because it is not yet common in engineering

science

  • Spend time with documentation

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 2 / 13

slide-4
SLIDE 4

What the User Needs

  • Manage a lot of data
  • Find saved data easily
  • Browse data sets
  • Change data sets dynamically
  • Record metadata easily
  • Link results with simulaions
  • Link data sets from different simulations
  • Give controlled access

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 3 / 13

slide-5
SLIDE 5

Metadata

What our users want to search for (apart from Author, Year)

  • Variables – measured and controlled
  • Parameters of the used method
  • Parameters of the observed system

What our users want to document from their research process

  • Methods and workflows
  • Software and computing environments
  • Instruments
  • Parameters and assumptions

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 4 / 13

slide-6
SLIDE 6

EngMeta

A Metadata Schema for Engineering Science

Schembera & Iglezakis “The Genesis of EngMeta-A Metadata Model for Research Data in Computational Engineering”, In: Research Conference on Metadata and Semantics Research, p127–132, 2018, Springer. Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 5 / 13

slide-7
SLIDE 7

Local Data Management – Prerequisite for Open Data

Idea Adding metadata to the data as early in the process and as easy as possible Approach Using a data repository primarily as metadata store and tools around it for smooth interaction

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 6 / 13

slide-8
SLIDE 8

DaRUS

Data Repository of the University of Stuttgart

Based on Dataverse

  • Open source research data repository software
  • Repository hosts multiple virtual archives called Dataverses

Image: http://guides.dataverse.org/en/latest/user/dataverse-management.html, Access: 6/7/2019 Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 7 / 13

slide-9
SLIDE 9

Challenge I: Automation

Ingest of (Meta)data

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 8 / 13

slide-10
SLIDE 10

Challenge II: Handling of Large Files

Dataverse not designed for large files

  • Users experienced frozen UI and timeouts

→ Use REST API for files > 2 GB

  • Trade-off between timeout configuration and available threads

→ Introduce 2nd thread pool in Glassfish → Uploads around 100 GB possible

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 9 / 13

slide-11
SLIDE 11

Challenge II: Handling of Large Files

  • Currently under development
  • In planning
  • Connection of object storage to tape library
  • Extend Dataverse to support different storage classes

(Download vs Provide-Buttons)

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland10 / 13

slide-12
SLIDE 12

Outlook: Different Data Overview Needed

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland11 / 13

slide-13
SLIDE 13

Summary

  • Starting early in the process means less effort at the end
  • To make it easy is still a challenge
  • Automation is a key requirement

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland12 / 13

slide-14
SLIDE 14

Thank you!

FoKUS E-Mail: URL:

fokus@izus.uni-stuttgart.de https://www.izus.uni-stuttgart.de/en/fokus/

DaRUS URL.: https://www.izus.uni-stuttgart.de/en/fokus/darus/

Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland13 / 13