Email: drdonald@indiana.edu @hoosierdevan WHO AM I? Devan Ray - - PowerPoint PPT Presentation

SLIDE 1

Email: drdonald@indiana.edu @hoosierdevan

SLIDE 2

WHO AM I?

Devan Ray Donaldson
Assistant Professor of Information Science
Department of Information and Library Science
School of Informatics and Computing
Indiana University, Bloomington
Ph.D. in Information Science, University of Michigan
RDA US Data Share Fellow

SLIDE 3

WHY AM I HERE?

Because of Thomas Proffen
Because Frank asked me to come speak :0)

SLIDE 4

MOTIVATION FOR STUDY

To understand perspectives on data sharing in a field that has traditionally focused more on sustaining use of data by those who created them than on enabling reuse of data by others.

SLIDE 5

STUDY DETAILS

Focus groups with:

  • Data consumers (n=3)
  • Data managers (n=5)
  • Data producers (n=5)

SLIDE 6

FINDINGS: WORKFLOW

Data producers generate raw data (unprocessed numbers and descriptions) from which they can construct reduced data (a set with extraneous data removed and more complete descriptions).

Data managers then produce reduced data that harmonizes the unprocessed data with theoretical models, which can be used to create modeled data, or data that demonstrates how results do/do not conform to theoretical models.

Data consumers utilize modeled data to create research and scholarship demonstrating how materials function on an atomic level.

[Figure: workflow diagram. Users: Data Producer, Data Manager, Data Consumer. Type of Data Interaction: Raw Data, Reduced Data, Modeled Data.]

SLIDE 7

FINDINGS: DATA CONSUMERS

Data consumers:

  1. Identified reasons for reusing data
  2. Discussed information they needed to know about data
  3. Articulated the importance of journal articles
  4. Described barriers to reuse
  5. Expressed a desire for discoverability

SLIDE 8

PARTICIPANT CHARACTERISTICS

  • Expressed interest in data reuse
  • 2 research scientists; 1 professor
  • Interests: theory of magnetism, condensed/soft matter physics
  • Multiple years of experience with neutron data and Oak Ridge facilities

SLIDE 10

REASONS FOR REUSING

  • To compare/verify a result against their own measurements
  • To test a new theory using existing data

SLIDE 11

WHAT REUSERS NEED TO KNOW

1) How the data were produced
2) How the sample was prepared
3) What the units of measurement are
4) How the temperature was determined

SLIDE 12

IMPORTANCE OF PUBLICATIONS

1) Journal articles provide context for data
2) Participants articulated interest in reproducing charts and graphs

SLIDE 13

BARRIERS TO REUSE

Technical barriers, e.g., lack of expertise in software

SLIDE 14

DISCOVERABILITY

Consumers of neutron data want to know:

  1. What other measurements have been created for particular problems
  2. Particular characteristics across data sets (e.g., temperature readings)

SLIDE 15

RECOMMENDATIONS

  • Policy recommendations
  • Technical recommendations

SLIDE 16

POLICY RECOMMENDATIONS

Provide Principal Investigators with the option to make their data accessible and openly available if they choose.

SLIDE 17

SYSTEM RECOMMENDATIONS

1) Include metadata about how the data were produced, how the sample was prepared, what the units of measurement are, and how the temperature was determined for every data set.
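As a minimal sketch of what such a per-data-set metadata record could look like: the class name `DatasetMetadata`, its field names, and the `missing_fields` helper below are all hypothetical and are not taken from any facility's actual system. The four fields simply mirror the four items reusers said they need to know.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class DatasetMetadata:
    """Hypothetical record holding the four items reusers asked for."""
    production_method: str = ""    # how the data were produced
    sample_preparation: str = ""   # how the sample was prepared
    units: str = ""                # what the units of measurement are
    temperature_method: str = ""   # how the temperature was determined


def missing_fields(record: DatasetMetadata) -> List[str]:
    """Return the names of fields that are empty or whitespace-only."""
    return [f.name for f in fields(record)
            if not getattr(record, f.name).strip()]


# Illustrative record with two of the four fields filled in (placeholder text).
record = DatasetMetadata(
    production_method="placeholder description",
    units="placeholder description",
)
print(missing_fields(record))  # ['sample_preparation', 'temperature_method']
```

A check like this could run when a data set is deposited, so that incomplete records are flagged before anyone tries to reuse them.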

SLIDE 18

SYSTEM RECOMMENDATIONS

2) Link data to any publications based on or otherwise related to those data.
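One way to picture this linking is a two-way index between data sets and publications. The sketch below is an assumption for illustration only: the class `PublicationLinks`, its methods, and the identifiers are invented, and a real system would likely use persistent identifiers rather than these placeholder strings.

```python
from collections import defaultdict
from typing import List


class PublicationLinks:
    """Hypothetical two-way index between data sets and publications."""

    def __init__(self) -> None:
        self._pubs_by_dataset = defaultdict(set)
        self._datasets_by_pub = defaultdict(set)

    def link(self, dataset_id: str, publication_id: str) -> None:
        """Record that a publication is based on or related to a data set."""
        self._pubs_by_dataset[dataset_id].add(publication_id)
        self._datasets_by_pub[publication_id].add(dataset_id)

    def publications_for(self, dataset_id: str) -> List[str]:
        """Publications linked to a data set, sorted for stable output."""
        return sorted(self._pubs_by_dataset.get(dataset_id, ()))

    def datasets_for(self, publication_id: str) -> List[str]:
        """Data sets linked to a publication, sorted for stable output."""
        return sorted(self._datasets_by_pub.get(publication_id, ()))


# Placeholder identifiers, invented for illustration.
links = PublicationLinks()
links.link("dataset-A", "article-1")
links.link("dataset-A", "article-2")
links.link("dataset-B", "article-2")
print(links.publications_for("dataset-A"))  # ['article-1', 'article-2']
print(links.datasets_for("article-2"))      # ['dataset-A', 'dataset-B']
```

Keeping the index two-way lets a reuser start from either side: from a data set to the articles that give it context, or from an article to the data behind its charts and graphs.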

SLIDE 19

SYSTEM RECOMMENDATIONS

3) Make data more discoverable by allowing characteristics of data to be searchable across data sets.
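A rough sketch of searching one characteristic across data sets, using temperature as the example. Everything here is assumed for illustration: the `CATALOG` entries, the `temperature_k` key, and the `find_by_temperature` function are hypothetical, and the values are invented rather than drawn from real measurements.

```python
from typing import Dict, List

# Hypothetical catalog; identifiers and temperatures are invented.
CATALOG: List[Dict] = [
    {"id": "dataset-A", "temperature_k": 4.0},
    {"id": "dataset-B", "temperature_k": 300.0},
    {"id": "dataset-C", "temperature_k": 10.0},
    {"id": "dataset-D"},  # no temperature recorded, so it cannot be found by it
]


def find_by_temperature(catalog: List[Dict], low_k: float,
                        high_k: float) -> List[str]:
    """Return ids of data sets whose recorded temperature is in [low_k, high_k]."""
    return [
        entry["id"]
        for entry in catalog
        if "temperature_k" in entry and low_k <= entry["temperature_k"] <= high_k
    ]


print(find_by_temperature(CATALOG, 0.0, 20.0))  # ['dataset-A', 'dataset-C']
```

The entry without a recorded temperature never appears in results, which illustrates why searchability depends on the metadata being captured for every data set.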

SLIDE 20

FUTURE RESEARCH

1) Conduct similar studies with other neutron scientists to confirm results
2) Conduct studies of reuse “in real time”

SLIDE 21

ACKNOWLEDGMENTS

Thomas Proffen, Oak Ridge National Laboratory
Shawn Martin, Doctoral Student, Indiana University

SLIDE 22

Email: drdonald@indiana.edu @hoosierdevan