Open Notebook Computer Science Open Software Day 2012 Vadim - - PowerPoint PPT Presentation


slide-1
SLIDE 1

Vadim Zaytsev, SWAT, CWI 2012 vadim@grammarware.net

Open Notebook Computer Science

Open Software Day 2012

CWI

slide-2
SLIDE 2

Open Science

slide-3
SLIDE 3

Open …

A piece of content or data is open if anyone is free to use, reuse, and redistribute it — subject only, at most, to the requirement to attribute and/or share-alike.

Open Definition: Defining the Open in Open Data, Open Content and Open Services

slide-4
SLIDE 4

Open source software

  • Source code is available
      • to copy and distribute
      • to inspect and analyse
      • to modify and specialise
      • to repurpose and extend
  • “Open source science”
      • term occasionally used in open access discussions
      • not enough for science!
slide-5
SLIDE 5

(Computer) Science

  • Accumulating knowledge
  • Experiments and hypotheses
  • Long line of failures
  • Published success stories
  • Formal methods
  • Assumed/expected rigidity
slide-6
SLIDE 6

Open access

VVZ1530: Is Open Access better off going “Green” or “Gold”? (2012)

  • “Green” route
      • embargo period
      • restricted reuse
  • “Gold” route
      • pre-publication charges up front
      • immediate free unlimited access
  • “Silver” route
      • disclose papers after submission
      • parallel to traditional publishing

still not enough! self-archiving

slide-7
SLIDE 7

Open research

  • Open access + open collaboration
  • Transparency + reproducibility
  • Scientists want credit
      • credit ⇒ priority ⇒ prestige
      • no need to code in anagrams any more
      • enough to be the first on the web
slide-8
SLIDE 8

Open notebook

  • Lab notebook: public, free, indexed by search engines
  • Expose even raw experimental data
      • to reinterpret and reanalyse
      • to repurpose and reuse
  • Variations
      • some content / all content
      • immediate access / delayed access

Jean-Claude Bradley: Open Notebook Science (2006)

slide-9
SLIDE 9

Open notebook in CS/SE

  • Not enough? Too much!
  • Pros
      • nice to use
      • achieves lots of objectives of open science
  • Cons
      • tough to create
      • jeopardises the research itself
slide-10
SLIDE 10

Open Notebook

slide-11
SLIDE 11

Automation: traces

  • Git/Subversion/… commits

  • Tweets
  • Quora answers
  • Papers!
  • Presentations
  • Blog posts
  • Wiki edits
  • Exposed tools
  • Documentation
  • Shared raw data
  • Auxiliary material
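
As an illustration of how one kind of trace from the list above could be harvested automatically, here is a minimal sketch that turns commit log text into trace records. It assumes the log was produced with something like `git log --pretty=format:'%H%x09%aI%x09%s'` (tab-separated hash, ISO 8601 author date, subject). The function name `parse_git_log`, the record fields and the sample lines are hypothetical and not taken from the talk.

```python
from datetime import datetime

# Made-up sample in the assumed "hash<TAB>ISO date<TAB>subject" layout;
# the hashes and messages are placeholders, not real commits.
SAMPLE_LOG = (
    "1111111\t2012-05-01T10:00:00+02:00\tAdd grammar extractor\n"
    "2222222\t2012-05-02T09:30:00+02:00\tFix typo in slides\n"
)


def parse_git_log(text):
    """Turn tab-separated git log lines into a list of trace records."""
    traces = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split at most twice so tabs inside the subject are preserved.
        commit, when, subject = line.split("\t", 2)
        traces.append(
            {
                "kind": "commit",
                "ref": commit,
                "timestamp": datetime.fromisoformat(when),
                "summary": subject,
            }
        )
    return traces


traces = parse_git_log(SAMPLE_LOG)
for trace in traces:
    print(trace["timestamp"].date(), trace["kind"], trace["summary"])
```

Other trace kinds (tweets, wiki edits, blog posts) would need their own collectors; only the record shape is meant to carry over.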
slide-12
SLIDE 12

EWD1300: The notational conventions I adopted, and why

slide-13
SLIDE 13

slide-14
SLIDE 14

slide-15
SLIDE 15

slide-16
SLIDE 16

slide-17
SLIDE 17

slide-18
SLIDE 18

slide-19
SLIDE 19

slide-20
SLIDE 20

slide-21
SLIDE 21

slide-22
SLIDE 22

slide-23
SLIDE 23

slide-24
SLIDE 24

Open notebook entry

  • Unique id
      • VVZxxxx, e.g. VVZ1362
      • Cf. EWDxxx
  • Cf. Edsger Wybe Dijkstra Archive
slide-25
SLIDE 25

Open notebook entry

  • Unique id
      • VVZxxxx, e.g. VVZ1362
      • Cf. EWDxxx
  • Linked to an action
      • commit/tweet/answer/wikiedit/DOI/…
  • Tagged as related
      • to a paper/effort/project/topic
  • Cf. Edsger Wybe Dijkstra Archive
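
The entry structure listed above (unique id, linked action, tags) can be pictured as a small record type. The following is only a sketch under stated assumptions: the class name `NotebookEntry`, its field names, the exact id pattern (the prefix VVZ followed by four digits, as in VVZ1362) and all field values other than the id are hypothetical illustrations, not the author's implementation. The timestamp field follows the "ID with timestamp, action, tags" summary later in the talk.

```python
import re
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Assumption: an id is "VVZ" plus exactly four digits, as in VVZ1362.
ID_PATTERN = re.compile(r"^VVZ\d{4}$")


@dataclass(frozen=True)
class NotebookEntry:
    entry_id: str        # unique id, e.g. "VVZ1362"
    timestamp: datetime  # when the action happened
    action_kind: str     # commit / tweet / answer / wikiedit / DOI / ...
    action_ref: str      # pointer to the action (hash, URL, DOI, ...)
    tags: frozenset = field(default_factory=frozenset)

    def __post_init__(self):
        # Reject ids that do not follow the assumed naming scheme.
        if not ID_PATTERN.match(self.entry_id):
            raise ValueError(f"malformed entry id: {self.entry_id!r}")


# Placeholder values: the date, reference and tags are invented for
# illustration and do not describe the real entry VVZ1362.
entry = NotebookEntry(
    entry_id="VVZ1362",
    timestamp=datetime(2012, 1, 1, tzinfo=timezone.utc),
    action_kind="commit",
    action_ref="example-ref",
    tags=frozenset({"paper:example", "topic:example"}),
)
print(entry.entry_id, entry.action_kind, sorted(entry.tags))
```

Making the record immutable (`frozen=True`) is a design choice of this sketch: a notebook entry documents something that already happened, so it should not change after creation.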
slide-26
SLIDE 26

slide-27
SLIDE 27

Open Questions

slide-28
SLIDE 28

Open notebook usage

  • Streamlined self-archiving

many scientists have already achieved this

slide-29
SLIDE 29

Open notebook usage

  • Streamlined self-archiving
  • Advanced self-archiving

blogs, Quora, wikis, tweets

  • Often needed; rarely implemented
slide-30
SLIDE 30

Open notebook usage

  • Streamlined self-archiving
  • Advanced self-archiving
  • Documentation of the research process

how many tries did it take? how much time?

slide-31
SLIDE 31

Open notebook usage

  • Streamlined self-archiving
  • Advanced self-archiving
  • Documentation of the research process
  • Academic traceability

what sources were used?

slide-32
SLIDE 32

Open notebook usage

  • Streamlined self-archiving
  • Advanced self-archiving
  • Documentation of the research process
  • Academic traceability
  • Mining open notebooks (cf. mining software repositories)

how do others do it?

slide-33
SLIDE 33

Open notebook usage

  • Streamlined self-archiving
  • Advanced self-archiving
  • Documentation of the research process
  • Academic traceability
  • Mining open notebooks (cf. mining software repositories)
  • Linked data

Open structured URI-driven …

slide-34
SLIDE 34

Open notebook usage

  • Streamlined self-archiving
  • Advanced self-archiving
  • Documentation of the research process
  • Academic traceability
  • Mining open notebooks (cf. mining software repositories)
  • Linked data
slide-35
SLIDE 35

Partiality

  • Some data is not to be shared
  • Prepare for publishing immediately
  • Release when safe
  • Where are the borders?
  • Is it “honest”?
slide-36
SLIDE 36

Problems in theory

  • Data theft & content theft
      • partiality
  • Constitutes prior publication
      • don’t use ONS for publishing (cf. Wikipedia)
  • Information flood
      • no solution
slide-37
SLIDE 37

Problems in practice

  • Incomplete automation
      • smarter tagging?
  • Useful querying languages/tools/technologies
      • expose how papers are related
      • connect to other people’s papers
  • Research ongoing, please join
      • grammarware.net
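
To make the querying idea above more concrete, here is a small sketch of one possible query: given tagged notebook entries, find papers related to a given paper through shared tags. Everything in it is an assumption for illustration: the entry ids, the `paper:` and `topic:` tag prefixes, and the function names `index_by_tag` and `related_papers` are hypothetical and not part of the talk.

```python
from collections import defaultdict

# Hypothetical entries as (entry id, set of tags); ids and tags are invented.
ENTRIES = [
    ("VVZ0001", {"paper:A", "topic:grammars"}),
    ("VVZ0002", {"paper:B", "topic:grammars"}),
    ("VVZ0003", {"paper:B", "topic:open-science"}),
]


def index_by_tag(entries):
    """Map each tag to the set of entry ids that carry it."""
    index = defaultdict(set)
    for entry_id, tags in entries:
        for tag in tags:
            index[tag].add(entry_id)
    return index


def related_papers(entries, paper_tag):
    """Papers that share at least one non-paper tag with `paper_tag`."""
    # Collect every non-paper tag used by entries of the given paper.
    shared = set()
    for _, tags in entries:
        if paper_tag in tags:
            shared |= {t for t in tags if not t.startswith("paper:")}
    # Collect paper tags of all entries that carry one of those tags.
    related = set()
    for _, tags in entries:
        if tags & shared:
            related |= {t for t in tags if t.startswith("paper:")}
    related.discard(paper_tag)
    return related


print(sorted(index_by_tag(ENTRIES)["topic:grammars"]))
print(sorted(related_papers(ENTRIES, "paper:A")))
```

A dedicated query language or a linked-data store would replace these loops in practice; the sketch only shows what kind of question the tags make answerable.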
slide-38
SLIDE 38

slide-39
SLIDE 39

slide-40
SLIDE 40

slide-41
SLIDE 41

To summarise

  • “Open” is PD, CC-BY, CC-BY-SA
  • Open source principles for science!

  • Open access for dissemination
  • Open research for collaboration
  • Open notebook for traceability
  • Openness for reproducibility!
  • ID with timestamp, action, tags
  • Many open questions

http://commons.wikimedia.org/wiki/File:Torii_kiyoshige_bando_hikosaburo_ii.jpg

slide-42
SLIDE 42

Credits

  • Designosaur open font (BY)
      • by Sergiy S. Tkachenko
  • Open Notebook Science logos (BY-SA)
      • by Andrew Lang (white background, green/red text)
      • by Shirley Wu (gray background, black frame)
      • by Vadim Zaytsev (vector version)

  • Open Access logo PLoS transparent.svg (PD)
  • Open Source Initiative keyhole.svg (PD)
  • Hevelius and wife.jpg (PD)
slide-43
SLIDE 43

Questions?

vadim@grammarware.net