National Institutes of Health • U.S. Department of Health and Human Services
The NIEHS Data Commons
Deep Patel, Mike Conway
Office of Data Science, National Institute of Environmental Health Sciences
National Institutes of Health, U.S. Department of Health and Human Services
The NIEHS Office of Data Science Who
Develop a standards-based commons
laboratories, including next-gen sequencing data.
analysis
Manage metadata for discoverability and long-term usability
actionable policies
discovery and re-use
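As an illustration of metadata that supports both discovery and actionable policy, the sketch below holds metadata as attribute-value-unit (AVU) triples, the form iRODS uses, and checks them against a template. The template contents, attribute names, and the `validate` helper are hypothetical and are not taken from the NIEHS Data Commons; this is a minimal sketch under those assumptions, not the project's implementation.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set


@dataclass(frozen=True)
class AVU:
    """One attribute-value-unit triple, the shape iRODS uses for metadata."""
    attribute: str
    value: str
    unit: str = ""


# Hypothetical template: required attribute -> allowed values.
# None means any non-empty value is accepted.
SEQUENCING_TEMPLATE: Dict[str, Optional[Set[str]]] = {
    "project_id": None,
    "assay_type": {"RNA-seq", "ChIP-seq", "WGS"},
    "organism": None,
}


def validate(avus: List[AVU], template: Dict[str, Optional[Set[str]]]) -> List[str]:
    """Return a list of problems; an empty list means the metadata conforms."""
    present: Dict[str, List[str]] = {}
    for avu in avus:
        present.setdefault(avu.attribute, []).append(avu.value)

    problems: List[str] = []
    for attribute, allowed in template.items():
        values = [v for v in present.get(attribute, []) if v.strip()]
        if not values:
            problems.append(f"missing required attribute: {attribute}")
            continue
        if allowed is not None:
            for value in values:
                if value not in allowed:
                    problems.append(
                        f"{attribute}={value!r} is not in the controlled vocabulary"
                    )
    return problems


# Example: one value outside the vocabulary, one required attribute absent.
metadata = [
    AVU("project_id", "P-001"),
    AVU("assay_type", "rnaseq"),
]
problems = validate(metadata, SEQUENCING_TEMPLATE)
for problem in problems:
    print(problem)
```

A check of this kind could run when data is enrolled, so that records missing required attributes are flagged before they reach a search index.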
Support integration and use of data in computation and analysis
Data/Tools Enrollment; Data/Tools Discovery; Secure Collaboration, Analysis, and Workflow Execution
Data Commons APIs; NIH Data Commons
NIEHS Concerns:
within project
NIH Concerns:
data
Moore, Reagan W., et al. "White Paper: National Data Infrastructure for Earth System Science."
– How do we as a community develop frameworks around iRODS capabilities and the philosophy of policy-based data management that ease development?
– How do we develop a pattern language and architectural discipline and talk with each other about systems that support FAIR and Big Data?
– The Consortium is already developing a pattern catalog, and this is a Good.Thing.
this may be the ‘next thing’.
translate into frameworks and capabilities in iRODS?
Patterns from https://irods.org/documentation/
Instruments commonsProdZone
NextGen Sequencing
Tiering, File Scanner, Landing Zone, Data-to-compute, Compute
commonsProdZone NIEHS Central Ontology/CV Service
Vocabs
Index/Search Platforms *Indexing Framework
*Metadata Templates *Virtual Collections
????
identifiers
not more difficult
NIEHS: Beth Bowden, John Bucher, Allen Dearry, Leesa Deterding, Michael Devito, Christopher Duncan, Matthew Edin, Thomas Van'T Erve, John Grovenstein, Guang Hu, Mary Jacobson, Jeffrey Kuhn, Beth Lauderdale, Jian-Liang Li, Alex Merrick, Geoffrey Mueller, Suzanne Osborne, Scott Redman, Andy Shapiro, Troy Simpson, Chris Stone, Cheryl Thompson, Paul Wade, Deborah Wales, Jason Williams, Rick Woychik; Renaissance Computing Institute (RENCI); iRODS Consortium