Frontiers of Metrology in Biology
26th CGPM 2018 Marc Salit Joint Initiative for Metrology in Biology NIST, Stanford University and SLAC
Frontiers of Metrology in Biology 26 th CGPM 2018 Marc Salit Joint - - PowerPoint PPT Presentation
Frontiers of Metrology in Biology 26 th CGPM 2018 Marc Salit Joint Initiative for Metrology in Biology NIST, Stanford University and SLAC Frontiers of Metrology in Biology 26 th CGPM 2018 Marc Salit Joint Initiative for Metrology in Biology
26th CGPM 2018 Marc Salit Joint Initiative for Metrology in Biology NIST, Stanford University and SLAC
26th CGPM 2018 Marc Salit Joint Initiative for Metrology in Biology NIST, Stanford University and SLAC
There are 1013 cells in a human (human cells). There are 1014 microbial cells in a human. There are ~1010 carbon atoms in a human cell.
This is a 192 x 128 grid at 18 droplets per cm. It contains 7212 droplet transfers of 2.5 nl/droplet. Each droplet contained about 1000 cells. Cells were grown for ~24h.
ACCELERATION VELOCITY AREA VOLUME ABSORBED DOSE DOSE EQUIVALENT PRESSURE, STRESS FORCE ENERGY, WORK, QUANTITY OF HEAT POTENTIAL, ELECTROMOTIVE FORCE CAPACITANCE ELECTRIC CHARGE ACTIVITY
(OF A RADIONUCLIDE)
CONDUCTANCE RESISTANCE INDUCTANCE MAGNETIC FLUX DENSITY MAGNETIC FLUX CELSIUS TEMPERATURE LUMINOUS FLUX ILLUMINANCE THERMODYNAMIC TEMPERATURE
K
LUMINOUS INTENSITY
cd
ELECTRIC CURRENT
A
AMOUNT OF SUBSTANCE
mol
TIME
s
LENGTH
m
MASS
kg
lm lx T Wb °C H
W
S V C
CATALYTIC ACTIVITY
kat F W Bq
FREQUENCY
Hz J Gy Sv Pa N
m3 m/s m2 m/s2
rad
PLANE ANGLE
sr
SOLID ANGLE POWER, HEAT FLOW RATE t/°C = T/K – 273.15 coulomb farad siemens
degree Celsius lumen (C/V) (A·s) katal (mol/s) (1/W) (V/A) (K) (cd·sr) becquerel (1/s) (1/s) hertz radian steradian (m2/m2 = 1) (m/m = 1) gray sievert pascal newton joule watt volt henry tesla lux weber (J/s) (Wb/A) (V·s) (W/A) (Wb/m2) (N·m) (N/m2) (J/kg) (J/kg) (kg·m/s2) (lm/m2)
kelvin candela ampere mole second meter kilogram
SI DERIVED UNITS WITH SPECIAL NAMES AND SYMBOLS SI BASE UNITS
Derived units without special names Solid lines indicate multiplication, broken lines indicate division
https://physics.nist.gov/cuu/Units/SIdiagram.html
ACCELERATION VELOCITY AREA VOLUME ABSORBED DOSE DOSE EQUIVALENT PRESSURE, STRESS FORCE ENERGY, WORK, QUANTITY OF HEAT POTENTIAL, ELECTROMOTIVE FORCE CAPACITANCE ELECTRIC CHARGE ACTIVITY
(OF A RADIONUCLIDE)
CONDUCTANCE RESISTANCE INDUCTANCE MAGNETIC FLUX DENSITY MAGNETIC FLUX CELSIUS TEMPERATURE LUMINOUS FLUX ILLUMINANCE THERMODYNAMIC TEMPERATURE
K
LUMINOUS INTENSITY
cd
ELECTRIC CURRENT
A
AMOUNT OF SUBSTANCE
mol
TIME
s
LENGTH
m
MASS
kg
lm lx T Wb °C H
W
S V C
CATALYTIC ACTIVITY
kat F W Bq
FREQUENCY
Hz J Gy Sv Pa N
m3 m/s m2 m/s2
rad
PLANE ANGLE
sr
SOLID ANGLE POWER, HEAT FLOW RATE t/°C = T/K – 273.15 coulomb farad siemens
degree Celsius lumen (C/V) (A·s) katal (mol/s) (1/W) (V/A) (K) (cd·sr) becquerel (1/s) (1/s) hertz radian steradian (m2/m2 = 1) (m/m = 1) gray sievert pascal newton joule watt volt henry tesla lux weber (J/s) (Wb/A) (V·s) (W/A) (Wb/m2) (N·m) (N/m2) (J/kg) (J/kg) (kg·m/s2) (lm/m2)
kelvin candela ampere mole second meter kilogram
SI DERIVED UNITS WITH SPECIAL NAMES AND SYMBOLS SI BASE UNITS
Derived units without special names Solid lines indicate multiplication, broken lines indicate division
JIMB is focused on Operational Mastery of living matter at the cellular level.
“Measure, Model, Make”
Synthetic Biology
the cell… Not focusing on metrology of biomaterials properties, medical diagnostics, biotherapeutics, regenerative medicine, diagnostic imaging…
Characterizing living matter requires measuring massively multiplexed measurands of heterogeneous systems with complex dynamics and interactions.
chemical systems governed by biophysics.
Genome
Transcriptome
Proteome
Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome
Jay Shendure,1*. Gregory J. Porreca,1*. Nikos B. Reppas,1 Xiaoxia Lin,1 John P. McCutcheon,2,3 Abraham M. Rosenbaum,1 Michael D. Wang,1 Kun Zhang,1 Robi D. Mitra,2 George M. Church1
We describe a DNA sequencing technology in which a commonly available, inexpensive epifluorescence microscope is converted to rapid nonelectrophoretic DNA sequencing automation. We apply this technology to resequence an evolved strain of Escherichia coli at less than one error per million consensus bases. A cell-free, mate-paired library provided single DNA molecules that were amplified in parallel to 1-micrometer beads by emulsion polymerase chain reaction. Millions of beads were immobilized in a polyacrylamide gel and subjected to automated cycles of sequencing by ligation and four-color imaging. Cost per base was roughly one-ninth as much as that of conventional sequencing. Our protocols were implemented with off-the-shelf instrumentation and reagents. The ubiquity and longevity of Sanger sequenc- ing (1) are remarkable. Analogous to semicon- ductors, measures of cost and production have followed exponential trends (2). High-throughput centers generate data at a speed of 20 raw bases per instrument-second and a cost of $1.00 per raw kilobase. Nonetheless, optimizations of elec- trophoretic methods may be reaching their lim-
genome requires a paradigm shift in our under- lying approach to the DNA polymer (3). Cyclic array methods, an attractive class
that they leverage a single reagent volume to enzymatically manipulate thousands to mil-
Shendure, J., Porreca, G. J., Reppas, N. B., Lin, X., McCutcheon, J. P., Rosenbaum, A. M., … Church, G. M. (2005). Accurate multiplex polony sequencing of an evolved bacterial genome. Science, 309(5741), 1728–1732. https://doi.org/10.1126/science.1117389
Ge Genome Re Regulation Transcriptom
Reg. Pr Proteome Re Reg. Me Metabolome Or Organelle Ce Cell Ti Tissue Or Organism Sy System …
Lots of methods, reasonably characterized We’re pretty good at sequencing, but sampling presents challenges. Pretty good at RNA-Seq, reasonably characterized, sampling challenges. A couple of methods, still emerging Variety of methods, technically challenging Variety of methods, still emerging Enzyme activity measures, not ‘omics? It’s more granular than this – there’s work to do to roadmap our measurement capabilities.
/Specifications
Materials
Data
from https://www.encodeproject.org based on an image from Darryl Leja (NHGRI), Ian Dunham (EBI), Michael Pazin (NHGRI)
were created in consortium partnerships
adopted
Genome-Scale Measurements Transcriptome Spike-ins ERCC Controls Human Genomes GIAB Genomes
SRM 2374 Plasmid DNA Library in vitro transcription RNA transcripts Pooling Pools with known abundance ratios
…
23 Controls per Subpool D e s i g n a b u n d a n c e s p a n s 220 r a n g e w i t h i n e a c h S u b p
Munro, S. A. et al. Nat.
doi: 10.1038/ncomms6125 (2014).
Evaluate Dynamic Range Performance Evaluate Ratio Performance – “MA Plot” Evaluate Diagnostic Performance – “ROC Curve” Establish Lower Limit of Detection for Differential Expression Detection – “LODR”
Sample gDNA isolation Library Prep Sequencing Alignment/Mapping Variant Calling Confidence Estimates Downstream Analysis
demonstrate, refine, optimize technologies
with stakeholders @ GA4GH
developers, regulators, clinical research teams
genome sequencing measurement process
Users analyze GIAB Samples Benchmark
data Critical feedback to GIAB Integrate new methods New benchmark data
Method development,
demonstration Part of assay validation GIAB/NIST expands to more difficult regions
Reference data
human genomes
variants”
All data available immediately without embargo
transparency and metrology
N50 and Coverage
5 longest mapped reads
Accurate
genotypes, haplotypes, and regions
majority of differences (FPs/FNs) are errors in the method
Representative examples
different genome contexts
Comprehensive characterization
types/genome contexts
benchmarking
Accurate
genotypes, haplotypes, and regions
majority of differences (FPs/FNs) are errors in the method
Representative examples
different genome contexts
Comprehensive characterization
types/genome contexts
benchmarking
measuring bulk populations of heterogeneous cells
Ananda L. Roy et al. Sci Adv 2018;4:eaat8573
Figure 5. Retrospective samples from GTEx can be successfully profiled using single-nucleus RNA-
which samples in (B) are obtained, are circled. (B) Single nucleus RNA-Seq (by DroNc-Seq) of hippocampus and frontal cortex samples from the GTEx collection. tSNE plots are colored by k-NN graph clustering and labeled post hoc by cell type. (C) Each cluster is supported by multiple individuals (from relevant tissue).
From Human Cell Atlas Whitepaper, accessed 11/14/2018
PGP Individual Genome IPS Cell Line
Reference Material Set
Epigenome Transcriptome Translatome Proteome Differentiation Cell Type I Cell Type II Cell Type III Cell Type N Reference Material Set Predictive Systems Models, Iteratively Refined with Data
Reference Material Set
Epigenome Transcriptome Translatome Proteome Reference Material Set Reference Material Set
Develop reference sets from a single individual that represent a “body map” of functional ‘omes
development and validation
technology development
Essential genes of unknown function Unknown functions that are essential
“what is naturally alive I do not understand” — D. Endy “what I cannot create I do not understand” — R. Feynman
(enabling operational mastery of living matter)
Essential gene sets Abstracted functional modules APIs to (2) and (3) Cell-free & PURE Expression architectures, from gene to operon to genome Validated DNA via -omics Molecular ensembles via Cryo-EM Fluid physics ensemble dynamics
jimb.stanford.edu
metrics and comparability for results from complex algorithms
measurement process
degree of complexity is significant
“nominal properties”
compatability/comparability?
“Completeness” of Knowledgebases…
measurement science and standards for ‘omics and synthetic biology
private sector
scope measurement science, measurement tool, and standards development
NIST Genome-Scale Measurements Group, MD & CA, and JIMB
Justin Zook Jenny McDaniel Lindsay Harris David Catoe Sarah Munro Scott Pine Noah Spies Sasha Levy Darach Miller Arend Sidow Drew Endy and The Genome in a Bottle Consortium and The External RNA Controls Consortium