Djorgovski MSR LATAM Summit, May 2010
Virtualization of Science and Scholarship
- S. George Djorgovski
Virtualization of Science and Scholarship S. George Djorgovski - - PowerPoint PPT Presentation
Virtualization of Science and Scholarship S. George Djorgovski Caltech MSR LATAM Summit, Guaruja, Brasil, May 2010 Djorgovski MSR LATAM Summit, May 2010 Definition: By Virtualization , I mean a migration of the scholarly work, data, tools,
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Theory (analytical + numerical) Experiment + Data Mining
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Visible + X-ray Crab Star forming complex Radio + IR
Data + Theory = Understanding
1970 1975 1980 1985 1990 1995 2000 0.1 1 10 100 1000 CCDs Glass
doubling t ≈ 1.5 yrs
TB’s to PB’s of data, 108 - 109 sources, 102 - 103 param./source
Djorgovski MSR LATAM Summit, May 2010
Infrastructure
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
– A broadening of the talent pool in astronomy, leading to a substantial democratization of the field
– Riding the exponential growth of the IT is far more cost effective than building expensive hardware facilities, e.g., big telescopes – Especially useful for countries without major observatories
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Key Technical Challenges Key Methodological Challenges
+feedback
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
N = data vectors, ~ 108 - 109, D = dimension, ~ 102 - 103
– Clustering ~ N log N N2, ~ D2 – Correlations ~ N log N N2, ~ Dk (k ≥ 1) – Likelihood, Bayesian ~ Nm (m ≥ 3), ~ Dk (k ≥ 1)
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
– Data, metadata, virtual data, simulations, algorithms, blogs, wikis, multimedia… – From static to dynamic: evolving and growing data sets
– Massive data sets can be only published as electronic archives, and should be curated by domain experts – Effective peer review and quality control – Persistency and integrity of data and pointers – Interoperability and metadata standards
Djorgovski MSR LATAM Summit, May 2010
Theory and Simulations
Djorgovski MSR LATAM Summit, May 2010
An Evolutionary Approach, 1972
Cyberspace is now effectively World 3, plus the ways of interacting with it
Dawkins memes
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Justin Rattner, Intel CTO, in a keynote talk at the SC’09: “… There is nothing more important to the long-term health of the HPC industry than the 3D Web…” “… the 3D Web will be the technology driver that revitalizes the HPC business model …” Video games and Virtual Worlds … and the gamer generation growing up Holywood going 3-D
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Nobel laureate John Mather
Djorgovski MSR LATAM Summit, May 2010
Astronomy and data parameter spaces Chemistry and biology Mathematics and networks
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010
Djorgovski MSR LATAM Summit, May 2010