A Compendium Platform for Reproducible, R-based Research with a focus on Statistics Education
UseR! 2008 - Patrick Wessa - K.U.Leuven Association, Lessius Dept. of Business Studies
A Compendium Platform for Reproducible, R-based Research with a - - PowerPoint PPT Presentation
A Compendium Platform for Reproducible, R-based Research with a focus on Statistics Education UseR! 2008 - Patrick Wessa - K.U.Leuven Association, Lessius Dept. of Business Studies Introduction Acknowledgments Motivation (based on
UseR! 2008 - Patrick Wessa - K.U.Leuven Association, Lessius Dept. of Business Studies
– Literature – The compendium redefined – Proposed solution
– K.U.Leuven Association, OOF 2007/13 – Donations from private companies
– Less than 8% of students got it right. – More than 90% of students could prove Wold's
– Interaction & collaboration (peer review) – Experimentation – Responsibility (social control)
*Source: Peter J. Green
*Source: Jan de Leeuw
applies to tables, standard errors, and so on. The fact that figures often happen to be easier to reproduce, does not preclude that we should apply the same rule to any form of computer-generated output.
We can make exactly the same statement about our lectures and teaching, certainly in the context of graduate teaching. We must be able to give our students our code and our graphics files, so that they can display and study them on their own computers (and not only on our workstations, or in crowded university labs).
``software environment'' is. Buckheit and Donoho apply the principle in such a way that everybody who wants to check their results is forced to buy MatLab(R). Not Mathematica(R), Macsyma(R), or S-plus(R). Those you may need to buy for
*Source: Jan de Leeuw, Reproducible Research: the Bottom Line, 2001, online
– is required to DIE (Download, Install, Execute) – must have a working knowledge of LaTeX and R – must recreate a working compendium (for each
Text Software Data Text Software Data Software Tar, zip, rar, ... LaTeX R code
Meta Information Software Data Text Ref. Ref. Ref. Ref. Ref. Ref. R Module R Module R Module R Module R Module R Module
Meta Information Software Data Text Ref. R Module 1 R Module 1 Changed/New R Module
R Framework Compendium Platform Compendium Blog Reproduce & Reuse Reference Create/Maintain Query Engine Process Measurements (Virtual) Learning Environment Usage Usage Search Engine
www.wessa.net www.freestatistics.org www.moodle.org
http://www.wessa.net/download/tutorial1.pdf (Time Series Analysis - Introduction) http://www.wessa.net/download/tutorial.pdf (Descriptive Statistics – Central Tendency) Note: both documents are “work in progress” Please, send corrections & suggestions to patrick@wessa.net
A framework for statistical software development, maintenance, and publishing within an open-access business model, 2008, Computational Statistics
Learning Statistics based on the Compendium and Reproducible Computing, Proceedings of the International Conference on Education and Information Technology (ICEIT'08), Berkeley, San Francisco, USA
Reproduce at wessa.net Cite the computation as follows
How Reproducible Research Leads to Non-Rote Learning Within a Socially Constructivist E-Learning Environment, Proceedings of the 7th European Conference on e-Learning (ECEL'08), Cyprus
Submitting Peer Review (feedback) is a good learning activity – not a good grading procedure
Measurement and Control of Statistics Learning Processes based on Constructivist Feedback and Reproducible Computing, Proceedings of the 3rd International Conference on Virtual Learning (ICVL '08), Romania http://www.wessa.net/rwasp_icvl2008.wasp
Antoniadis, editor, Wavelets and Statistics. Springer-Verlag, 1995.
423–438, 2003.
Mesirov, H. Coller, M.L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science, 286:531–537, 1999.
International Journal of Wavelets, Multiresolution and Information Processing, 2004
Epidemiologic Research, American Journal of Epidemiology, 2006
Bioconductor
BioSilico, 2005
Review of the State of the Art), Department of Statistics and Mathematics Wirtschaftsuniversität Wien, Research Report Series, Report 60, November 2007
Reproducible Research, http://www.bepress.com/bioconductor/paper2
reproducible, Computing in Science & Engineering, 2 (6), pp. 61-67, 2000.
Proceedings of the 3rd International Workshop on Distributed Statistical Computing, 2003, Vienna, Austria, ISSN 1609-395
Proceedings of the International Conference of Education, Research and Innovation (ICERI 2008), *submitted*
Education, based on the Compendium Platform, Proceedings of the International Conference of Education, Research and Innovation (ICERI 2008), *submitted*
Learning Environments, Proceedings of the International Conference of Education, Research and Innovation (ICERI 2008), *submitted*
Vancouver, Canada
Reproducible Computing, Applied Statistics 2008, to be submitted to Advances in Methodology and Statistics
business model, Computational Statistics
International Conference on Education and Information Technology (ICEIT'08), Berkeley, San Francisco, USA
Environment, Proceedings of the 7th European Conference on e-Learning (ECEL'08), Cyprus
Reproducible Computing, Proceedings of the 3rd International Conference on Virtual Learning (ICVL '08), Romania
http://www.wessa.net/download/tutorial.pdf
http://www.wessa.net/download/tutorial1.pdf All documents will be available at http://www.freestatistics.org/index.php?action=10 in the near future.