Using Research Data Centres (RDC) to access Big Data David Schiller - - PowerPoint PPT Presentation

using research data centres rdc
SMART_READER_LITE
LIVE PREVIEW

Using Research Data Centres (RDC) to access Big Data David Schiller - - PowerPoint PPT Presentation

New Techniques and Technologies for Statistics NTTS 2015, Brussels (Belgium), March 11, 2015 Using Research Data Centres (RDC) to access Big Data David Schiller (IAB) Anja Burghardt (IAB) The Research Data Centre (FDZ) of the German Federal


slide-1
SLIDE 1 New Techniques and Technologies for Statistics NTTS 2015, Brussels (Belgium), March 11, 2015

Using Research Data Centres (RDC) to access Big Data

David Schiller (IAB) Anja Burghardt (IAB)
slide-2
SLIDE 2 The Research Data Centre (FDZ) of the German Federal Employment Agency (BA) at the Institute for Employment Research (IAB)
slide-3
SLIDE 3 The Research Data Centre (FDZ) of the German Federal Employment Agency (BA) at the Institute for Employment Research (IAB)
slide-4
SLIDE 4 The Research Data Centre (FDZ) of the German Federal Employment Agency (BA) at the Institute for Employment Research (IAB)

Big

Big Data Data

slide-5
SLIDE 5

A closer loot at Big Data

 Too big to be moved  Designed (survey) Data vs. Organic (Big)

Data (Groves)

 Meaningful Big Data by linkage to survey

data (Kreuter/Peng)

 Restricted (Big) Data vs. Open (Big) Data  Ownership: Private and Public Sector

Using RDCs to access Big Data, NTTS 2015 5
slide-6
SLIDE 6

Infrastructure to work with Big Data

 User located in different access locations

slide-7
SLIDE 7

Infrastructure to work with Big Data

 A graphical user interface for interaction between

user and infrastructure

slide-8
SLIDE 8

Infrastructure to work with Big Data

 A Computation Centre hosting the relevant

applications (may also be organized as distributed solution)

slide-9
SLIDE 9

Infrastructure to work with Big Data

 Data sources with different storage locations,

storage formats, access restrictions etc.

slide-10
SLIDE 10

Infrastructure to work with Big Data

 All can also be seen as a (secure) Virtual

Research Environment to work with Big Data

slide-11
SLIDE 11

Conclusion and Outlook

 Use RDC structure, experience, and knowledge  Blend Big Data and survey-based/official data; in order to: ‐ Use Big Data as additional resource to enrich fixed (designed) social science matrix files or to ‐ Dive into Big Data by using new research technics that can deal with the flow of data (more research and more education for students needed)  Need for secure Virtual Research Environment (secure VRE) ‐ protect intellectual property as well as confidential data ‐ enable replication studies and communication between researcher ‐ Offer tools and services needed to work with Big Data Using RDCs to access Big Data, NTTS 2015 11