Stata export for metadata documentation Munich, 26.05.2019 Anne - - PowerPoint PPT Presentation

stata export for metadata documentation
SMART_READER_LITE
LIVE PREVIEW

Stata export for metadata documentation Munich, 26.05.2019 Anne - - PowerPoint PPT Presentation

Stata export for metadata documentation Munich, 26.05.2019 Anne Balz, Klaus Pforr, Florian Thirolf Motivation German Microdata Lab (GML) offers Metadata for various official microdata online Goal: extract Metadata from these Datasets


slide-1
SLIDE 1

Stata export for metadata documentation

Munich, 26.05.2019

Anne Balz, Klaus Pforr, Florian Thirolf

slide-2
SLIDE 2

Motivation

German Microdata Lab (GML) offers Metadata for various official microdata online

Goal: extract Metadata from these Datasets automatically and import them into our database German Microcensus European Labour Force Survey EU-SILC (European Union Statistics on Income and Living Conditions)

2

slide-3
SLIDE 3

Microdata-Informationsystem MISSY

3

Online plattform („MISSY-web“) Documentation of official microdata (European & national) Documentation on different levels:

study question variable

slide-4
SLIDE 4

Microdata-Informationsystem MISSY

4

slide-5
SLIDE 5

Microdata-Informationsystem MISSY

5

slide-6
SLIDE 6

ado dta2md

core functionality

slide-7
SLIDE 7

core functionality

7

*.dta

  • utput.*
slide-8
SLIDE 8

core functionality

8

*.dta dta2meta.ado meta.dta

  • utput.*

meta2*.ado

slide-9
SLIDE 9

ado dta2md

ado dta2md

slide-10
SLIDE 10

ado dta2md

10

*.dta dta2meta.ado meta.dta meta2*.ado

  • utput.*
slide-11
SLIDE 11

the meta-file

All necessary (meta-)information in a table format: Variable level

Varname, -label Summary statistics (min, max, mean, std)

Value level

Value, - label Frequencies and percentages Overall For groups (e.g.: countries)

11

slide-12
SLIDE 12

ado dta2md

12

Value Level User Input (Variable): Group-Variable & Computed Technical: First Value within Variable Variable Level

slide-13
SLIDE 13

the meta-file

13

slide-14
SLIDE 14

the meta-file

14

slide-15
SLIDE 15

ado dta2md

15

slide-16
SLIDE 16

ado dta2md

dta2md input(filename) output(filename) // freqvarlist(varlist) // [group(varname) // missingdef(string) smissingdef(string) // replace ] dta2md input($path/micro_file.dta) output($path/meta_file.dta)// freqvarlist(var1 var2 var3) // group(country) // missing("X<0") // smissingdef(`"X="invalid answer"| X="did not understand""') // replace

16

slide-17
SLIDE 17

ado dta2md

17

Loop over all vars If group specified: Loop over all groups (within levels of vars) If computed: Loop over all levels (within all vars) If group specified: Loop over all groups

slide-18
SLIDE 18

ado dta2md

ado meta2DDI

slide-19
SLIDE 19

ado meta2DDI

19

*.dta dta2Meta.ado meta.dta meta2DDI.ado DDI2.5.xml

slide-20
SLIDE 20

ado meta2DDI

Uses the ‚file‘ command ‚forvalues‘ to run through all categories variables of the meta- file are used to form hierarchical output

20

example:

‚first‘ (0/1) tags first category of a variable used to generate output on variable level

slide-21
SLIDE 21

ado meta2DDI

21

slide-22
SLIDE 22

ado meta2DDI

22

slide-23
SLIDE 23

ado dta2md

usecase MISSY

slide-24
SLIDE 24

Usecase MISSY

24

*.dta dta2Meta.ado meta.dta meta2sql.ado getUUIDs generateUUIDs mapRelations Database

  • utput.sql
slide-25
SLIDE 25

meta2sql.ado

‚file‘ command is used different frame ‚forvalues‘ for each database-table

25