digital medicine i introduction to programming
play

Digital Medicine I: Introduction to Programming Pandas Autumn 2019 - PowerPoint PPT Presentation

Hans-Joachim Bckenhauer and Dennis Komm Digital Medicine I: Introduction to Programming Pandas Autumn 2019 December 19, 2019 So far. . . numpy and matplotlib The Modules numpy and matplotlib numpy Calculations with vectors and matrices


  1. Hans-Joachim Böckenhauer and Dennis Komm Digital Medicine I: Introduction to Programming Pandas Autumn 2019 – December 19, 2019

  2. So far. . . numpy and matplotlib

  3. The Modules numpy and matplotlib numpy Calculations with vectors and matrices Numerical methods Documentation: https://numpy.org/doc/ Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 1 / 12

  4. The Modules numpy and matplotlib numpy Calculations with vectors and matrices Numerical methods Documentation: https://numpy.org/doc/ matplotlib Data visualization (Plots) Documentation: https://matplotlib.org/contents.html Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 1 / 12

  5. Now. . . pandas

  6. The Module pandas pandas Processing of large sets of data Allows a functionality similar to Excel Documentation: https://pandas.pydata.org/pandas-docs/stable/ Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 2 / 12

  7. The Module pandas pandas Processing of large sets of data Allows a functionality similar to Excel Documentation: https://pandas.pydata.org/pandas-docs/stable/ Project 3: Reading in and processing CSV file “manually” pandas contains data structures and functions for this Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 2 / 12

  8. The Module pandas Import pandas analogously to numpy and matplotlib import pandas as pd Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 3 / 12

  9. The Module pandas Import pandas analogously to numpy and matplotlib import pandas as pd Read in CSV file and store it in a special data type pandas dataframe (instead of Python list or numpy array) data = pd.read_csv("daten.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 3 / 12

  10. The Module pandas Import pandas analogously to numpy and matplotlib import pandas as pd Read in CSV file and store it in a special data type pandas dataframe (instead of Python list or numpy array) data = pd.read_csv("daten.csv") Files in Excel format can be read in analogously data = pd.read_excel("daten.xlsx") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 3 / 12

  11. Air Measurements using pandas

  12. Exercise – Air Measurements Air measurements Copy the data file from project 3 ugz_luftqualitaetsmessung_seit-2012.csv Read in the CSV file and output its content To this end, use read_csv() and print() Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 4 / 12

  13. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  14. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  15. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  16. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) Output line 5 print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  17. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) Output lines 0 to 9 print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  18. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) print(data.iloc[0:10]) print(data.head(3)) Output lines 0 to 2 print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  19. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Output line 8, column 0 Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  20. Reading in and Processing CSV Files Extract data Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  21. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 We are only interested in the first 3 columns We want to change column names Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  22. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 We are only interested in the first 3 columns We want to change column names import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") newdata = data.iloc[5:, 0:3] newdata = newdata.rename(columns={"Zürich Stampfenbachstrasse": "SO2", \ "Zürich Stampfenbachstrasse.1": "CO"}) newdata.to_csv("messungen.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  23. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 Selection from line 5 We are only interested in the first 3 columns and columns 0 to 2 We want to change column names import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") newdata = data.iloc[5:, 0:3] newdata = newdata.rename(columns={"Zürich Stampfenbachstrasse": "SO2", \ "Zürich Stampfenbachstrasse.1": "CO"}) newdata.to_csv("messungen.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  24. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 We are only interested in the first 3 columns Rename columns We want to change column names import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") newdata = data.iloc[5:, 0:3] newdata = newdata.rename(columns={"Zürich Stampfenbachstrasse": "SO2", \ "Zürich Stampfenbachstrasse.1": "CO"}) newdata.to_csv("messungen.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  25. Reading in and Processing CSV Files Accessing data using the column names Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  26. Reading in and Processing CSV Files Accessing data using the column names Output all column names as list print(data.columns) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  27. Reading in and Processing CSV Files Accessing data using the column names Output all column names as list print(data.columns) Output column “Datum” print(data["Datum"]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  28. Reading in and Processing CSV Files Accessing data using the column names Output all column names as list print(data.columns) Output column “Datum” print(data["Datum"]) Output column “Zürich Stampfenbachstrasse – Kohlenmonoxid” print(data["Zürich Stampfenbachstrasse.1"]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  29. Reading in and Processing CSV Files Filtering data Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 8 / 12

  30. Reading in and Processing CSV Files Filtering data Use loc instead of iloc in order to specify conditions print(data.loc[data["Datum"] == "2014-12-19"]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 8 / 12

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend