CSE 115 Introduction to Computer Science I Road map Review - PowerPoint PPT Presentation

CSE 115 Introduction to Computer Science I

Road map ▶︎ Review ◀ Exercises from last time Reading csv files exercise

File reading A b i t o f t e x t \n o n s e v e r a l l i n e s \n … A text file is a sequence of characters. The contents can be read line by line: A b i t o f t e x t \n o n s e v e r a l l i n e s \n …

File reading File objects support iteration: with open("Chapter1.txt") as f: for line in f: . . . do something with each line . . .

Road map Review ▶︎ Exercises from last time ◀ Reading csv files exercise

Exercises 1. Define a function that takes a file name as an argument and returns a map with character counts for the file. def countCharacters(filename): count = {} with open(filename) as f: for line in f: for ch in line: Read data from file if ch in count: count[ch] = count[ch] + 1 else: count[ch] = 1 return count

Exercises 1. Define a function that takes a file name as an argument and returns a map with character counts for the file. def countCharacters(filename): count = {} with open(filename) as f: for line in f: for ch in line: if ch in count: Process each line from file count[ch] = count[ch] + 1 else: count[ch] = 1 return count

Exercises 1. Define a function that takes a file name as an argument and returns a map with character counts for the file. def countCharacters(filename): count = {} with open(filename) as f: for line in f: for ch in line: if ch in count: Process each character from line count[ch] = count[ch] + 1 else: count[ch] = 1 return count

Exercises 1. Define a function that takes a file name as an argument and returns a map with character counts for the file. def countCharacters(filename): count = {} with open(filename) as f: for line in f: for ch in line: if ch in count: If we've see a character before, increment its count count[ch] = count[ch] + 1 else: but the first time we see a character, enter it with a count of 1 count[ch] = 1 return count

Exercises 1. Define a function that takes a file name as an argument and returns a map with character counts for the file. def countCharacters(filename): count = {} with open(filename) as f: for line in f: for ch in line: if ch in count: count[ch] = count[ch] + 1 else: count[ch] = 1 return count

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. Q: What counts as a word? Anything consisting of uppercase letters A-Z, lowercase letters a-z, and the single quote '. This means that anything that is not A-Z or a-z or ' must come between words. Q: How do we segment a string into words? We can use a library called re, which is a regular expression library. The relevant regular expression to split a string into words is [^A-Za-z']+

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import regular expression library import re def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) for word in wordList: Read data from file if word in count: count[word] = count[word] + 1 else: count[word] = 1 return count

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import re def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) for word in wordList: Process each line from file if word in count: count[word] = count[word] + 1 else: count[word] = 1 return count

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import re def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) for word in wordList: if word in count: Process each word from line count[word] = count[word] + 1 else: count[word] = 1 return count

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import re def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) Break line into words for word in wordList: if word in count: Process each word from line count[word] = count[word] + 1 else: count[word] = 1 return count

Regular expressions Regular expressions are used to match patterns. We will use a regular expression library to split each line from the file into words in a reasonable way. Q: What counts as a word? Anything consisting of uppercase letters A-Z, lowercase letters a-z, and the single quote '. This means that anything that is not A-Z or a-z or ' must come between words.

Regular expressions This regular expression will break a string into parts at character sequences which are not letters or the single quote (apostrophe): Sally's new puppy is named Rover. Rover's tail was wagging. Rover was happy! Sally's new puppy is named Rover. Rover's tail was wagging. Rover was happy!

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import re Any character that's not a One or more such letter or the single quote characters def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) for word in wordList: if word in count: Process each word from wordList count[word] = count[word] + 1 else: count[word] = 1 return count

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import re def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) for word in wordList: if word in count: If we've see a word before, increment its count count[word] = count[word] + 1 else: but the first time we see a word, enter it with a count count[word] = 1 of 1 return count

Exercises 2. Define a function that takes a file name as an argument and returns a map with word counts for the file. import re def countWords(filename): count = {} with open(filename) as f: for line in f: wordList = re.split("[^a-zA-Z']+", line) for word in wordList: if word in count: count[word] = count[word] + 1 else: count[word] = 1 return count

Road map Review Exercises from last time ▶︎ Reading csv files ◀ exercise

csv files Comma-separated values In computing, a comma-separated values ( CSV ) file is a delimited text file that uses a comma to separate values. A CSV file stores tabular data (numbers and text) in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The use of the comma as a field separator is the source of the name for this file format. Excerpt from https://en.wikipedia.org/wiki/Comma-separated_values

csv files A csv file is a plain text file that contains rows of data, one row per line, with data elements separated by commas on each line. For example: Heating.csv Month,Budget,Actual January,200,190 February,200,210 March,150,185 April,100,110 May,50,40 June,50,15 July,50,12 August,50,14 September,50,35 October,100,78 November,150,125 December,200,167

csv files A csv files can be read from and written to by different applications, such as Excel (left) and Numbers (right). Heating.csv Month,Budget,Actual January,200,190 February,200,210 March,150,185 April,100,110 May,50,40 June,50,15 July,50,12 August,50,14 September,50,35 October,100,78 November,150,125 December,200,167

Reading csv files Let's write a program to read the data in our csv file into a dictionary. We'll use the month as a key, and put the rest of the data into a list. For example: {'Month': ['Budget', 'Actual'], 'January': ['200', '190'], 'February': ['200', '210'], 'March': ['150', '185'], 'April': ['100', '110'], 'May': ['50', '40'], 'June': ['50', '15'], 'July': ['50', '12'], 'August': ['50', '14'], 'September': ['50', '35'], 'October': ['100', '78'], 'November': ['150', '125'], 'December': ['200', '167'] }

Reading csv files import csv library import csv def readBudget(filename): budget = {} with open(filename, newline='') as f: reader = csv.reader(f) for line in reader: Read data from file month = line[0] line.pop(0) budget[month] = line return budget

Reading csv files documentation says this is needed when reading import csv csv files def readBudget(filename): budget = {} with open(filename, newline='') as f: reader = csv.reader(f) for line in reader: Process each line from file month = line[0] line.pop(0) budget[month] = line return budget

Reading csv files import csv def readBudget(filename): budget = {} with open(filename, newline='') as f: reader = csv.reader(f) for line in reader: month = line[0] Process data from line: a list of line.pop(0) the comma separated values budget[month] = line return budget

Reading csv files Class came up with this approach: import csv def readBudget(filename): budget = {} with open(filename, newline='') as f: reader = csv.reader(f) for line in reader: key = line[0] value = [line[1], line[2]] budget[key] = value return budget

CSE 115 Introduction to Computer Science I Road map Review - PowerPoint PPT Presentation

CSE 115 Introduction to Computer Science I Road map Review Exercises from last time Reading csv files exercise File reading A b i t o f t e x t \n o n s e v e r a l l i n e s \n A text file is a sequence of

Slide 7 / 115 Slide 8 / 115 Living and Nonliving Characteristics of Living Things Categorize

115 E 75th St Renovation December 17, 2018 E 77 ST 115 E 75 NORTHWELL HEALTH LENOX HILL

Matthew Samet msamet@tamdistrict.org 1st Period Ph.Un. - Room 120 3rd Period Ph.Un. - Room 115

Lagrange Multipliers Math 115 Calculus 115 How to deal with constrained optimization. Calculus

CSE 3401 Functional and Logic Programming York University CSE 3401 Vida Movahedi 1 York University

Introduction to Ecology This picture includes one squirrelfish (red), two cleaner wrasse (little

6th Grade Ecosystem Dynamics 2015-08-27 www.njctl.org Slide 3 / 115 Table of Contents:

Life Cycles Return to Table of Contents Slide 6 / 115 Living and Nonliving Think back to what

6th Grade Ecosystem Dynamics 2015-08-27 www.njctl.org Slide 3 / 115 Table of Contents:

Standard Metric Measurement Conversions Measurement Conversions Unit Cubes Volume of a

Standard Metric Measurement Conversions Measurement Conversions Unit Cubes Volume of a

Welcome%to Comp%115:%Databases http://www.cs.tufts.edu/comp/115/ Instructor:% Manos5Athanassoulis

CSE 115 Introduction to Computer Science I Announcements Lab activites/Lab exams submit

CSE 182-L2:Blast & variants I Dynamic Programming www.cse cse. .ucsd ucsd. .edu

CSE 312 Final Review: Section AA CSE 312 TAs December 8, 2011 CSE 312 Final Review: Section AA

Welcome to CSE 506 Introduc/on & Review Don Porter 1 2 CSE 506: Opera.ng Systems CSE 506:

Introduction read.csv Importing Data in R Importing data in R ? Importing Data in R 5 types

How to get data Dr. Steven Bitner Announcement HW # 8 is now due on Thanksgiving (Nov 22 nd )

Batch Metadata Editing in DSpace 1.6+ Maureen P. Walsh, The Ohio State University Libraries

Lecture 12: Comma Separated Values (CSV) Format CSV CSV data representing financial information

Manipulating Data Files in Python Learning Objectives Working with CSV files Reading

More than you ever wanted to know about CSV Digging into the CSV script Script Outline Load file

Export to RCE Adam Greenhill SecurityCompass Who am I? Senior Consultant @ Security

Mixed models in R using the lme4 package Part 1: Introduction to R Douglas Bates University of

CSE 115 Introduction to Computer Science I Road map Review - PowerPoint PPT Presentation

CSE 115 Introduction to Computer Science I Road map Review Exercises from last time Reading csv files exercise File reading A b i t o f t e x t \n o n s e v e r a l l i n e s \n A text file is a sequence of

Slide 7 / 115 Slide 8 / 115 Living and Nonliving Characteristics of Living Things Categorize

115 E 75th St Renovation December 17, 2018 E 77 ST 115 E 75 NORTHWELL HEALTH LENOX HILL

Matthew Samet msamet@tamdistrict.org 1st Period Ph.Un. - Room 120 3rd Period Ph.Un. - Room 115

Lagrange Multipliers Math 115 Calculus 115 How to deal with constrained optimization. Calculus

CSE 3401 Functional and Logic Programming York University CSE 3401 Vida Movahedi 1 York University

Introduction to Ecology This picture includes one squirrelfish (red), two cleaner wrasse (little

6th Grade Ecosystem Dynamics 2015-08-27 www.njctl.org Slide 3 / 115 Table of Contents:

Life Cycles Return to Table of Contents Slide 6 / 115 Living and Nonliving Think back to what

6th Grade Ecosystem Dynamics 2015-08-27 www.njctl.org Slide 3 / 115 Table of Contents:

Standard Metric Measurement Conversions Measurement Conversions Unit Cubes Volume of a

Standard Metric Measurement Conversions Measurement Conversions Unit Cubes Volume of a

Welcome%to Comp%115:%Databases http://www.cs.tufts.edu/comp/115/ Instructor:% Manos5Athanassoulis

CSE 115 Introduction to Computer Science I Announcements Lab activites/Lab exams submit

CSE 182-L2:Blast &amp; variants I Dynamic Programming www.cse cse. .ucsd ucsd. .edu

CSE 312 Final Review: Section AA CSE 312 TAs December 8, 2011 CSE 312 Final Review: Section AA

Welcome to CSE 506 Introduc/on &amp; Review Don Porter 1 2 CSE 506: Opera.ng Systems CSE 506:

Introduction read.csv Importing Data in R Importing data in R ? Importing Data in R 5 types

How to get data Dr. Steven Bitner Announcement HW # 8 is now due on Thanksgiving (Nov 22 nd )

Batch Metadata Editing in DSpace 1.6+ Maureen P. Walsh, The Ohio State University Libraries

Lecture 12: Comma Separated Values (CSV) Format CSV CSV data representing financial information

Manipulating Data Files in Python Learning Objectives Working with CSV files Reading

More than you ever wanted to know about CSV Digging into the CSV script Script Outline Load file

Export to RCE Adam Greenhill SecurityCompass Who am I? Senior Consultant @ Security

Mixed models in R using the lme4 package Part 1: Introduction to R Douglas Bates University of

CSE 182-L2:Blast & variants I Dynamic Programming www.cse cse. .ucsd ucsd. .edu

Welcome to CSE 506 Introduc/on & Review Don Porter 1 2 CSE 506: Opera.ng Systems CSE 506: