Guest Lecture Daniel Dao & Chad Cotton OVERVIEW What is - - PowerPoint PPT Presentation

guest lecture
SMART_READER_LITE
LIVE PREVIEW

Guest Lecture Daniel Dao & Chad Cotton OVERVIEW What is - - PowerPoint PPT Presentation

Guest Lecture Daniel Dao & Chad Cotton OVERVIEW What is Civitas Learning What We Do Mission Statement Demo What I Do How I Use Databases Chad Cotton CIVITAS LEARNING, INC. CONFIDENTIAL INFORMATION WHAT IS CIVITAS LEARNING Civitas


slide-1
SLIDE 1

Daniel Dao & Chad Cotton

Guest Lecture

slide-2
SLIDE 2

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

OVERVIEW

What is Civitas Learning What We Do Mission Statement Demo What I Do How I Use Databases Chad Cotton

slide-3
SLIDE 3

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

WHAT IS CIVITAS LEARNING

Civitas Learning Mid-sized startup Data driven company Education

slide-4
SLIDE 4

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

“We partner with forward-thinking colleges and universities, harnessing the power of insight and action analytics to help a million more students learn well and finish strong.” – The Million More Mission

slide-5
SLIDE 5

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

WHAT WE DO

  • Work with institutions to provide insights through

various applications

  • Inspire
slide-6
SLIDE 6

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Inspire for Faculty Demo

slide-7
SLIDE 7

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

HOW I USE DATABASES

slide-8
SLIDE 8

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

WHAT I DO

  • My role in the company
  • How my work is broken down
  • Product
  • Dev managers, PSMs, engineers
  • Frontend
  • Work with HTML/CSS/ReactJS
  • Backend
  • Writing APIs
  • Working with models
  • Writing SQL
  • Optimizing performance
  • Writing tests
slide-9
SLIDE 9

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Chad Cotton - DE

  • Graduated from THE University of Texas
  • Bachelors in Physic
  • Bachelors in Computer Science
  • Data Engineering
  • I've been working in Data Engineering for about 7 ½

years.

  • I've been at Civitas for a year and a half
  • Before I was a Data Engineer I taught Math and

Computer Science in High School

slide-10
SLIDE 10

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Goals

At the end of this lecture, you will be able to:

  • Distinguish jobs in the data industry.
  • Describe the process Civitas uses to manipulate

data.

  • Evaluate methods of loading data into a database.
slide-11
SLIDE 11

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Jobs in the Data Industry

  • When you are out looking for work, you may

wonder what types of things to look for:

  • Integration Specialist – getting data out of, or into a

database (ETL: Extract Transform Load)

  • Database developer – getting data out of a database and

using it in software, or creating routines to transform the data within the database.

  • Database Administrator – make the database perform as

efficiently as possible.

  • Database architect – Designing schema to organize the

data and or model a business process in a database

  • Data Scientist – Much more about math than databases.
  • Data Analyst – Investigate data to find insights.
slide-12
SLIDE 12

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

What I Do

  • Data Engineer
  • Made up title to describe someone that has to do both

the Integration specialist role and database developer

  • What I am not is a Data Architect/Scientist/Admin
  • ETL (Extract – Transform – Load)
  • This is an idiom for many combinations of the three.
  • As you will see, it is rarely as simple as just those three.
slide-13
SLIDE 13

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Civitas Data Flow

Load Extract Red Shift Web Server Secure File Transfer Protocol Transform Extract Load Extract Transform Load Load

slide-14
SLIDE 14

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Load

Red Shift Table File SFTP

Flat file: Plain Text file that is non-hierarchical, usually in the form of CSV, or TSV. Each row represents one row in the database.

Table Table

slide-15
SLIDE 15

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Transform

SRC_ SRC_ SRC_ RAW RAW SELECT SPBPERS.SPBPERS_PIDM AS raw_person_id , SPBPERS.SPBPERS_BIRTH_DATE AS raw_birth_dt , SPBPERS.SPBPERS_DEAD_DATE AS raw_death_dt , SPBPERS.SPBPERS_SEX AS raw_gender , null AS raw_primary_language , null AS raw_country_of_origin FROM src_banner_saturn.spbpers

slide-16
SLIDE 16

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Technology for each phase

Source Raw Canonical Model Aggregation Data Science Application Model Load data from partner institution. Ingest using Libraries Mapping Source

  • nto our

Base model SQL Computations to do statistical analytics. SQL/Statistical Modeling Prepare tables for final output to the application database: SQL

slide-17
SLIDE 17

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Extract

  • As long as the data is in the tables, there are export

commands that can simply dump the data to a file.

APP APP RAW RAW RAW Exported as FLAT FILES

slide-18
SLIDE 18

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Extract

  • As long as the data is in the tables, there are export

commands that can simply dump the data to a file.

APP APP LOAD RAW RAW RAW PostGres

slide-19
SLIDE 19

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Data Flow

Red Shift Web Server Secure File Transfer Protocol

slide-20
SLIDE 20

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Summary

  • Jobs in the data industry.
  • Process Civitas uses to manipulate data.
  • Methods of loading data into a database.
  • Careers at Civitas Learning.
slide-21
SLIDE 21

CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

Questions?