Guest Lecture: Daniel Dao & Chad Cotton


  1. Guest Lecture Daniel Dao & Chad Cotton

  2. OVERVIEW • What is Civitas Learning • What We Do • Mission Statement • Demo • What I Do • How I Use Databases • Chad Cotton CIVITAS LEARNING, INC. – CONFIDENTIAL INFORMATION

  3. WHAT IS CIVITAS LEARNING • Civitas Learning • Mid-sized • Data-driven • Education startup company

  4. “We partner with forward-thinking colleges and universities, harnessing the power of insight and action analytics to help a million more students learn well and finish strong.” – The Million More Mission

  5. WHAT WE DO • Work with institutions to provide insights through various applications • Inspire

  6. Inspire for Faculty Demo

  7. HOW I USE DATABASES

  8. WHAT I DO • My role in the company • How my work is broken down • Product • Dev managers, PSMs, engineers • Frontend • Work with HTML/CSS/ReactJS • Backend • Writing APIs • Working with models • Writing SQL • Optimizing performance • Writing tests

  9. Chad Cotton - DE • Graduated from The University of Texas • Bachelor's in Physics • Bachelor's in Computer Science • Data Engineering • I've been working in Data Engineering for about 7½ years. • I've been at Civitas for a year and a half. • Before I was a Data Engineer, I taught math and computer science in high school.

  10. Goals At the end of this lecture, you will be able to: • Distinguish jobs in the data industry. • Describe the process Civitas uses to manipulate data. • Evaluate methods of loading data into a database.

  11. Jobs in the Data Industry • When you are out looking for work, you may wonder what types of roles to look for: • Integration Specialist – getting data out of, or into, a database (ETL: Extract, Transform, Load). • Database Developer – getting data out of a database and using it in software, or creating routines to transform the data within the database. • Database Administrator – making the database perform as efficiently as possible. • Database Architect – designing schemas to organize the data and/or model a business process in a database. • Data Scientist – much more about math than databases. • Data Analyst – investigating data to find insights.

  12. What I Do • Data Engineer • Made-up title to describe someone who does both the Integration Specialist and Database Developer roles • What I am not is a Data Architect/Scientist/Admin • ETL (Extract – Transform – Load) • This is an idiom for many combinations of the three. • As you will see, it is rarely as simple as just those three.

  13. Civitas Data Flow [Diagram: Extract, Transform, and Load steps connecting Secure File Transfer Protocol (SFTP), Redshift, and the Web Server.]

  14. Load [Diagram: a file on SFTP is loaded into tables in Redshift.] Flat file: a plain-text file that is non-hierarchical, usually in the form of CSV or TSV. Each row in the file represents one row in the database.
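
The Load step above can be sketched in a few lines. This is a minimal illustration, not the Civitas pipeline: SQLite stands in for Redshift, the flat file is an in-memory string rather than a file fetched over SFTP, and the `person` table, its columns, and `load_flat_file` are hypothetical names.

```python
import csv
import io
import sqlite3

# Hypothetical flat file contents (CSV); in practice this would arrive via SFTP.
FLAT_FILE = "person_id,birth_dt,gender\n1,1990-04-12,F\n2,1988-11-03,M\n"


def load_flat_file(conn, text):
    """Load a CSV flat file into a table: one file row becomes one table row."""
    reader = csv.DictReader(io.StringIO(text))
    conn.execute(
        "CREATE TABLE person (person_id INTEGER, birth_dt TEXT, gender TEXT)"
    )
    rows = [(int(r["person_id"]), r["birth_dt"], r["gender"]) for r in reader]
    conn.executemany("INSERT INTO person VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")  # SQLite is an assumption, used only so the sketch runs
loaded = load_flat_file(conn, FLAT_FILE)
```

Warehouses such as Redshift typically offer a bulk `COPY` command for this step instead of row-by-row inserts; the sketch only shows the file-row-to-table-row correspondence.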

  15. Transform [Diagram: SRC_ tables are mapped to RAW tables.] SELECT SPBPERS.SPBPERS_PIDM AS raw_person_id, SPBPERS.SPBPERS_BIRTH_DATE AS raw_birth_dt, SPBPERS.SPBPERS_DEAD_DATE AS raw_death_dt, SPBPERS.SPBPERS_SEX AS raw_gender, null AS raw_primary_language, null AS raw_country_of_origin FROM src_banner_saturn.spbpers
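
To see what the Transform query above produces, here is a small runnable sketch. It assumes SQLite in place of Redshift, drops the `src_banner_saturn` schema qualifier (the in-memory database here has a single schema), and uses an invented sample row; only the column names and aliases come from the slide.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical stand-in for src_banner_saturn.spbpers with one made-up row.
conn.execute(
    """
    CREATE TABLE spbpers (
        spbpers_pidm INTEGER,
        spbpers_birth_date TEXT,
        spbpers_dead_date TEXT,
        spbpers_sex TEXT
    )
    """
)
conn.execute("INSERT INTO spbpers VALUES (101, '1995-02-17', NULL, 'F')")

# The mapping from the slide: source columns are renamed to raw_* columns,
# and fields the source does not provide are filled with NULL.
TRANSFORM_SQL = """
SELECT spbpers.spbpers_pidm       AS raw_person_id,
       spbpers.spbpers_birth_date AS raw_birth_dt,
       spbpers.spbpers_dead_date  AS raw_death_dt,
       spbpers.spbpers_sex        AS raw_gender,
       NULL                       AS raw_primary_language,
       NULL                       AS raw_country_of_origin
FROM spbpers
"""

cur = conn.execute(TRANSFORM_SQL)
columns = [d[0] for d in cur.description]  # output column names (the aliases)
mapped = dict(zip(columns, cur.fetchone()))
```

The result has the same shape regardless of which source system fed it, which is the point of mapping source tables onto a common model.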

  16. Technology for each phase • Source Raw: load data from the partner institution; ingest using libraries • Canonical Model: mapping the source onto our base model; SQL • Data Science Model: computations to do statistical analytics; SQL/statistical modeling • Aggregation / Application: prepare tables for final output to the application database; SQL

  17. Extract • As long as the data is in the tables, there are export commands that can simply dump the data to a file. [Diagram: APP and RAW tables are exported as flat files.]
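
The Extract step above amounts to dumping a table to a flat file. The sketch below shows the idea with Python's `csv` module and SQLite as a stand-in database; the `raw_person` table, its rows, and `export_table` are hypothetical. Real warehouses have dedicated export commands for this, for example `UNLOAD` in Redshift or `COPY ... TO` in PostgreSQL.

```python
import csv
import io
import sqlite3


def export_table(conn, table, out):
    """Write every row of `table` to `out` as CSV, with a header row."""
    # The table name is interpolated directly, so it must come from trusted code.
    cur = conn.execute(f'SELECT * FROM "{table}" ORDER BY 1')
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow([d[0] for d in cur.description])  # column names as header
    writer.writerows(cur)  # one table row becomes one line in the flat file


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE raw_person (raw_person_id INTEGER, raw_gender TEXT)")
conn.executemany("INSERT INTO raw_person VALUES (?, ?)", [(101, "F"), (102, "M")])

buf = io.StringIO()  # stands in for a file on disk
export_table(conn, "raw_person", buf)
flat_file = buf.getvalue()
```

Passing an open file handle instead of `buf` would write the same content to disk, ready to be loaded into another database.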

  18. Extract • As long as the data is in the tables, there are export commands that can simply dump the data to a file. [Diagram: the exported APP and RAW flat files are loaded into Postgres.]

  19. Data Flow [Diagram: Secure File Transfer Protocol (SFTP), Redshift, Web Server.]

  20. Summary • Jobs in the data industry. • Process Civitas uses to manipulate data. • Methods of loading data into a database. • Careers at Civitas Learning.

  21. Questions?
