CHARTIO WEBINAR
Clean up and structure your database for self-serve analytics
Inspired by Data School’s book Cloud Data Management: 4 Stages for Informed Companies
Clean up and structure your database for self-serve analytics - - PowerPoint PPT Presentation
CHARTIO WEBINAR Clean up and structure your database for self-serve analytics Inspired by Data Schools book Cloud Data Management: 4 Stages for Informed Companies Housekeeping items Please ask questions at any time using the chat! I will
CHARTIO WEBINAR
Inspired by Data School’s book Cloud Data Management: 4 Stages for Informed Companies
Please ask questions at any time using the chat! I will answer them at the end
Recording & slides will be uploaded on our website and shared via email. If you have any questions, please reach out to me at mdavid@chartio.com
Matt David Head of The Data School @ Chartio
○ Source ○ Lake ○ Warehouse ○ Mart
cloud-based) data stack that will truly enable a company to explore and understand the data it collects to have high visibility into their business.
truly informed by their data has significant competitive advantages.
teams collecting more than 100GB of data per day.
stack.
have a few sources of interest.
and your application data in whatever PostgreSQL
with these sources, you might set them up with direct access; it’s more simple and agile for them to just work with the data directly.
Right for you if:
Application Dashboards Excel SQL IDE Cloud Dashboards BI product
Data Wiki Snippet Dictionary BI Layer Meta Modeling
Double Check Results Keep short Dashboards Design before building
You’ve outgrown if:
sources like Salesforce and Hubspot
users need to create their own charts
from applications like Salesforce, Hubspot, Jira, and Zendesk, you’ll want to create a single home for this data so you can access all of it together and with a single SQL syntax, rather than many different APIs.
What is a Warehouse Engine? Deciding factors Modern Warehouse Engine Products
Extract Options
Load Options
Multiple Schemas
Adding new sources Source updates Fixing broken connections
Access in central place Permission tiers
Optimize Queries - dataschool.com/sql-optimization/ BI tool
Database
You’ve outgrown if:
truth.
been quite a nightmare due to Dimensional modeling and OLAP cubes. ○ https://fivetran.com/blog/obt-star-schema
Consolidate Data Sources Simplify Schema Simplify Tables / Columns
Standardize Metrics
SQL
Apply style guide
Make things easy to understand and use
Read Only Custom User Groups Encrypt Columns Audit levels of access
Data Cleanup and Maintenance
Monitor Permissions and Organization Integrity Handling Tool Selection Education / Enablement
Track New Metrics
Deprecate Old Metrics
Permissions
Identify slow queries
Identify common queries
You’ve outgrown if:
explore and understand data themselves
hopefully using the many resources of the Data School
easier use
competitive success
company are able to answer their own questions.
tables in that source of truth, and users will become overwhelmed when trying to find the data that’s relevant to them.
sources of truth for a team or topic of investigation.
Views
Segment tables
Permissions Update
Mart Mayors
○ Schema ○ How to query
○ Performance ○ Integrity
○ Metrics
You’ve outgrown this stage if:
you’d like.
governed stack that will continually evolve and support your informed competitive company.
Download the book at https://dataschool.com/data-governance/
DataSchool.com - Join our Slack