CS591 progress bar Storage Layouts NoSQL Engines Rows vs Cols vs - - PowerPoint PPT Presentation

cs591 progress bar
SMART_READER_LITE
LIVE PREVIEW

CS591 progress bar Storage Layouts NoSQL Engines Rows vs Cols vs - - PowerPoint PPT Presentation

CS 591: Da Data S Systems & & M ML Prof. Manos Athanassoulis mathan@bu.edu http://manos.athanassoulis.net/classes/CS591 CS591 progress bar Storage Layouts NoSQL Engines Rows vs Cols vs Hybrid LSM-Trees Distributed DB Hash-based


slide-1
SLIDE 1

CS 591: Da

Data S Systems & & M ML

  • Prof. Manos Athanassoulis

mathan@bu.edu http://manos.athanassoulis.net/classes/CS591

slide-2
SLIDE 2

CS591 progress bar

Storage Layouts Rows vs Cols vs Hybrid New Hardware Flash Storage Multi-core Indexing When to use? UpBit NoSQL Engines LSM-Trees Hash-based Indexing Data Skipping Adaptive Indexing Scientific Data Management In-situ Query Processing Today: Array Data Distributed DB Database Systems at Global Scale MapReduce Computing at Scale

slide-3
SLIDE 3

CS591 progress bar

Storage Layouts Rows vs Cols vs Hybrid New Hardware Flash Storage Multi-core Indexing When to use? UpBit NoSQL Engines LSM-Trees Hash-based Indexing Data Skipping Adaptive Indexing Scientific Data Management In-situ Query Processing Today: Array Data Distributed DB Database Systems at Global Scale MapReduce Computing at Scale Systems for ML ML building blocks ML for Systems Automatic Data System Design

Data Systems & ML

slide-4
SLIDE 4

CS591 progress bar

Storage Layouts Rows vs Cols vs Hybrid New Hardware Flash Storage Multi-core Indexing When to use? UpBit NoSQL Engines LSM-Trees Hash-based Indexing Data Skipping Adaptive Indexing Scientific Data Management In-situ Query Processing Today: Array Data Distributed DB Database Systems at Global Scale MapReduce Computing at Scale Systems for ML ML building blocks ML for Systems Automatic Data System Design

Data Systems & ML

Data Systems & ML:

how can we ef efficien ently s support s statistical queries on large datasets? how can we us use s stat atistical al anal analysis of data & queries to be better t r tune une data systems?

slide-5
SLIDE 5

CS591 progress bar

Storage Layouts Rows vs Cols vs Hybrid New Hardware Flash Storage Multi-core Indexing When to use? UpBit NoSQL Engines LSM-Trees Hash-based Indexing Data Skipping Adaptive Indexing Scientific Data Management In-situ Query Processing Today: Array Data Distributed DB Database Systems at Global Scale MapReduce Computing at Scale Systems for ML ML building blocks ML for Systems Automatic Data System Design Learned Indexes Learn Data Distributions for Indexing Data Calculator Synthesize Indexes

Data Systems & ML Learning Indexes

CS591 progress bar

Storage Layouts Rows vs Cols vs Hybrid New Hardware Flash Storage Multi-core Indexing When to use? UpBit NoSQL Engines LSM-Trees Hash-based Indexing Data Skipping Adaptive Indexing Scientific Data Management In-situ Query Processing Today: Array Data Distributed DB Database Systems at Global Scale MapReduce Computing at Scale Systems for ML ML building blocks ML for Systems Automatic Data System Design Learned Indexes Learn Data Distributions for Indexing Data Calculator Synthesize Indexes

Data Systems & ML Learning Indexes

Added video presentations for the last four papers!

slide-6
SLIDE 6

Pr Project Pr Presentations

April 29th, 11:59pm: su submi mit p t pro roject re ct report a rt and c code April 30th and May 2nd : 6 + 6 10 6 + 6 10-mi minut nute pre prese sentati ations ns (doodle link will be sent after class) May 7th, 11:59pm (hard deadline): se send u updated re report ( rt (if n needed) (maximum project grade change 10%)

slide-7
SLIDE 7

Visitor: Charles Fracchia

CEO and Co-Founder, BioBright Inc. Data Management at Scale for Scientific (focus on Biomedical) Discovery Visits MiDAS group on May 3rd Talk on Friday, May 3rd, at 11am (room TBA)