CS 591: Data Systems & ML
- Prof. Manos Athanassoulis
mathan@bu.edu http://manos.athanassoulis.net/classes/CS591
CS591 progress bar
Storage Layouts: Rows vs Cols vs Hybrid
New Hardware: Flash Storage, Multi-core
Indexing: When to use?, UpBit
NoSQL Engines: LSM-Trees, Hash-based
Indexing: Data Skipping, Adaptive Indexing
Scientific Data Management: In-situ Query Processing, Today: Array Data
Distributed DB: Database Systems at Global Scale
MapReduce: Computing at Scale
Systems for ML: ML building blocks
ML for Systems: Automatic Data System Design
Data Systems & ML
Data Systems & ML:
How can we efficiently support statistical queries on large datasets?
How can we use statistical analysis of data & queries to better tune data systems?
Learned Indexes: Learn Data Distributions for Indexing
Data Calculator: Synthesize Indexes
Data Systems & ML Learning Indexes
Added video presentations for the last four papers!
April 29th, 11:59pm: submit project report and code
April 30th and May 2nd: 6 + 6 10-minute presentations (doodle link will be sent after class)
May 7th, 11:59pm (hard deadline): send updated report (if needed) (maximum project grade change 10%)
CEO and Co-Founder, BioBright Inc.
Data Management at Scale for Scientific (focus on Biomedical) Discovery
Visits MiDAS group on May 3rd
Talk on Friday, May 3rd, at 11am (room TBA)