A New System for Big Music Data Analysis Daniel Wolff The DML - - PowerPoint PPT Presentation

a new system for big music
SMART_READER_LITE
LIVE PREVIEW

A New System for Big Music Data Analysis Daniel Wolff The DML - - PowerPoint PPT Presentation

A New System for Big Music Data Analysis Daniel Wolff The DML System Provides ... Access : Systematic exploration of heterogenuous and large music libraries Control : Interfacing with complex automatic music analysis tools Analysis


slide-1
SLIDE 1

A New System for Big Music Data Analysis

Daniel Wolff

slide-2
SLIDE 2

The DML System Provides ...

A new system for big music data analysis 2

 Access: Systematic exploration of

heterogenuous and large music libraries

 Control: Interfacing with complex

automatic music analysis tools

 Analysis: Gain summarised knowledge on

large numbers of recordings

 Sharing: Experiments reproducible with same data, clear

provenance of analysis results.

slide-3
SLIDE 3

The T echnical Perspective

A new system for big music data analysis 3

 Access to data  Audio – access restricted by physical location  Metadata – unification of different formats  Control via web interface to large-scale analysis  Interactive UI for overview and exploration  Scalable analysis is available on collection-level

and recording-level

 Share the well-defined and derived data

 Re-use of existing software and published code for analysis

slide-4
SLIDE 4

Software Ecosystem

A new system for big music data analysis 4

 Distributed system

 Virtual machines (VirtualBox)  Open Source OS (Ubuntu)

 Parallelised existing analysis tools

 Python (NumPy)  Vamp Plugins  Big-Data map-reduce (Spark)

 Computation management

 Built on semantic architecture

 Interactive user interface for exploration and analysis

 Built using state-of-the-art web technologies

slide-5
SLIDE 5

Data-Flow for Computational Analysis

A new system for big music data analysis 5

User Interface Web Server Provide Analysis Management: Cliopatria Database: Results & Metadata Computing Server Audio, Transcriptions and Feature Storage Access Audio and Features

slide-6
SLIDE 6

Physical Locations Matter: Content Access

A new system for big music data analysis 6

 Two computing servers, located at BL and ILM

 Allow for in-place access to restricted data

 Dedicated server at City for web access

slide-7
SLIDE 7

Sustainability

A new system for big music data analysis 7

 Preference on Open Source

 Basic infrastructure (Ubuntu, Spark,

Vamp ...)

 Soundsoftware repository for

 Publishing versioned code of newly developed software  Backup and sharing: Open data / features / results

 Open and reproducible method

 Enables similar set-up in further institutions

slide-8
SLIDE 8

Results Implemented in the DML System

The Digital Music Lab Project 8

 Conceptual framework (including imp-

lementation) for collection-level analysis

 Collection in focus as object of analysis

 Data-flow allowing for interactive retrieval of results

 Secure, responsive and redundant network structure

 Distributed placement of computation ressources  Open-source software ecosystem for large-scale music analysis

 Parallelised feature extraction and results management  Collection-level analysis, interface and visualisation