Challenges and opportunities for computational analysis of wax - - PowerPoint PPT Presentation

challenges and opportunities for computational analysis
SMART_READER_LITE
LIVE PREVIEW

Challenges and opportunities for computational analysis of wax - - PowerPoint PPT Presentation

Challenges and opportunities for computational analysis of wax cylinders Joren Six 1 , Olmo Cornelis 2 and Marc Leman 1 1 IPEM, Ghent University, Belgium 2 Indiana University, USA joren.six@ugent.be International Symposium on Computational


slide-1
SLIDE 1

Challenges and opportunities for computational analysis of wax cylinders

Joren Six1, Olmo Cornelis2 and Marc Leman1

1IPEM, Ghent University, Belgium 2Indiana University, USA

joren.six@ugent.be

International Symposium on Computational Ethnomusicological Archiving December 2017 - Hamburg, Germany

slide-2
SLIDE 2

Overview I

Introduction Wax cylinders Archives Challenges Signal/noise Reliability of meta-data Recording/playback speed of wax cylinders Missing context Opportunities Pitch interval analysis Conclusion

2/22

slide-3
SLIDE 3

Wax cylinders

Figure: A wax cylinder recording from a 1911 expedition by Hutereau.

Early field recordings were captured on wax cylinders.

◮ 1895-1935 ◮ No electricity

needed

◮ Noisy ◮ Limited frequency

range

3/22

slide-4
SLIDE 4

Archives: ATM (USA), RMCA (Belgium)

Figure: Locations of recordings in the RMCA-archive.

Collection of the Royal Museum for Central Africa (RMCA), Tervuren, Belgium

◮ More than 35 000 items ◮ Mainly field recordings from Central

Africa

◮ First recordings from 1890s ◮ Many analogue carriers types ◮ Challenging meta-data

Archives of Traditional Music at Indiana University (ATM, USA)

4/22

slide-5
SLIDE 5

Signal/noise

◮ Segmentation ◮ Noise levels ◮ Some repetitive noise sources

Most wax cylinders contain segments with a reasonable signal/noise ratio.

Figure: Wax cylinder, a source of noise

5/22

slide-6
SLIDE 6

Reliability of meta-data — Problems

Meta-data problematic [2, 3]:

◮ Changing geographical nomenclature ◮ Many vernacular names for musical

instruments

◮ Transcription of tonal languages

(Yoruba, Igbo, Ashanti, Ewe)

◮ Collection vs scientific field work

Figure: Kombi, Kembe, Ekembe, Ikembe, Dikembe

  • r Likembe?

6/22

slide-7
SLIDE 7

Reliability of meta-data — Quantify

Check meta-data via duplicate detection[4]

  • 1. Find duplicate items[6]
  • 2. Compare meta-data
  • 3. Analyze differences

2.5% (887 of 35306) duplicates in RMCA archive.

meta-data field1 field 2 field 3 meta-data field1 field 3 Original Duplicate

Figure: Comparison of meta-data fields using duplicates

7/22

slide-8
SLIDE 8

Reliability of meta-data — Fields

Field Empty Different Exact match Fuzzy or exact match Year 20.83% 13.29% 65.88% 65.88% People 21.17% 17.34% 61.49% 64.86% Country 0.79% 3.15% 96.06% 96.06% Province 55.52% 5.63% 38.85% 38.85% Place 33.45% 16.67% 49.89% 55.86% Language 42.34% 8.45% 49.21% 55.74% Title 42.23% 38.40% 19.37% 30.18% Collector 10.59% 14.08% 75.34% 86.71% Table: Comparison of pairs of meta-data fields

8/22

slide-9
SLIDE 9

Reliability of meta-data — Fuzzy

Original title Duplicate title Warrior dance Warriors dance Amangbetu Olia Amangbetu olya Coming out of walekele Walekele coming out Nantoo Yakubu Nantoo O ho yi yee yi yee O ho yi yee yie yee Enjoy life Gently enjoy life Eshidi Eshidi (man’s name) Green Sahel The green Sahel Ngolo kele Ngolokole Table: Pairs of fuzzily matched titles. The fuzzy match algorithm is based

  • n Srensen/Dice coefficients.

9/22

slide-10
SLIDE 10

Recording/playback speed of wax cylinders

Recording speed often unknown.

◮ Various systems (G) ◮ 80-240 cycles/s ◮ Some use reference tones

Absolute pitch unreliable.

Figure: Wax cylinder, speed unkown

10/22

slide-11
SLIDE 11

Missing context

Context needed for a deep understanding

  • f single recordings. A few aspects:

◮ Dance ◮ Language ◮ Religion ◮ Instrument building

Audio only offers a limited snapshot of (music) culture. Context might be changed dramatically and impossible to re-create.

Figure: Wax cylinder, without context

11/22

slide-12
SLIDE 12

Opportunities

Unique snapshots of century old musical practices. Opportunities for comparative studies:

◮ Compare current with past practices ◮ Compare musical idioms with western idioms ◮ Universals in scales? 12/22

slide-13
SLIDE 13

Opportunities

Pitfall

◮ Noisy ◮ Unreliable meta-data ◮ Recording speed

unknown

◮ Context missing for

individuals Avoidance

◮ Select less noisy segments manually ◮ Limit meta-data dependency ◮ Avoid claims about absolute pitch ◮ Focus on patterns, systems,

populations

13/22

slide-14
SLIDE 14

Pitch interval analysis

Manual, computer assisted analysis with Tarsos [5]

Figure: Tarsos software system for pitch analysis.

14/22

slide-15
SLIDE 15

Pitch interval analysis - 4 PC

Interval size (cents) Density 0.0000 0.0005 0.0010 0.0015 100 300 500 700 900 1100

15/22

slide-16
SLIDE 16

Pitch interval analysis - 5 PC

Interval size (cents) Density 0.0000 0.0005 0.0010 0.0015 0.0020 100 300 500 700 900 1100

16/22

slide-17
SLIDE 17

Pitch interval analysis - 6 PC

Interval size (cents) Density 0.0000 0.0005 0.0010 0.0015 0.0020 100 300 500 700 900 1100

17/22

slide-18
SLIDE 18

Pitch interval analysis - 7 PC

Interval size (cents) Density 0.0000 0.0005 0.0010 0.0015 0.0020 0.0025 100 300 500 700 900 1100

18/22

slide-19
SLIDE 19

Pitch interval analysis - Preliminary results

Figure: Diversity in 55 pentatonic scales, ordered by interval size of first interval.

Very large diversity but some general findings:

◮ The fifth is almost always present. ◮ Scales with four and five PC’s

share 240 cents as basic interval.

◮ Scales with six and seven pitch

classes share 170 cents

19/22

slide-20
SLIDE 20

Conclusion

◮ Presented a way to quantify meta-data quality in digital music

archives via duplicates[4, 1]

◮ Presented challenges and opportunities to research on wax

cylinder recordings

◮ Preliminary results on pitch content of 400 wax cylinders 20/22

slide-21
SLIDE 21

Bibliography I

[1] Federica Bressan, Joren Six, and Marc Leman. Applications of duplicate detection: linking meta-data and merging music archives. The experience of the IPEM historical archive of electronic music. In Proceedings of 4th International Digital Libraries for Musicology workshop (DLfM 2017), page submitted, Shanghai (China), 2017. ACM Press. [2] Olmo Cornelis, Rita De Caluwe, Guy Detr, Axel Hallez, Marc Leman, Tom Matth, Dirk Moelants, and Jos Gansemans. Digitisation of the ethnomusicological sound archive of the rmca. IASA Journal, 26:35–44, 2005. [3] Olmo Cornelis, Micheline Lesaffre, Dirk Moelants, and Marc Leman. Access to ethnic music: Advances and perspectives in content-based music information retrieval. Signal Processing, 90(4):1008 – 1031, 2010. Special Section: Ethnic Music Audio Documents: From the Preservation to the Fruition. [4] Joren Six, Federica Bressan, and Marc Leman. Applications of duplicate detection in music archives: From metadata comparison to storage optimisation - The case of the Belgian Royal Museum for Central Africa. In Proceedings of the 13th Italian Research Conference on Digital Libraries (IRCDL 2018), In Press - 2018. [5] Joren Six, Olmo Cornelis, and Marc Leman. Tarsos, a modular platform for precise pitch analysis of Western and non-Western music. Journal of New Music Research, 42(2):113–129, 2013.

21/22

slide-22
SLIDE 22

Bibliography II

[6] Joren Six and Marc Leman. Panako - A scalable acoustic fingerprinting system handling time-scale and pitch modification. In Proceedings of the 15th ISMIR Conference (ISMIR 2014), pages 1–6, 2014.

22/22