Identifying nocturnal bird calls Presentation at the Department of - - PowerPoint PPT Presentation

identifying nocturnal bird calls
SMART_READER_LITE
LIVE PREVIEW

Identifying nocturnal bird calls Presentation at the Department of - - PowerPoint PPT Presentation

Identifying nocturnal bird calls Presentation at the Department of Conservation Christchurch Douglas Bagnall and Edward Abraham June Executive summary not yet useful for Tier monitoring CCBY


slide-1
SLIDE 1

Identifying nocturnal bird calls

Presentation at the Department of Conservation Christchurch Douglas Bagnall and Edward Abraham June

Executive summary not yet useful for Tier monitoring

CCBY Presentation made available under a Creative Commons licence

slide-2
SLIDE 2

Goals

Identify kiwi morepork and weka in nocturnal recordings Allow recordings to be ignored that are unlikely to contain calls

to reduce the effort needed to score calls

Facilitate consistent automated monitoring of acoustic data

from around New Zealand

slide-3
SLIDE 3

Recurrent Neural Networks

slide-4
SLIDE 4

Audio processing

slide-5
SLIDE 5

Recurrent Neural Networks

slide-6
SLIDE 6

Parallel processing

slide-7
SLIDE 7

An example prediction

10 20 30 40 50 60 0.0 0.2 0.4 0.6 0.8 1.0

RFPT−LPC−2011−11−26T13:45:03Z−540−60.wav

Seconds Score

slide-8
SLIDE 8

Identifying kiwi

slide-9
SLIDE 9

Training data

Provided with minute audio files from the Tier

data set

First calls of each bird species in each file labelled using

Freebird

A total of and files containing kiwi morepork

and weka respectively

Files with kiwi in the Tier training set

species

  • browntokoeka
  • great spot
  • little spot
  • spp
  • total
slide-10
SLIDE 10

Training data

Requires a welllabelled training set Current Tier protocol not ideal for three reasons

  • For kiwi and weka there were insufficient examples in the

training data

  • not all calls are labelled
  • time bounding of calls isn’t precise

Carried out our own labelling of morepork calls Used data from the Rimutaka Forest Park Trust for kiwi

slide-11
SLIDE 11

Rimutaka kiwi

Data from the Rimutaka Forest Park Trust minute clips Half of the clips with high energy in the kiwi frequency Half of the clips randomly sampled from the remaining

clips

Added in minute kiwiless clips from the Tier set Held out clips as a test set

slide-12
SLIDE 12

A successful prediction

10 20 30 40 50 60 0.0 0.2 0.4 0.6 0.8 1.0

RFPT−LPC−2011−11−26T13:45:03Z−540−60.wav

Seconds Score

slide-13
SLIDE 13

No kiwi here

10 20 30 40 50 60 0.0 0.2 0.4 0.6 0.8 1.0

RFPT−LPA−2011−12−25T16:45:02Z−120−60.wav

Seconds Score

slide-14
SLIDE 14

This tūi might be a kiwi

10 20 30 40 50 60 0.0 0.2 0.4 0.6 0.8 1.0

RFPT−SG2−2012−03−16T22:45:03Z−660−60.wav

Seconds Score

slide-15
SLIDE 15

And it didn’t find this call

10 20 30 40 50 60 0.0 0.2 0.4 0.6 0.8 1.0

RFPT−LPB−2011−11−19T15:00:02Z−600−60.wav

Seconds Score

slide-16
SLIDE 16

Kiwi RNN applied to test data

An AUC of

0.0 2.5 5.0 7.5 10.0 0.00 0.25 0.50 0.75 1.00

RNN score Density

Kiwi FALSE TRUE

Score below Score above Correct No kiwi

  • kiwi
slide-17
SLIDE 17

Kiwi RNN applied to Tier data

An AUC of

Not useful for discriminating kiwi in the Tier data set

1 2 3 0.00 0.25 0.50 0.75 1.00

RNN score Density

Kiwi FALSE TRUE

Score below Score above Correct No kiwi

  • kiwi
slide-18
SLIDE 18

What has gone wrong?

Multiple kiwi species in Tier data set Greater diversity of background sounds More possibility of mistakes in minute data

slide-19
SLIDE 19

Difficult to distinguish kiwi and weka

Weka Weka sounding like a kiwi Labelled in Freebird as kiwiweka duet Labelled in Freebird as weka

slide-20
SLIDE 20

Identifying morepork

slide-21
SLIDE 21

Morepork calls

Use the first minute of each Tier file and the rest of

‘interesting’ files

Count any morepork call type ruru quee etc as a

morepork

Extend data by changing the levels and blending known

morepork with a range of background noise

A total of labelled minutes with morepork A total of calling periods

slide-22
SLIDE 22

Finding morepork calls

slide-23
SLIDE 23

Morepork RNN applied to test data

An AUC of

5 10 15 20 0.00 0.25 0.50 0.75 1.00

RNN score Density

Morepork FALSE TRUE

Score below Score above Correct No morepork

  • Morepork
slide-24
SLIDE 24

Morepork RNN applied to Tier data

An AUC of

Not useful for discriminating morepork in the Tier data set

1 2 3 4 0.00 0.25 0.50 0.75 1.00

RNN score Density

Morepork FALSE TRUE

Score below Score above Correct No morepork

  • Morepork
slide-25
SLIDE 25

Morepork RNN

Classifier not as accurate on minute training holdout clips

as kiwi

Morepork are harder as the individual calls are shorter Perhaps there are difficulties with the diversity of calls and

wide variation in intensities

Performance degrades as the interval is extended to

minutes

slide-26
SLIDE 26

Summary

Recurrent Neural Networks not yet suitable for automating

Tier acoustic monitoring

To improve would require specialised training data calls

welllocated in time and with large numbers of cases

May need other modelling methods eg Random Forests

to go from continuous score of the RNN to a classification of the audio file

Positively the RNNs will be useful for finding infrequent kiwi

calls at sites similar to the Rimutakas