Hardware Model and Software Validation for AcoustiGLASS
(Autonomous Wearable Alert Device based
- n Sound Pattern Recognition)
Kei Kojima March 2016
Hardware Model and Software Validation for AcoustiGLASS (Autonomous - - PowerPoint PPT Presentation
Hardware Model and Software Validation for AcoustiGLASS (Autonomous Wearable Alert Device based on Sound Pattern Recognition) Kei Kojima March 2016 OBJECTIVE & DESIGN CRITERIA The objective is to build an autonomous hearing glass
Kei Kojima March 2016
microphones RGB leds
microphones RGB leds Raspberry Pi 2
After the spectrogram of the recorded audio has been constructed…!
Police Reference Spectrogram
Bluejay to Bluejay! 2-D cross- correlation
10 - 4 10 - 3
Left Right
0.045 0.100
Envelope of Audio Wave (RIGHT)
Envelope of Audio Wave (Left)
POLICE SIREN! Mechanical sounds makes the detection relatively.! Has three distinct frequencies between 1 and 3 kHz.
GUNSHOT! Has an short abrupt frequency ranging from 0 to 64 kHz BLUEJAY CALL! Has distinct and discrete frequency peaks between two distinct frequencies approximately 10 and 20 kHz
2-D cross-correlation result
34 201 286 121 106 167 60 165 470 329 244 334 299 109 271 359 405 570 585 479 256 186 229 550 615 730 409 206 116 309 595 760 575 349 221 137 263 504 434 339 222 51 66 119 256 181 256 25 72
1 * 8 + 7 * 3 + 13 * 4 + 8 * 1 + 14 * 5 + 20 * 9 + 15 * 6 + 16 * 7 + 22 * 2 = 585
(5+3-1)-by-(5+3-1) or 7-by-7 matrix
5 by 5 matrix 3 by 3 matrix
Values of M1! matrix Values of M2! matrix
Alignment of center ! element of M2
Dog Growl (input) vs. Dog Growl (reference)
Strong correlation:!
peak.!
center line.!
☑ Sound category identified!
Dog Growl (input) vs. Bird chirp (reference)
Weak correlation:!
peaks.!
center.!
☐ Sound category rejected!!
max peak/ second peak! 2.582 peak location = 105.1
Averaged Police to Police cross-correlations(abs value)
its location is 67 on the x axis. This is not in between the thresholds of 62 - 66.!
highest peak was 2.0605. This is not in between the thresholds of 2.4 - 2.6.!
is not between the thresholds of >102 and <110.
peak location = 101
Averaged Police to Dog cross-correlations(abs value)
max peak/ second! peak = ! 2.0605
location is 64 on the x axis(64 is the exact middle in a graph that contains 127 columns) and it is between 62 - 66. !
highest peak was 2.582, which is between 2.4 - 2.6.!
which is between >102 and <110.
REJECTION
20 13 7 13 7 20 6 14
REJECTION
30 25 5 28 2 29 1 22 8
P: Police D: Dog S: Smoke alarm B: Bird chirping G: Gunshot P D B S G P D B S G P D B S G Simulation Test 1:!
Simulation Test 2 ! (improved peak detection scheme/thresholds):!
Applications!
(in addition to vision sensors).!
Bag of Features! The bag of features technique in which different features are taken from each reference spectrograms and the recorded spectrograms.!
Machine Learning! A endeavor into Machine Learning which trains the computer pr micro-controller to “learn” information directly from data without assuming a predetermined equation as a model, can be worthwhile in the longer outlook when detection with 2-D cross- correlation between the input signal and hundreds of reference spectrograms may be come computationally heavy.!
Add-on Systems! The development of an algorithm that is compatible with the iPhone or Google glass! would be suitable. The computation rates would be much better and the visualizing section (Google Glass)would be much better. Although the process of acquiring smart glasses would be costly, the insurances and state-owned funds would minimize the cost for the user’s.