
Audio Classification - Keyword Spotting

This is the finished Edge Impulse project for the tutorial 'Responding to your voice'. From here you can acquire new training data, design impulses, and train models.

Audio

About this project

Have you ever wanted to make your own "Ok, Google" or "Alexa" keyword spotting model? This dataset contains three classes: the helloworld samples were collected by Edge Impulse teams, the noise samples come from the Microsoft Scalable Noisy Speech Dataset, and the unknown samples are based on a subset of the Google Speech Commands Dataset.

This dataset can be used to build an Edge AI project detecting the "Hello World" keyword phrase.

You can also follow our tutorial, which guides you through building your keyword spotting model, from data collection to deployment on embedded devices.

Compatible Blocks

Not sure what to choose? Try out this dataset with the EON Tuner.

unknown.2aca1e72_nohash_4.wav.1ncrnqnv
helloworld.aurelien.wav.1ncrrdp9.s82
unknown.2fee065a_nohash_2.wav.1ncrnqnm
unknown.5a98d407_nohash_0.wav.1ncrniil
noise.orig_train.NeighborSpeaking_10.wav.20000.wav.1ncroctu
helloworld.mauricio6.wav.1ncrqrbc.s4
unknown.7cf14c54_nohash_4.wav.1ncrnjgg
noise.orig_train.NeighborSpeaking_8.wav.9000.wav.1ncroco3
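The sample names above all begin with one of the three labels, followed by a dot. As a minimal sketch, assuming the text before the first dot is the class label (the helper name label_from_filename is hypothetical and not part of any Edge Impulse tool), the samples can be grouped by class like this:

```python
from collections import Counter


def label_from_filename(name: str) -> str:
    """Return the text before the first dot (assumed to be the class label)."""
    return name.split(".", 1)[0]


# The sample names listed on this page.
samples = [
    "unknown.2aca1e72_nohash_4.wav.1ncrnqnv",
    "helloworld.aurelien.wav.1ncrrdp9.s82",
    "unknown.2fee065a_nohash_2.wav.1ncrnqnm",
    "unknown.5a98d407_nohash_0.wav.1ncrniil",
    "noise.orig_train.NeighborSpeaking_10.wav.20000.wav.1ncroctu",
    "helloworld.mauricio6.wav.1ncrqrbc.s4",
    "unknown.7cf14c54_nohash_4.wav.1ncrnjgg",
    "noise.orig_train.NeighborSpeaking_8.wav.9000.wav.1ncroco3",
]

# Count how many of the listed samples fall under each label.
label_counts = Counter(label_from_filename(s) for s in samples)

for label, count in sorted(label_counts.items()):
    print(f"{label}: {count}")
```

The eight names shown here are only a preview of the dataset, so these counts say nothing about the overall class balance.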
This project has no trained model yet.

Dataset summary

Data collected 34m 22s
Sensor audio @ 16 kHz
Labels helloworld, noise, unknown
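To get a feel for the size of the dataset, the summary figures can be turned into a raw sample count. This is a rough sketch: the duration and sample rate come from the summary above, while the 16-bit mono PCM encoding is an assumption made only to estimate storage and is not stated on this page.

```python
SAMPLE_RATE_HZ = 16_000  # from the summary: audio @ 16 kHz
BYTES_PER_SAMPLE = 2     # ASSUMPTION: 16-bit mono PCM, not stated on the page


def duration_seconds(minutes: int, seconds: int) -> int:
    """Convert a minutes/seconds duration into whole seconds."""
    return minutes * 60 + seconds


total_seconds = duration_seconds(34, 22)          # 34m 22s of collected data
total_samples = total_seconds * SAMPLE_RATE_HZ    # raw audio samples
approx_bytes = total_samples * BYTES_PER_SAMPLE   # uncompressed size estimate

print(f"Total duration: {total_seconds} s")
print(f"Total samples:  {total_samples:,}")
print(f"Approx. size:   {approx_bytes / 1_000_000:.1f} MB (under the PCM assumption)")
```

The byte estimate changes proportionally if the audio is stored at a different bit depth or in a compressed format.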

Project info

Project ID 499022
Project version 4
License BSD 3-Clause Clear
No. of views 28,235
No. of clones 441