Note: This is part two of a series on putting a neural network on the Wio Terminal device. Start from here.
FSD50K is a dataset from Freesound. It contains around 50k sound clips from https://freesound.org/. The clips are labelled with labels based on the AudioSet ontology, with some extra labels. I only use the dev data; you should use both dev data and the…