I have a small acoustic dataset of human sounds which I would like to augment and later pass to a binary classifier.
I am familiar with data augmentation for images, but how is it done for acoustic datasets?
I've found two related answers covering autoencoders and SpecAugment with PyTorch & TorchAudio, but I would like to hear your thoughts on the audio-specific "best method".
It really depends on what you are trying to achieve, what your classifier is designed for, and how it works.
Depending on the above, you can, for example, cut the audio into segments in different ways (if you are feeding the classifier with cut audio segments and that makes sense in your particular case). You can also augment it with background noise (artificial, like white noise, or recorded) mixed in at different signal-to-noise ratios; this should additionally make the classifier more robust against noise.
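As a rough illustration of the noise-mixing idea, here is a minimal NumPy sketch that scales a noise signal so the mix hits a target signal-to-noise ratio in dB, plus a random-crop helper for the segment-cutting idea. The function names (`add_noise_at_snr`, `random_crop`) and the sine-tone example are my own illustrative choices, not from any particular library:

```python
import numpy as np

def add_noise_at_snr(signal, noise, snr_db):
    """Mix `noise` into `signal` at a target signal-to-noise ratio (in dB)."""
    # Tile or trim the noise to match the signal length.
    if len(noise) < len(signal):
        noise = np.tile(noise, int(np.ceil(len(signal) / len(noise))))
    noise = noise[:len(signal)]
    signal_power = np.mean(signal ** 2)
    noise_power = np.mean(noise ** 2)
    # Scale the noise so that signal_power / scaled_noise_power == 10**(snr_db / 10).
    scale = np.sqrt(signal_power / (noise_power * 10 ** (snr_db / 10)))
    return signal + scale * noise

def random_crop(signal, crop_len, rng):
    """Return a random fixed-length segment of `signal`."""
    start = rng.integers(0, len(signal) - crop_len + 1)
    return signal[start:start + crop_len]

rng = np.random.default_rng(0)
clean = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)  # 1 s of a 440 Hz tone at 16 kHz
white = rng.standard_normal(16000)                          # artificial white noise
noisy = add_noise_at_snr(clean, white, snr_db=10)
segment = random_crop(noisy, crop_len=4000, rng=rng)
```

In practice you would sample the SNR (and the crop position) randomly per training example, so the classifier sees a different perturbation of each clip on every epoch; torchaudio's transforms can do the same thing on tensors if you are already in the PyTorch pipeline.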