I am extracting prosody features from an audio file while using Opensmile using Windows version of Opensmile. It runs successful and an output csv is generated. But when I open csv, it shows some rows that are not readable. I used this command to extract prosody feature:
SMILEXtract -C \opensmile-3.0-win-x64\config\prosody\prosodyShs.conf -I audio_sample_01.wav -O prosody_sample1.csv
And the output of csv looks like this:
[
Even I tried to use the sample wave file given in Example audio folder given in opensmile directory and the output is same (not readable). Can someone help me in identifying where the problem is actually? and how can I fix it?
You need to enable the csvSink component in the configuration file to make it work. The file
config\prosody\prosodyShs.conf
that you are using does not have this component defined and always writes binary output.You can verify that it is the standart binary output in this way: omit the
-O
parameter from your command so it becomesSMILEXtract -C \opensmile-3.0-win-x64\config\prosody\prosodyShs.conf -I audio_sample_01.wav
and execute it. You will get aoutput.htk
file which is exactly the same as theprosody_sample1.csv
.How output csv? You can take a look at the example configuration in
opensmile-3.0-win-x64\config\demo\demo1_energy.conf
where acsvSink
component is defined.You can find more information in the official documentation: