I have a .wav file and I want to read it in C++. I have done some research on the RIFF file header and wrote a code to load it.
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <math.h>
using namespace std;
#define BUFFER_LEN 4096
int main(int argc,char * argv[])
{
// Buffers etc..
char ChunkID[4], Format[4], Subchunk1ID[4],Subchunk2ID[4];
int ChunkSize,Subchunk1Size, SampleRate, ByteRate,Subchunk2Size;
short AudioFormat, NumChannels, BlockAlign, BitsPerSample;
// Read the wave file
FILE *fhandle=fopen(argv[1],"rb");
fread(ChunkID,1,4,fhandle);
fread(&ChunkSize,4,1,fhandle);
fread(Format,1,4,fhandle);
fread(Subchunk1ID,1,4,fhandle);
fread(&Subchunk1Size,4,1,fhandle);
fread(&AudioFormat,2,1,fhandle);
fread(&NumChannels,2,1,fhandle);
fread(&SampleRate,4,1,fhandle);
fread(&ByteRate,4,1,fhandle);
fread(&BlockAlign,2,1,fhandle);
fread(&BitsPerSample,2,1,fhandle);
fread(&Subchunk2ID,1,4,fhandle);
fread(&Subchunk2Size,4,1,fhandle);
fclose(fhandle);
// print RIFF info
printf("\%c",ChunkID[0]);
printf("\%c",ChunkID[1]);
printf("\%c",ChunkID[2]);
printf("\%c",ChunkID[3]);
cout << endl;
// print chunk size
printf("%d",ChunkSize);
cout << endl;
// print format
printf("\%c",Format[0]);
printf("\%c",Format[1]);
printf("\%c",Format[2]);
printf("\%c",Format[3]);
cout << endl;
// print sub chunk 1 ID
printf("\%c",Subchunk1ID[0]);
printf("\%c",Subchunk1ID[1]);
printf("\%c",Subchunk1ID[2]);
printf("\%c",Subchunk1ID[3]);
cout << endl;
// print sub chunk 1 size
printf("%d",Subchunk1Size);
cout << endl;
// print audio format
printf("%hd",AudioFormat);
cout << endl;
// print number of channels
printf("%hd",NumChannels);
cout << endl;
return 0;
}
However, it is very wired that I run this code on my input file OS.wav. It output the information:
RIFF
307201488
WAVE
Fake
2
0
28006
You can see the "fmt" is "Fake".
Then I use sox to convert this wav file by:
sox OS.wav test.wav
and run my code again. It has the information:
RIFF
307200050
WAVE
fmt
18
3
2
I did not change any thing. But the header information is so different. Could anyone tell me why this happen, please?
Thank you.
You are assuming the
fmt
chunk is the first sub-chunk within theRIFF/WAVE
chunk. That is not a requirement or a guarantee. Do some more research on the RIFF/WAV format.WAVE
sub-chunks can appear in any order, the only rule being that thefmt
chunk must appear before thedata
chunk, but other chunks can appear before, after, and in between them.The correct way to parse your file is to loop through the sub-chunks one at a time. Read a chunkID and chunkSize, then read the specified number of bytes (taking padding into account) and process them according to the chunkID as needed, and then repeat until EOF. This lets you process the chunks you are interested in and skip the ones you don't care about. Don't make any assumptions about their order (but do validate the one special case), and definitely don't assume the
fmt
chunk is the first chunk, because it might not be.Try something more like this: