THE AUD FILE FORMAT Revision 3 by Vladan Bato (bat22@geocities.com) In this document I'll try to describe the AUD file format used in Command & Conquer and Redalert. Command & Conquer is a trademark of Westwood Studios, Inc. Command & Conquer is Copyright (C)1995 Westwood Studios, Inc. Command & Conquer: Red Alert is a trademark of Westwood Studios, Inc. Command & Conquer: Red Alert is Copyright (C)1995,1996 Westwood Studios, Inc. The information provided here is for anyone who would like to make an AUD player program or AUD to WAV or WAV to AUD converters. Most information about AUD files and IMA-ADPCM compression has been provided by Douglas McFadyen. I won't explain here the format of the WAV files. You'll have to find this info yourself. I'll just tell you how to obtain 16-bit PCM data and how to encode it. I will use Pascal-like notation throughout this document. =============================== 0. IMPRTANT NOTE - WHAT'S NEW =============================== This revision contains an important difference in the IMA-ADPCM compression routine. Instead of computing the diffrence between the current and previous sample, it computes the difference between the current sample and the value that the decoding routine will predict for the previous sample. This is the way the algorithm is implemented in C&C. If you implement it the way it was in previous revisions of this document, the sound will be the same but there will be a "pop" sound at the end. ============== 1. AUD FILES ============== The AUD files have the following header : Header : record SamplesPerSec : word; {Frequency} Size : longint; {Size of file (without header)} OutSize : longint; {Size of ouput data} Flags : byte; {bit 0=stereo, bit 1=16bit} Typ : byte; {1=WW compressed, 99=IMA ADPCM} end; There are two types of compression. The first is the IMA-ADPCM compression used for 16-bit sound. It's used in most AUD files. The other one is a Westwood's proprietary compression for 8-bit sound and is used only for death screams. I won't describe it in this document because I don't know how it works. The rest of the AUD files is divided in chunks. These are usually 512 bytes long, except for the last one. Each chunk has the following header : ChunkHd : record Size : word; {Size of compressed data} OutSize : word; {Size of ouput data} ID : longint; {Always $0000DEAF} end; The IMA-ADPCM compression compresses 16-bit samples to 4 bits. This means that OutSize will be apporximately 4*Size. The IMA-ADPCM compression and decompression are described in the following sections. Note that the current sample value and index into the Step Table should be initialized to 0 at the start and are mantained across the chunks (see below). ========================== 2. IMA-ADPCM COMPRESSION ========================== I won't describe the theory behind the IMA-ADPCM compression. I will just give some pseudo code to compress and decompress data. The compression algorithm takes a stream of signed 16-bit samples in input and produces a stream of 4-bit codes in output. The 4-bit codes are stored in pairs (two codes in one byte). The first one is stored in the lower four bits. Two varaibles must be mantained while compressing : the previous sample value and the current index into the step table. You can find the Step Table in Appendix B. The Index adjustment table is in Appendix A. Here's the pseudo-code that will do the compression : Index:=0; Prev_Sample:=0; while there_is_more_data do begin Cur_Sample:=Get_Next_Sample; Delta:=Cur_Sample-Prev_Sample; if Delta<0 then begin Delta:=-Delta; Sb:=1; end else Sb:=0; {Sb is bit 4 of the output Code (sign bit)} Code := 4*Delta div Step_Table[Index]; if Code>7 then Code:=7; {These are the 3 low-order bits of output code} Index:=Index+Index_Adjust[Code]; if Index<0 then Index:=0; if Index>88 the Index:=88; Predicted_Delta:=(Step_Table[Index]*Code) div 4 + Step_Table[Index] div 8; {This is the Delta that decoding routine will compute} Prev_Sample:=Prev_Sample+Predicted_Delta; if Prev_Sample>32767 then Prev_Sample:=32767 else if Prev_Sample<-32768 then Prev_Sample:=-32768; {Prev_Sample is the sample value that the decoding routine will compute} Output_Code(Code+Sb*8); end; Note that this code is usually implemented in more efficient manner (No need to divide). The Get_Next_Sample function should return the next sample from the input buffer. The Output_Code function should store the 4-bit code to the output buffer. One byte contains two 4-bit codes, and this function should take care of this. ============================ 3. IMA-ADPCM DECOMPRESSION ============================ It is the exact opposite of the above. It receives 4-bit codes in input and produce 16-bit samples in output. Again you have to mantain an Index into the Step Table an the current sample value. The tables used are the same as for compression. Here's the code : Index:=0; Cur_Sample:=0; while there_is_more_data do begin Code:=Get_Next_Code; if (Code and $8) <> 0 then Sb:=1 else Sb:=0; Code:=Code and $7; {Separate the sign bit from the rest} Delta:=(Step_Table[Index]*Code) div 4 + Step_Table[Index] div 8; {The last one is to minimize errors} if Sb=1 then Delta:=-Delta; Cur_Sample:=Cur_Sample+Delta; if Cur_Sample>32767 then Cur_Sample:=32767 else if Cur_Sample<-32768 then Cur_Sample:=-32768; Output_Sample(Cur_Sample); Index:=Index+Index_Adjust[Code]; if Index<0 then Index:=0; if Index>88 the Index:=88; end; Again, this can be done more efficiently (no need for multiplication). The Get_Next_Code function should return the next 4-bit code. It must extract it from the input buffer (note that two 4-bit codes are stored in the same byte, the first one in the lower bits). The Output_Sample function should write the signed 16-bit sample to the output buffer. ========================================= Appendix A : THE INDEX ADJUSTMENT TABLE ========================================= Index_Adjust : array [0..7] of integer = (-1,-1,-1,-1,2,4,6,8); ============================= Appendix B : THE STEP TABLE ============================= Steps_Table : array [0..88] of integer =( 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 19, 21, 23, 25, 28, 31, 34, 37, 41, 45, 50, 55, 60, 66, 73, 80, 88, 97, 107, 118, 130, 143, 157, 173, 190, 209, 230, 253, 279, 307, 337, 371, 408, 449, 494, 544, 598, 658, 724, 796, 876, 963, 1060, 1166, 1282, 1411, 1552, 1707, 1878, 2066, 2272, 2499, 2749, 3024, 3327, 3660, 4026, 4428, 4871, 5358, 5894, 6484, 7132, 7845, 8630, 9493, 10442, 11487, 12635, 13899, 15289, 16818, 18500, 20350, 22385, 24623, 27086, 29794, 32767 ); --- Vladan Bato (bat22@geocities.com) http://www.geocities.com/SiliconValley/8682