1. Terms and definitions
For the purposes of this document, the following terms and definitions apply.1.1. IAMF audio data frame
bit stream generated by an IAMF encoder containing audio data and metadata representing a block of linear PCM samples according to the IAMF specification.1.2. frame length
number of linear PCM audio samples per IAMF audio data frame. The IAMF specification allows for multiple codecs, sampling rates and frame sizes. This burst specification constrains the use case to IAMF streams only employing codecs at a sampling rate of 48 kHz and an analysis data frame (update rate) of 20 ms, which corresponds to a frame length of 960 samples.1.3. latency
delay time of an external audio decoder to decode an IAMF audio data-burst defined as the sum of two values - the receiving delay and the decoding delay time.1.4. Abbreviated terms
HBR : high bit rate2. Mapping of the audio bitstream on IEC 61937-1
2.1. General
The coding of the bitstream and data-burst is in accordance with [IEC-61937-1] and [IEC-61937-2], including field names such as “Pa”, “Pc”, “Pe”, “Pf”, and “R”.2.2. IAMF burst-info
Pc bits 0 to 7 shall indicate extended data-type according to [IEC-61937-1] clause 6.1.8.3. As all bits are used in the preamble word of Pc, the value of data-type in Pc bits 0 to 4 is set to be 0x1F and the value of data-type bits 5 to 6 are set to 0. The extended data-type burst preamble Pe is used and the value is set to 0x0003. The 32-bit burst-info contains information about the data which will be found in the data-burst. (see Table 1).
Bits 0 to 6 of the burst-info (Pc) and bits 0 to 15 of the burst-info (Pe) signal the data-type used for transmission. For IAMF, the signalled extended data-type is 0x0003.
The Pc bits 8 to 9 indicate the IAMF bitstream mode. The repetition periods of data bursts in IEC 60958 frames shall be determined from the data-type-dependent information specified in Table 1.
3. Format of IAMF data-burst
3.1. General
This clause specifies the IAMF data-burst. Specific properties such as reference points, repetition period, the method of filling stream gaps, and decoding latency are specified.The decoding latency (or delay) should be used by the transmitter to schedule data-bursts as necessary to establish synchronization between picture and decoded audio.
3.2. Pause data-burst
The pause data-bursts for IAMF are defined in Table 2

If regular audio data-bursts are not being transmitted due to, for example, a pause condition, it is recommended to use pause data-bursts to fill such stream gaps. The repetition period of the pause data-bursts should be selected according to Table 2. If other repetition periods are necessary to precisely fill the stream gap length, or to meet the requirement on audio data-burst spacing (see IEC 61937-1), pause data-bursts may have other lengths.
When a stream gap in an audio stream is filled by a sequence of pause data-bursts, the Pa of the first pause data-burst shall occur after exactly that number of IEC 60958 frames as indicated by the IAMF audio data-frame length in conjunction with the data-type-dependent information from Table 3. It is recommended that the sequence of pause data-bursts that fills the stream gap should continue from this point until the Pa of the first audio data-burst that follows the stream gap, or as close as possible to that point, considering the specific IEC 60958 frame length of the pause data-burst with respect to the IAMF audio data frame length. The repetition-period-length parameter contained in the pause data-burst is intended to be interpreted by the receiver as an indication of the number of decoded PCM samples that are missing (due to the resulting audio gap).
3.3. Audio data-burst
3.3.1. The IAMF data
An IAMF bit stream consists of a sequence of IAMF audio data frames. The IAMF data-burst is headed with a burst-preamble, followed by the burst-payload. The structure of the IAMF data-burst is shown in Figure 1. The extended data-type Pe bits 0 to 15 of an IAMF data-burst is 0x0003h, and the data-type Pc bits 8 to 9 is 0. When IAMF data is being transmitted, the transmission device shall ensure that both the extended data-type Pe bits 0 to 15 and data-type Pc bits 8 to 9 values are set correctly. Additionally, the receiving device shall utilize both the extended data-type Pe bits 0 to 15 and data-type Pc bits 8 to 9 values to ensure that the content of the data-burst is correctly identified as IAMF.
The IEC 60958 frame rate for data-type IAMF shall be equal to the audio sample rate. The repetition period of data-bursts in IEC 60958 frames shall be determined from the IAMF audio data frame length in accordance with Table 1 (Pc bits 10-11). The maximum data-burst payload size in bits is determined from 2 x 16 x Rp – (Paf + Bs), where
-
Rp is the repetition period of data-bursts in IEC 60958 frames
-
Paf is the size of preamble words Pa-Pf in bits
-
Bs is the burst spacing in bits
With a repetition period according to Table 1, the maximum data-burst payload size is then 2 x 16 x 960 - ( 6 x 16 + 4 x 16) = 30560 bits. It is allowed for the size of the IAMF audio data frames to exceed the maximum data-burst payload size. In this case the IAMF audio data frames will overlap with the next burst payload.
3.3.2. The IAMF HBR data
When the required transmission rate for IAMF audio data exceeds the maximum data rate supported by an IAMF audio data burst, an IAMF HBR data-burst is used instead. An IAMF HBR data-burst is headed with a burst-preamble, followed by the burst-payload. The structure of the IAMF HBR data-burst is shown in Figure 1. The extended data-type Pe bits 0 to 15 of an IAMF data-burst is 0x0001h, and the data-type Pc bits 8 to 9 is 1. When IAMF data is being transmitted, the transmission device shall ensure that both the extended data-type Pe bits 0 to 15 and data-type Pc bits 8 to 9 values are set correctly. Additionally, the receiving device shall utilize both the extended data-type Pe bits 0 to 15 and data-type Pc bits 8 to 9 values to ensure that the content of the data-burst is correctly identified as IAMF HBR.
The IEC 60958 frame rate for data-type IAMF HBR shall be equal to two, four, eight, or sixteen times the audio sample rate. Pc bits 10 to 11 indicate the relationship between the IEC 60958 frame rate and the audio sample rate. The repetition period of data-bursts in IEC 60958 frames shall be determined from the IAMF audio data frame length and the IEC 60958 frame rate in accordance with Table 1 (Pc bits 10-11). The maximum data-burst payload size in bits is determined from 2 x 16 x Rp – (Paf + Bs), where
-
Rp is the repetition period of data-bursts in IEC 60958 frames
-
Paf is the size of preamble words Pa-Pf in bits
-
Bs is the burst spacing in bits
With repetition periods according to Table 1, the maximum data-burst payload size can then be calculated and be found in Table 3.

3.3.3. Latency of IAMF decoding
The latency of an external audio decoder to decode IAMF is defined as the sum of the receiving time of one IAMF audio data frame in one or more data-bursts and the time used for decoding of one IAMF audio data frame.Each data-burst contains a minimum of 4 mandatory fields (Pa, Pb, Pc, and Pd) and an extended field of Pe and Pf of length 16 bits each. The repetition period of data-bursts in IEC 60958 frames is computed according to information from Table 1 and Table 3. The reception delay for one IAMF audio data frame is calculated as the time elapsed counting from the first bit of the data-burst until the last bit of the actual audio payload inside the data-burst received. Subsequent stuffing is not considered. After a complete IAMF audio data frame is received immediate decoding and subsequent rendering of the audio frame is not recommended as the size of the next audio frame and therefore time required for receiving it completely cannot be determined accurately.
The maximum size of an IAMF audio data frame is dependent on certain parameters of the IAMF bit stream, for example, the number of encoded channels. Details are specified in the IAMF specification.
In order to simplify the timing mechanism for receiving and decoding of content of IAMF audio data bursts, the receiving delay should be calculated as the time necessary to receive the complete data-burst including the stuffing. The maximum time available to decode (the decoding delay) should be selected to correspond to the length of one full IAMF audio data-burst at the IEC 60958 frame rate.
If the HBR mode is used it is recommended to set the relationship between the IEC 60958 frame rate and the audio sample rate (bits 10 and 11 of Pc, see Table 1) so that the resulting maximum data-burst payload size is greater than the maximum size of an IAMF audio data frame.