6.3 High frequency band

26.1903GPPAdaptive Multi-Rate - Wideband (AMR-WB) speech codecSpeech codec speech processing functionsTranscoding functionsTS

For the higher frequency band (6.4 – 7.0 kHz), excitation is generated to model the highest frequencies. The high frequency content is generated by filling the upper part of the spectrum with a white noise properly scaled in the excitation domain, then converted to the speech domain by shaping it with a filter derived from the same LP synthesis filter used for synthesizing the down-sampled signal.

6.3.1 Generation of high-band excitation

The high-band excitation is obtained by first generating white noise uHB1(n). The power of the high-band excitation is set equal to the power of the lower band excitation u2(n) which means that

. ( 74)

Finally the high-band excitation is found by

, (75 )

where is a gain factor.

In the 23.85 kbit/s mode,is decoded from the received gain index.

In 6.60, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85 and 23.05 kbit/s modes, gHB is estimated using voicing information bounded by [0.1,1.0]. First, tilt of synthesis etilt is found

(76 )

where is high-pass filtered lower band speech synthesis with cut-off frequency of 400 Hz. The is then found by

, ( 77 )

where gSP = 1 – etilt is gain for speech signal, gBG = 1.25gSP is gain for background noise signal, and wSP is a weighting function set to 1, when VAD is ON, and 0 when VAD is OFF. gHB is bounded between [0.1, 1.0]. In case of voiced segments where less energy is present at high frequencies, etilt approaches 1 resulting in a lower gain gHB. This reduces the energy of the generated noise in case of voiced segments.

6.3.2 LP filter for the high frequency band

6.3.2.1 6.60 kbit/s mode

The high-band LP synthesis filter AHB(z) is found by extrapolating the quantized ISF vector f into 20th order ISF vector fe. First, maximum of the autocorrelation Cmax(i) of ISF vector difference vector is obtained. Then new 16kHz ISF vector fe(i) is computed by

. (78 )

An approximation of the last element of new ISF vectoris updated based on lower frequency coefficients. New extrapolated ISF vector difference vector is

, (79 )

where cscale scales so that fe(19) will be equal to fe19. In order to insure stability, is bounded by

. (80)

Finally, the extrapolated ISF vector is obtained by

. (81 )

fe is converted to cosine domain to obtain with 16000 Hz sampling rate. The high-band LP synthesis filter AHB(z) is obtained by converting to LP filter as described in 5.2.4 with m=20.

6.3.2.2 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05 or 23.85 kbit/s modes

The high-band LP synthesis filter AHB(z) is weighted low-band LP synthesis filter

, (82)

where Â(z) is the interpolated LP synthesis filter. has been computed analysing signal with the sampling rate of 12.8 kHz but it is now used for a 16 kHz signal. Effectively, this means that the frequency response FR16(f) of AHB(z) is obtained by

( 83)

where FR12.8(f) is the frequency response of A(z). This means that the band 5.1 – 5.6 kHz in 12.8 kHz domain will be mapped to 6.4 – 7.0 kHz in 16 kHz domain.

6.3.3 High band synthesis

uHB(n) is filtered through AHB(z). The output of this high-band synthesis sHB(n) is filtered through a band-pass FIR filter HHB(z) which has the passband from 6 to 7 kHz. Finally, sHB is added to synthesized speech to produce the synthesized output speech signal .