B.1 General
3GPP TS 26.132, Release 18: Speech and video telephony terminal acoustic test specification
This annex describes, in pseudo code, a reference algorithm for evaluating the echo control characteristics. The output of an implementation of the test method, run on the stimuli from the file "echo_control_reference_files.zip", should equal the results presented in Table 3a and Table 3b. To run the verification, the additional file named "p501-downlink_WB.pcm" in the pseudo code shall be created by concatenating the full band speech samples FB_female_conditioning_seq_long.wav and FB_male_female_single-talk_seq.wav from ITU-T Recommendation P.501 and processing the result with the following set of commands based on ITU-T Recommendation G.191:
filter -down HQ3 far_end_signal_48k.pcm far_end_signal_16k.pcm
filter P341 far_end_signal_16k.pcm p501-downlink_WB.pcm
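The concatenation step that precedes these commands is not spelled out in the text. As a minimal sketch only, assuming both P.501 files share one sample format and that headerless little-endian PCM is acceptable input for the G.191 tools, it could be done as follows. The function name concat_wavs_to_pcm is hypothetical and is not part of the reference algorithm.

```python
import wave


def concat_wavs_to_pcm(wav_paths, pcm_path):
    """Concatenate WAV files in order and write the samples as headerless PCM.

    Assumption: all inputs share the same channel count, sample width and
    sample rate; a mismatch raises ValueError instead of being resampled.
    Returns the common format as (channels, sample_width_bytes, rate_hz).
    """
    expected = None
    with open(pcm_path, "wb") as out:
        for path in wav_paths:
            with wave.open(path, "rb") as wav:
                fmt = (wav.getnchannels(), wav.getsampwidth(), wav.getframerate())
                if expected is None:
                    expected = fmt
                elif fmt != expected:
                    raise ValueError(f"{path}: format {fmt} differs from {expected}")
                # WAV sample data is little-endian; it is copied unchanged.
                out.write(wav.readframes(wav.getnframes()))
    return expected
```

If the concatenated signal is the one the commands above read as far_end_signal_48k.pcm (an assumption, since the text does not name it), the call would be concat_wavs_to_pcm(["FB_female_conditioning_seq_long.wav", "FB_male_female_single-talk_seq.wav"], "far_end_signal_48k.pcm").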
Table 3a: Characterization of segment 1.
| Category | Double talk: Activity | Double talk: Av. Level [dB] | Single talk: Activity | Single talk: Av. Level [dB] |
|---|---|---|---|---|
| A1 | 60,8% | -1,2 | 95,1% | 0,1 |
| A2 | 39,2% | -5,1 | 1,4% | -4,8 |
| B | 0,0% | 0 | 0,0% | 0 |
| C | 0,0% | 0 | 0,0% | 0 |
| D | 0,0% | 0 | 0,0% | 0 |
| E | 0,0% | 0 | 0,3% | 9,4 |
| F | 0,0% | 0 | 3,2% | 8,7 |
| G | 0,0% | 0 | 0,0% | 0 |
Table 3b: Characterization of segment 2.
| Category | Double talk: Activity | Double talk: Av. Level [dB] | Single talk: Activity | Single talk: Av. Level [dB] |
|---|---|---|---|---|
| A1 | 50,2% | -1,1 | 93,8% | 0,2 |
| A2 | 40,8% | -7,3 | 0,3% | -5,6 |
| B | 1,2% | -16,9 | 0,0% | 0 |
| C | 7,1% | -17,2 | 0,0% | 0 |
| D | 0,0% | 0 | 0,0% | 0 |
| E | 0,0% | 0 | 0,5% | 9,5 |
| F | 0,7% | 4,0 | 5,5% | 6,2 |
| G | 0,0% | 0 | 0,0% | 0 |
The pseudo-code reference algorithm produces a text file as output. An implementation of the test method may be tested with the test script on the data in the file "echo_control_reference_files.zip", for which the result shall equal:
ms01-rec2; segm. 1; Processed signal;
active speech level [dBovl]; -45.8; RMS level [dBovl]; -51.5; speech activity; 0.269
ms01-rec2; segm. 1; Near end signal;
active speech level [dBovl]; -42.6; RMS level [dBovl]; -49.1; speech activity; 0.225
ms01-rec2; segm. 1; Downlink signal;
active speech level [dBovl]; -26.6; RMS level [dBovl]; -27.4; speech activity; 0.823
ms01-rec2; segm. 1; delay 0; DL delay 0;
DT activity 0.100; 0.608; 0.392; 0.000; 0.000; 0.000; 0.000; 0.000; 0.000;
ms01-rec2; segm. 1; delay 0; DL delay 0;
DT level diff; -1.2; -5.1; 0.0; 0.0; 0.0; 0.0; 0.0; 0.0;
ms01-rec2; segm. 1; delay 0; DL delay 0;
ST activity 0.664; 0.951; 0.014; 0.000; 0.000; 0.000; 0.003; 0.032; 0.000;
ms01-rec2; segm. 1; delay 0; DL delay 0;
ST level diff; 0.1; -4.8; 0.0; 0.0; 0.0; 9.4; 8.7; 0.0;
ms01-rec2; segm. 2; Processed signal;
active speech level [dBovl]; -42.0; RMS level [dBovl]; -44.4; speech activity; 0.581
ms01-rec2; segm. 2; Near end signal;
active speech level [dBovl]; -40.6; RMS level [dBovl]; -42.7; speech activity; 0.625
ms01-rec2; segm. 2; Downlink signal;
active speech level [dBovl]; -26.5; RMS level [dBovl]; -27.2; speech activity; 0.841
ms01-rec2; segm. 2; delay -1; DL delay 0;
DT activity 0.348; 0.502; 0.408; 0.012; 0.071; 0.000; 0.000; 0.007; 0.000;
ms01-rec2; segm. 2; delay -1; DL delay 0;
DT level diff; -1.1; -7.3; -16.9; -17.2; 0.0; 0.0; 4.0; 0.0;
ms01-rec2; segm. 2; delay -1; DL delay 0;
ST activity 0.362; 0.938; 0.003; 0.000; 0.000; 0.000; 0.005; 0.055; 0.000;
ms01-rec2; segm. 2; delay -1; DL delay 0;
ST level diff; 0.2; -5.6; 0.0; 0.0; 0.0; 9.5; 6.2; 0.0;
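To compare such an output against Table 3a and Table 3b, the activity and level lines can be mapped onto the categories A1 to G. The following is a minimal sketch under stated assumptions, not part of the reference algorithm: the first value of an activity line is read as the overall activity and the eight values after it as the per-category shares, which is inferred from the agreement with the tables (for example 0.608 and 60,8% for A1 in segment 1). All function names are hypothetical.

```python
CATEGORIES = ["A1", "A2", "B", "C", "D", "E", "F", "G"]


def parse_values(line, label):
    """Return the numeric fields that follow `label` on a result line."""
    text = line.strip()
    if not text.startswith(label):
        raise ValueError(f"line does not start with {label!r}")
    fields = text[len(label):].split(";")
    return [float(field) for field in fields if field.strip()]


def activity_by_category(line, label):
    """Split an activity line into (overall activity, {category: share}).

    Assumption: the first value is the overall activity and the next
    eight values belong to categories A1..G in that order.
    """
    values = parse_values(line, label)
    overall, shares = values[0], values[1:]
    if len(shares) != len(CATEGORIES):
        raise ValueError(f"expected {len(CATEGORIES)} category values")
    return overall, dict(zip(CATEGORIES, shares))


def level_by_category(line, label):
    """Map the eight level differences [dB] of a line onto A1..G."""
    values = parse_values(line, label)
    if len(values) != len(CATEGORIES):
        raise ValueError(f"expected {len(CATEGORIES)} category values")
    return dict(zip(CATEGORIES, values))


def to_percent(share):
    """Format a 0..1 share as a percentage with a decimal comma."""
    return f"{share * 100:.1f}%".replace(".", ",")
```

For instance, activity_by_category applied to the segment 1 "DT activity" line with label "DT activity" yields the shares whose to_percent form can be checked against the Double talk Activity column of Table 3a.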