6 Operation Points
26.1173GPP5G Media Streaming (5GMS)Release 17Speech and audio profilesTS
6.1 Introduction
The speech and audio Operation Points defined in this clause are primarily introduced in order to be used as content format in the context of 5G Media Streaming, but not restricted to this use case.
An operation point is a combination of rendering formats and media decoding capabilities.
For each Operation Point, Bitstream and Receiver requirements are detailed in the remainder of clause 6.
Table 6.1 provides an overview of the Operation Points defined in the present document.
Table 6.1: Speech and Audio Operation Points
Operation Point name |
Format Properties |
Decoding and Encoding Capabilities |
Reference |
AMR speech |
Sampling frequency: 8 kHz |
AMR |
6.2.2 |
AMR-WB speech |
Sampling frequency: 16 kHz |
AMR-WB |
6.2.3 |
EVS mono |
Sampling frequency: 8, 16, 32, 48 kHz |
EVS |
6.2.4 |
eAAC+ stereo |
Sampling frequency: 32, 44.1, 48 kHz |
eAAC+ |
6.3.2 |
AMR-WB+ |
Sampling frequency: 8, 16, 32, 48 kHz |
AMR-WB+ |
6.3.3 |
6.2 Speech Operation Points
6.2.1 Introduction
This clause defines speech operation points. For each operation point, the requirements for the bitstream as well as for the receiver are defined.
6.2.2 AMR
6.2.2.1 Bitstream Encoding Requirements
The following requirements apply to the AMR Operation Point.
– The sampling frequency shall be 8 kHz.
– The bitstream shall be encoded according to either 3GPP TS 26.073 [5] or 3GPP TS 26.104 [6].
Note that the bitstream produced by the AMR encoder consists of 20ms encoded speech frames.
6.2.2.2 Receiver Requirements
Receivers conforming to the AMR Operation Point shall support the AMR speech media decoding capability according to clause 5.2 and shall support playback of the decoded signal.
6.2.2.3 Sender Requirements
Senders conforming to the AMR Operation Point shall support the AMR speech media encoding capability according to clause 5.3 in real-time for any speech source format with sampling frequency 8kHz.
6.2.3 AMR-WB
6.2.3.1 Bitstream Requirements
The following requirements apply to the AMR-WB Operation Point.
– The sampling frequency shall be 16 kHz.
– The bitstream shall be encoded by one of the following methods:
– according to 3GPP TS 26.173 [10]
– according to 3GPP TS 26.204 [11];
– the AMR-WB IO mode according to TS 26.442 [14] and TS 26.443 [15],
– the AMR-WB IO mode according to TS 26.452 [34].
Note that the bitstream produced by the AMR-WB encoder consists of 20 ms encoded speech frames.
6.2.3.2 Receiver Requirements
Receivers conforming to the AMR-WB Operation Point shall support the AMR-WB speech media decoding capability according to clause 5.2 and shall support playback of the decoded signal.
6.2.3.3 Sender Requirements
Senders conforming to the AMR-WB Operation Point shall support the AMR-WB speech media encoding capability according to clause 5.3 in real-time for any speech source format with sampling frequency 16kHz.
6.2.4 EVS
6.2.4.1 Bitstream Encoding Requirements
The following requirements apply to the EVS Operation Point:
– The sampling frequency shall be one of the following: 8, 16, 32, 48 kHz.
– The bitstream shall be encoded according to one of the following methods
– TS 26.442 [14] and TS 26.443 [15] encoding functions; or
– TS 26.452 [34] encoding functions.
Note that the bitstream produced by the EVS encoder consists of 20ms encoded speech frames.
6.2.4.2 Receiver Requirements
Receivers conforming to the EVS Operation Point shall support the EVS speech media decoding capability according to clause 5.2 and shall support playback of the decoded signal.
6.2.4.3 Sender Requirements
Senders conforming to the EVS Operation Point shall support the EVS speech media encoding capability according to clause 5.3 in real-time for any speech source format with sampling frequency 8, 16, 32, 48 kHz.
6.3 Audio Operation Points
6.3.1 Introduction
This clause defines audio operation points. For each operation point, the requirements for the bitstream as well as for the receiver are defined.
6.3.2 eAAC+ stereo
6.3.2.1 Bitstream Encoding Requirements
The following requirements apply to the eAAC+ stereo Operation Point.
- The sampling frequency shall be either 32 kHz, 44.1 kHz or 48 kHz.
- The bitstream shall be encoded according to 3GPP TS 26.401 [19], clause 7, as well as 3GPP TS 26.403 [21], 3GPP TS 26.404 [22] and 3GPP TS 26.405 [23].
NOTE: The specified eAAC+ encoder consists of AAC-LC with additional tools that can be enabled (SBR, PS and more), see [19].
6.3.2.2 Receiver Requirements
Receivers conforming to the eAAC+ stereo Operation Point shall support the eAAC+ media decoding capability according to clause 5.3 and shall support playback of the decoded signal.
NOTE: The eAAC+ decoder supports decoding of streams encoded with AAC-LC or aacPlus, see [19].
6.3.2.3 Sender Requirements
Senders conforming to the eAAC+ stereo Operation Point shall support the eAAC+ stereo audio media encoding capability according to clause 5.3 in real-time for any stereo audio source format with sampling frequency 32kHz, 44.1kHz, 48kHz.
6.3.3 AMR-WB+
6.3.3.1 Bitstream Encoding Requirements
The following requirements apply to the AMR-WB+ Operation Point.
– The sampling frequency shall be either 8, 16, 32 or 48 kHz.
– The bitstream shall be encoded by one of the following methods
– according to 3GPP TS 26.273 [28]; or
– according to 3GPP TS 26.304 [27].
6.3.3.2 Receiver Requirements
Receivers conforming to the AMR-WB+ Operation Point shall support the AMR-WB+ media decoding capability according to clause 5.3 and shall support playback of the decoded signal.
6.3.3.3 Sender Requirements
Senders conforming to the AMR-WB+ Operation Point shall support the AMR-WB+ audio media encoding capability according to clause 5.3 in real-time for any stereo audio source format with sampling frequency 8, 16, 32 or 48 kHz.