This document specifies the system level aspects of the Discontinuous Transmission (DTX) function of the EVS codec.
The following documents contain provisions which, through reference in this text, constitute provisions of the present document.
References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
For a specific reference, subsequent revisions do not apply.
For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.
3GPP TR 21.905: "Vocabulary for 3GPP Specifications".
3GPP TS 26.445: "Codec for Enhanced Voice Services (EVS); Detailed Algorithmic Description".
3GPP TS 26.442: "Codec for Enhanced Voice Services (EVS); ANSI C code (fixed-point)".
3GPP TS 26.443: "Codec for Enhanced Voice Services (EVS); ANSI C code (floating-point)".
3GPP TS 26.444: "Codec for Enhanced Voice Services (EVS); Test Sequences".
3GPP TS 26.446: "Codec for Enhanced Voice Services (EVS); AMR-WB Backward Compatible Functions".
3GPP TS 26.447: "Codec for Enhanced Voice Services (EVS); Error Concealment of Lost Packets".
3GPP TS 26.448: "Codec for Enhanced Voice Services (EVS); Jitter Buffer Management".
3GPP TS 26.449: "Codec for Enhanced Voice Services (EVS); Comfort Noise Generation (CNG) Aspects".
3GPP TS 26.441: "Codec for Enhanced Voice Services (EVS); General Overview".
3GPP TS 26.451: "Codec for Enhanced Voice Services (EVS); Voice Activity Detection (VAD)".
3GPP TS 26.114: "Multimedia Telephony; Media handling and interaction".
3GPP TS 26.131: "Terminal acoustic characteristics for telephony; Requirements".
3GPP TR 26.952: "Codec for Enhanced Voice Services (EVS); Performance Characterization".
3GPP TS 26.452: "Codec for Enhanced Voice Services (EVS); ANSI C code; Alternative fixed-point using updated basic operators".
For the purposes of the present document, the abbreviations given in TR 21.905 and the following apply. An abbreviation defined in the present document takes precedence over the definition of the same abbreviation, if any, in TR 21.905.
ACELP    Algebraic Code-Excited Linear Prediction
AMR-WB   Adaptive Multi Rate Wideband (codec)
CNG      Comfort Noise Generator
EVS      Enhanced Voice Services
FEC      Frame Erasure Concealment
JBM      Jitter Buffer Management
MSB      Most Significant Bit
MTSI     Multimedia Telephony Service for IMS
PSTN     Public Switched Telephone Network
SAD      Signal Activity Detection
SC-VBR   Source Controlled - Variable Bit Rate
SID      Silence Insertion Descriptor
VAD      Voice Activity Detection
WMOPS    Weighted Millions of Operations Per Second
The present document specifies the system level aspects of the Discontinuous Transmission (DTX) function of the EVS codec. Specific codec functions that implement the DTX and other EVS functionalities are described in TS 26.445, TS 26.446, TS 26.447, TS 26.448, TS 26.449 and TS 26.451. The corresponding fixed-point reference C code is specified in TS 26.442 and TS 26.452 using two different sets of basic operators. The corresponding floating-point reference C code is specified in TS 26.443. The EVS test sequences are given in TS 26.444 and the EVS codec is characterized in TR 26.952.
Discontinuous transmission (DTX) is a mechanism for the EVS Codec to encode the input signal at a lower average rate by taking speech inactivity into account. The DTX scheme is useful for the following purposes:
- to conserve UE power,
- to improve network capacity.
DTX in the transmitting path (uplink) shall be in operation in UEs if commanded so by the network. The UE shall handle DTX in the receiving path (downlink) at any time, regardless of whether DTX in the transmitting path is commanded or not.
The default DTX mechanism described in the present document requires the following functions:
- a Signal Activity Detector (SAD) on the transmit (TX) side;
- evaluation of the background acoustic noise on the transmit (TX) side in order to transmit characteristic parameters to the receive (RX) side;
- generation, on the receive (RX) side, of a similar noise called comfort noise during periods where the transmission is switched off.
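The interaction of the three functions listed above can be illustrated with a minimal, non-normative Python sketch. It is not the EVS algorithm: the energy-threshold activity decision is a simple stand-in for the SAD of TS 26.451, and the frame labels ("SPEECH", "SID", "NO_DATA"), the threshold, the noise update interval and all function names are assumptions introduced here for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional

SID_INTERVAL = 8          # illustrative: frames between noise updates (assumption)
ENERGY_THRESHOLD = 0.01   # illustrative activity threshold (assumption)


@dataclass
class Frame:
    kind: str                        # "SPEECH", "SID" or "NO_DATA" (assumed labels)
    payload: Optional[float] = None  # toy payload: mean energy of the frame


def frame_energy(samples: List[float]) -> float:
    """Mean squared amplitude of one frame."""
    return sum(s * s for s in samples) / len(samples)


def tx_side(frames: List[List[float]]) -> List[Frame]:
    """Toy transmit side: activity detection plus noise evaluation."""
    out: List[Frame] = []
    inactive_run = 0  # consecutive inactive frames seen so far
    for samples in frames:
        energy = frame_energy(samples)
        if energy > ENERGY_THRESHOLD:  # stand-in for the SAD decision
            inactive_run = 0
            out.append(Frame("SPEECH", energy))
        else:
            # Send a noise description at the start of an inactive period and
            # then every SID_INTERVAL frames; otherwise transmit nothing.
            if inactive_run % SID_INTERVAL == 0:
                out.append(Frame("SID", energy))
            else:
                out.append(Frame("NO_DATA"))
            inactive_run += 1
    return out


def rx_side(received: List[Frame]) -> List[str]:
    """Toy receive side: decode speech, otherwise generate comfort noise."""
    noise_level = 0.0  # last noise parameter received from the TX side
    actions: List[str] = []
    for frame in received:
        if frame.kind == "SPEECH":
            actions.append("decode_speech")
        else:
            if frame.kind == "SID":
                noise_level = frame.payload
            actions.append(f"comfort_noise({noise_level:.4f})")
    return actions
```

In this sketch the receive side keeps generating comfort noise from the most recently received noise parameter while nothing is transmitted, which mirrors the division of work between TX and RX described above without reproducing any of the actual codec processing.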
The Signal Activity Detector (SAD) is defined in TS 26.451 and the comfort noise functions in TS 26.449. Both are based partly on the EVS Codec and its internal variables defined in TS 26.445.
In addition to these functions, if the parameters arriving at the RX side are detected to be seriously corrupted by errors, the speech or comfort noise shall be generated from substituted data in order to avoid seriously annoying effects for the listener. These functions are defined in TS 26.447.
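The idea of generating output from substituted data can be illustrated with a minimal, non-normative Python sketch. It simply reuses the last correctly received parameters of the same frame kind; the actual concealment procedures of TS 26.447 are considerably more elaborate. The `RxFrame` type, its `bad` indicator and the `substitute` function are hypothetical names introduced here for illustration only.

```python
from typing import List, NamedTuple, Optional


class RxFrame(NamedTuple):
    kind: str                # "SPEECH" or "SID" (assumed labels)
    params: Optional[float]  # toy parameter set; None models a lost frame
    bad: bool                # hypothetical "seriously corrupted" indicator


def substitute(frames: List[RxFrame], default: float = 0.0) -> List[float]:
    """Return the parameters actually used for synthesis, frame by frame."""
    last_good = {"SPEECH": default, "SID": default}
    used: List[float] = []
    for frame in frames:
        if frame.bad or frame.params is None:
            # Substituted data: fall back to the last good parameters of the
            # same kind instead of synthesising from corrupted values.
            used.append(last_good[frame.kind])
        else:
            last_good[frame.kind] = frame.params
            used.append(frame.params)
    return used
```

For example, a corrupted speech frame that follows a good one is synthesised from the good frame's parameters, and a lost SID frame leaves the previously received comfort noise parameters in use.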
An overall description of the speech processing parts can be found in TS 26.441. An overview of the DTX operation is shown in Figure 1.