Tech-invite3GPPspaceIETFspace
21222324252627282931323334353637384‑5x
Top   in Index   Prev   Next

TS 22.076
Noise Suppression for the AMR codec –
Stage 1

V18.0.0 (PDF)2024/03  … p.
V17.0.0  2022/03  12 p.
V16.0.0  2020/06  12 p.
V15.0.0  2018/06  12 p.
V14.0.0  2017/03  12 p.
V13.0.0  2015/12  12 p.
V12.0.0  2014/09  12 p.
V11.0.0  2012/09  12 p.
V10.0.0  2011/04  12 p.
V9.0.0  2009/12  12 p.
V8.0.0  2008/12  12 p.
V7.0.0  2007/06  12 p.
V6.0.0  2005/01  12 p.
V5.0.0  2002/06  12 p.
V4.0.1  2001/10  12 p.
GSM Rel-99 v8.0.1  2001/08  11 p.
Rapporteur:
Mr. Usai, Paolino
ETSI

Content for  TS 22.076  Word version:  18.0.0

Here   Top

1  Scopep. 5

The present document specifies the stage 1 description for the Noise Suppression feature for the AMR codec which enhances the input speech signal corrupted by acoustic noise. In analogy with ITU-T Recommendations I.130 [1], Stage 1 is an overall service description, from the service subscriber's and user's standpoints, that views the network as a single entity which provides services to the user.

2  Referencesp. 5

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.
  • References are either specific (identified by date of publication, edition number, version number, etc.) or non specific.
  • For a specific reference, subsequent revisions do not apply.
  • For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.
[1]
ITU-T Recommendations I.130 (1988): "Method for the characterization of telecommunication services supported by an ISDN and network capabilities of an ISDN".
[2]
GSM 01.04 (ETR 350): "Digital cellular telecommunications system (Phase 2+); Abbreviations and acronyms".
[3]
GSM 03.50: "Digital cellular telecommunications system (Phase 2+); Transmission planning aspects of the speech service in the GSM Public Land Mobile Network (PLMN) system".
Up

3  Definitions and abbreviationsp. 5

GSM 01.04 [2] (ETR 350) provides a list of abbreviations and acronyms used in GSM specifications.

4  Descriptionp. 5

Noise Suppression for the AMR codec is an optional feature designed to enhance speech quality in a range of environments where there is significant (acoustic) background noise. The noise suppression function is a preprocessing module that is used to improve the signal to noise ratio of a speech signal prior to voice coding. In so doing it may use functions and/or data from the AMR speech encoding function. It shall be possible to implement AMR Noise Suppression in the mobile station (operating on the uplink speech signal). The possibility to implement AMR Noise Suppression in the network (operating on the downlink speech signal) is for further study. The noise suppression specification shall be comprised of bit exact fixed point C code. Test vectors shall be defined to verify operation.
The AMR Speech decoder C-code should not be altered by the Noise Suppression.
It shall be possible for the network to disable the operation of the example noise suppression algorithm defined by this feature, whether that operation is operational in the network, the mobile station, or both locations.
Up

4.1  Applicability of Noise Suppression to Basic Services.p. 5

This feature shall be applicable (as an option) to all speech calls where the narrowband AMR codec is utilised. Operation of noise suppression for wideband AMR is for further study.

4.2  Support in Mobile Stations (MS)p. 6

Support of the Noise Suppression feature shall require modifications to future mobile stations. Provision of the feature in AMR-capable mobile stations is a manufacturer dependent option.
Use of the feature in the network during a call should not place any requirements on its use within the MS. Similarly, use of the feature by the MS during a call shall not place any requirements on its use in the network.
The network shall be able to enable or disable this example optional noise suppression function both at call set-up and in call [Signalling between network and mobile to allow this control is under study in SMG2 WPA].
Up

4.3  Support in the Networkp. 6

Provision of the feature in the network should be an option.
Use of the feature in the network during a call should not place any requirements on its use within the MS. Similarly, use of the feature by the MS during a call should not place any requirements on its use in the network.
The network should be able to enable or disable this example optional noise suppression function both at call set-up and in call.

4.4  Parameters to be indicated and negotiatedp. 6

[TBD]

4.5  Provision of Servicep. 6

4.5.1  Location Independencep. 6

The Noise suppression feature shall be location independent.

4.5.2  Provision of service within and between networksp. 6

Provision of the feature is the same whether or not the call is wholly contained within a network or between networks.

4.5.3  Subscription and Billing Informationp. 6

This feature shall not be provisioned on a per-subscriber basis and no record of the application of Noise Suppression is necessary for billing purposes.

4.6  Quality of Service (QoS)p. 6

4.6.1  Impact on Speech Qualityp. 6

The following performance requirements are stated under the assumption that the noise suppressor is tested as an integral part of the AMR speech codec with the speech codec operating at the following rates [TBD]. The performance requirements must be met for all these stated speech codec rates.

4.6.1.1  Initial Convergence Timep. 6

The initial convergence time shall be a maximum of T seconds with T equal to 2s. The definition of this time interval shall be understood strictly in accordance with its means of use in subjective listening experiments. Its use shall be defined by a process whereby the first T seconds of each sample processed through the AMR speech codec with and without noise suppression active, is deleted before presentation to listeners. It is assumed that this process does not reduce intelligibility, or introduce clipping or similar effects into the resultant speech plus noise material.
To test the subjective effect of initial convergence, there will be a subset of subjective testing defined where this initial period of T seconds is not removed from the processed samples. These tests should be representative of the full range of noise conditions.
Up

4.6.1.2  No Degradation in Clean Speechp. 7

The noise suppression function must not have a statistically significant distorting effect on clean speech, in comparison with the performance of the AMR codec without noise suppression applied. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.
This requirement also applies when VAD/DTX is active.

4.6.1.3  No Artefacts in Residual Noisep. 7

The noise suppression function must not introduce any subjectively objectionable artefacts in the residual noise. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.
This requirement also applies when VAD/DTX is active.

4.6.1.4  No Speech Clipping and no Reduction in Intelligibilityp. 7

The noise suppression function should introduce no subjectively objectionable degradation such as clipping or distortion in the speech, and no reduction in intelligibility. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.
This requirement also applies when VAD/DTX is active.

4.6.1.5  Quality Impact compared to AMRp. 7

The AMR speech codec with noise suppression activated must produce an output in noisy speech which is preferred amongst test listeners with statistical significance, compared to the case where noise suppression is not used. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.
This requirement also applies when VAD/DTX is active.
Up

4.6.2  Impact on Speech Path Delayp. 7

The one way algorithmic delay due to the activation of AMR noise suppression shall be no more than 7 ms in excess of the delay inserted by the AMR speech codec.
In handsfree case, this delay is part of the 39ms delay specified in GSM 03.50 [3].

4.7  Impact on Complexityp. 7

Table 4.1 defines complexity limits for AMR noise suppression.
Quantity Complexity Limit (Upper Bound)
Number of weighted operations per second5 WMOPS
Scratch pad RAMRe-use AMR speech encoder scratch pad RAM (or in the case of implementation which does not reside in the same device as the speech encoder, the available scratch pad RAM should be the same as that defined for the AMR speech encoder)
Static RAM1,5 kwords
Data ROM1 kword
Program ROM2000 basic ETSI operations
Up

4.8  Impact on Channel Activityp. 8

The AMR speech codec with noise suppression activated should not significantly increase channel activity when used in conjunction with DTX.
Channel activity increase will be measured thanks to the Voice Activity factor (VAF), defined as follows.
Let x be the VAF measured by the AMR VAD as an averaged value on all clean speech signals.
Let y be the VAF measured by the AMR VAD without AMR NS active as an averaged value on all clean speech + noise signals (where the applicable clean speech signal is the speech signal used in the measure of x).
Let w be the VAF measured by the AMR VAD with AMR NS active as an averaged value on all clean speech +noise signals (where the applicable clean speech signal is the speech signal used in the measure of x). w is required to be less than the maximum of y and x. Any case where w is greater than y should be further investigated.
For real word signals, w is required not to be significantly greater than y. Any case where w is greater than y should be further investigated.
These requirements shall apply to all standardized AMR VADs. (w,x,y) are determined using all VADs, and the requirements are checked relatively to each AMR VAD independently.
Up

5  Interaction with supplementary servicesp. 8

5.1  Generalp. 8

This clause defines the interactions between GSM supplementary services and the Noise Suppression Feature.
The application of Noise Suppression shall not interfere with the provision or invocation of any supplementary services.

5.2  Explicit Call Transfer (ECT)p. 8

No adverse interaction. If the new party is a mobile station with support for the Noise Suppression feature, the noise suppression feature shall be invoked.

5.3  Call wait/Call hold.p. 8

No interaction.

5.4  Multipartyp. 8

No interaction.

5.5  Service Announcementsp. 9

No interaction.

6  Interaction with Alternate and Followed by servicesp. 9

There shall be no impact on data transmission due the Noise Suppression Feature.

7  Interaction with other speech servicesp. 9

There is no requirement for Noise Suppression in ASCI services.

8  Interaction with DTMF and other signalling tonesp. 9

DTMF and other signalling tones transmission performance during the application of Noise Suppression shall be no worse than the case where Noise Suppression is turned off.

9  Interaction with Lawful Interceptp. 9

In the case where lawful intercept is required in a call where Noise Suppression is activated, the Noise Suppression shall not cause any degradation in the speech quality received by the A and B parties.

10  Interaction with TFOp. 9

No interaction.

$  Change Request Historyp. 10


Up   Top