Tech-invite3GPPspaceIETFspace
21222324252627282931323334353637384‑5x

Content for  TR 26.935  Word version:  17.0.0

Top   Top   None   None   Next
1…   5…   A…

 

1  Scopep. 8

The present document provides information on the performances of default speech codecs in packet switched conversational multimedia applications. The codecs under test are AMR-NB (Adaptive Multi-Rate Narrowband) and AMR-WB (Adaptive Multi-Rate Wideband). In addition, several ITU-T codecs (G.723.1, G.729, G.722 and G.711) are included in the testing. Experimental test results from the speech quality testing are reported to illustrate the behaviour of these codecs.
The results give information of the performance of PS conversational multimedia applications under various operating and transmission conditions (e.g., considering radio transmission errors, IP packet losses, end-to-end delays, and several types of background noise). The performance results can be used e.g. as guidance for network planning and to appropriately adjust the radio network parameters.
Up

2  Referencesp. 8

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.
  • References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
  • For a specific reference, subsequent revisions do not apply.
  • For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document
[1]
ITU-T Recommendation P.800: "Methods for Subjective Determination of Transmission Quality".
[2]
ITU-T Recommendation P.831: "Subjective performance evaluation of network echo cancellers".
[3]
ITU-T Recommendation G.711: "Pulse code modulation (PCM) of voice frequencies".
[4]
ITU-T Recommendation G.729: "Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP)".
[5]
ITU-T Recommendation G.723.1: "Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s".
[6]
ITU-T Recommendation G.722: "7 kHz audio-coding within 64 kbit/s".
[7]
RFC 1889:  "RTP: A Transport Protocol for Real-Time Applications".
[8]
RFC 3267:  "Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for the Adaptive Multi-Rate (AMR) Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs".
[9]
TS 34.121: "Terminal Conformance Specification, Radio Transmission and Reception (FDD)" (downlink).
[10]
TS 25.141: " Base Station (BS) conformance testing (FDD)" (uplink).
[11]
TR 25.853: "Delay budget within the access stratum".
[12]
TS 26.235: "Packet switched conversational multimedia applications; Default codecs".
[13]
TS 26.071: "AMR speech Codec; General description".
[14]
TS 26.171: "AMR speech codec, wideband; General description".
[15]
TS 25.322: "Radio Link Control (RLC) protocol specification".
[16]
RFC 3095:  "RObust Header Compression (ROHC): Framework and four profiles: RTP, UDP, ESP, and uncompressed".
[17]
TS 34.108: "Common test environments for User Equipment (UE) conformance testing".
[18]
ETSI TR 101 112: "Universal Mobile Telecommunications System (UMTS); Selection procedures for the choice of radio transmission technologies of the UMTS" (UMTS 30.03 v3.1.0).
[19]
TS 26.114: : "IP Multimedia Subsystem (IMS); Multimedia Telephony; Media handling and interaction"
[20]
ITU-T Recommendation P.805 (P.CONV): "Subjective evaluation of conversational quality"
Up

3  Abbreviationsp. 9

3.1  Abbreviationsp. 9

For the purposes of the present document, the following abbreviations apply:
AMR-NB (or AMR)
Adaptive Multi-Rate Narrowband Speech Codec
AMR-WB
Adaptive Multi-Rate Wideband Speech Codec
ANOVA
Analysis of Variance
ASY
ASYmmetric conditions
BLER
Block Error Rate
CDF
Cumulative Distribution Function
CMR
Codec Mode Request
COND
Test CONDitions
CN
Core Network
CQ
Conversational Quality
CRC
Cyclic Redundancy Check
DCH
Dedicated Channel
DL
Downlink
DMOS
Degradation Mean Opinion Score
DPCH
Dedicated Physical Channel
DTCH
Dedicated Traffic Channel
Eb/No
Ratio of energy per modulating bit to the noise spectral density
EID
Error Insertion Device
FER
Frame Erasure Rate, Frame Error Rate
GAL
Global Analysis Laboratory
GQ
Global Quality (of the conversation)
HM
High Mobility
HT
High Traffic
HSDPA/EUL
High Speed Downlink Packet Access/Enhanced UpLink
IA
InterAction (with your partner)
IP
Internet Protocol
ITU-T
International Telecommunication Union - Telecommunications Standardization Sector
JBM
Jitter Buffer Management
LAB
Listening LABoratory
LM
Low Mobility
LT
LowTraffic
MAC
Medium access control
MANOVA
Multivariate Analysis of Variance
Log-MAP
Logarithmic Maximum A Posteriori
MOS
Mean Opinion Score
NB
Narrowband
PC
PerCeption of impairments (also: Personal Computer)
PDCP
Packet Data Convergence Protocol
PDU
Protocol Data Unit
Pa
Sound Pressure Level (in Pascal)
PL
Packet Loss
plc
Packet Loss Concealment
RC
Radio Conditions
PS
Packet Switched
RB
Radio Bearer
RAB
Radio Access Bearer
RCV
Receive
RLC
Radio Link Control
ROHC
Robust Header Compression
RRM
Radio Resource Management
RTCP
Real-Time Control Protocol
RTP
Real-time Transport Protocol
SYM
SYMmetric conditions
TB size
Transport Block size
TF
Transport Format
ToC
Table of Content
TrCH
Transmission Channel
TTI
Transmission Time Interval
UDP
User Datagram Protocol
UE
User Equipment
UL
Uplink
UM
Unacknowledged Mode
UMD
Unacknowledged Mode Data
US
difficulty UnderStanding (your partner)
VOIP
Voice over IP
VQ
Voice Quality (of your partner)
WB
Wideband
XMIT
Transmit
Up

4  General Overviewp. 10

4.1  Introductionp. 10

The performance of default speech codecs (AMR-NB and AMR-WB) for packet switched conversational multimedia [12, 19] was characterised over DCH channels and over HSDPA/EUL radio channels.
The testing over DCH channels was carried out from October 2003 until February 2004. Further subjective testing was carried out from June until October 2007 in order to characterize the performance over HSDPA/EUL radio channels. The main purpose of the latter testing was to evaluate and verify adequate performance of the AMR-NB and AMR-WB speech codecs used as defined in IMS Multimedia Telephony TS 26.114 with a specific focus on jitter buffer management.
Up

4.2  Tests over DCH radio channelsp. 10

The tests over DCH channels were separated into two phases: Phase 1 considered the default speech codecs AMR-NB [13] and AMR-WB [14] in various operating conditions. Phase 2 considered also several other codecs including ITU-T codecs G.723.1 [5], G.729 [4], G.722 [6] and G.711 [3].
In Phase 1, France Telecom R&D acted as host laboratory. The subjective testing laboratories were ARCON for the North American English language, France Telecom R&D for the French language and NTT-AT for the Japanese language. Phase 1 tests consisted of 24 test conditions both for the AMR codec (modes 6.7 and 12.2 kbit/s) and the AMR-WB codec (modes 12.65 and 15.85 kbit/s) with error conditions covering both IP packet loss of 0% and 3% and radio conditions with 10-2, 10-3 and 5 10-4 BLER (Block Error Rate). End-to-end delays of 300 and 500 ms were covered. Robust Header Compression (ROHC), an optional UMTS functionality, was included for some test cases for AMR-WB. Three types of background noise were used: car, street and cafeteria.
In Phase 2, France Telecom R&D acted as host and listening laboratory. Two languages were used (French and Arabic). The following codecs were tested: AMR-NB (modes 6.7 and 12.2 kbit/s), AMR-WB (modes 12.65 and 15.85 kbit/s), ITU-T G.723.1 (mode 6.4 kbit/s), ITU-T G.729 (mode 8 kbit/s), ITU-T G.722 (mode 64 kbit/s) and ITU-T G.711 (64 kbit/s). Transmission error conditions covered IP packet loss of 0% and 3%.
Siemens provided the real time air interface simulator for the Phase 1. France Telecom provided the IP core network simulator and terminal simulator used in both experiments Phase 1 and Phase 2. IPv6 was employed in the testing. (IPv6 is fully simulated over the radio interface. The CN simulator employs IPv4 but since the only impact is a marginal difference in the end-to-end delay - of the order of ~16 ìs - the use of a particular IP-version in CN part has no impact on the performance results.)
These tests were the first ever conversational tests conducted in any standardization body. Performance evaluation consisted of assessment of 5 aspects: 1) voice quality, 2) difficulty of understanding words, 3) quality of interaction, 4) degree of impairments, and 5) global communication quality. A 5-category rating scale was used for each aspect.
Dynastat performed the global analysis for Phases 1 and 2. The results are contained in Clause 7.
Up

4.3  Tests over HSDPA/EUL radio channelsp. 11

These listening-only tests characterized the performance of AMR-NB and AMR-WB speech codecs over HSDPA/EUL channels when conducting buffer adaptation to the network delay variations using a simple jitter buffer management (JBM) algorithm. The tests focused on the effect of channel errors and channel jitter to speech quality instead of the impact of overall end-to-end delay in speech conversation. The end-to-end delay impact was considered separately by conducting a delay analysis on the whole processed test material.
The subjective listening-only tests were conducted in Finnish and Swedish languages at Nokia and Ericsson, respectively. The tests consisted of eight different channel conditions in clean speech and in background noise conditions. AMR-NB was tested in 12.2 and 5.9 kbit/s modes, and AMR-WB at 12.65 kbit/s. The outstanding issue was to evaluate the performance of adaptive JBM operation in HSDPA/EUL channel conditions. The applied adaptive jitter buffer was a simple implementation conducting buffer adaptation mainly during discontinuous transmission, i.e. speech pauses, and not using any time scaling operation. A non-implementable fixed jitter buffer with the full a priori knowledge on the channel characteristics was used as a reference. Although the average end-to-end delays of both adaptive and fixed jitter buffers were the same, the number and locations of jitter buffer induced frame losses were different depending on the channel conditions.
The results are contained in Clause 8.A.
A program of Conversation Tests was organized to evaluate the performance of AMR-NB and AMR-WB for UMTS over HSDPA/EUL. Three test labs were contracted to conduct the conversation tests and deliver raw voting data to Dynastat, the Global Analysis Lab (GAL), for processing and statistical analysis.
Three conversation tests were conducted in each of three test labs. The test labs were FTRD, testing in the French language, BIT, testing in the Chinese language, and Dynastat, testing in North American English. Each of the three conversation tests involved a different speech codec:
  • Exp.1 - AMR operating at 5.9k bps
  • Exp.2 - AMR operating at 12.2k bps
  • Exp.3 - AMR-WB operating at 12.65k bps
The experiments were conducted according to specifications contained in the ITU-T Recommendation for Conversation Testing, P.805 [20]. Alcatel-Lucent provided the network impairment simulation test-bed. The raw voting data for each test lab and each Experiment was delivered to the GAL. The GAL conducted statistical analyses on the raw voting data and the results of those analyses are contained in Clause 8.B.
Up

Up   Top   ToC