Anda di halaman 1dari 4

SpecialReport

VoIP Protocols

Solving QoS in VoIP:


A Formula for
Explosive Growth?
What users and their organizations expect from VoIP is essentially
PSTN quality and objective verification that they are receiving it.
VoIP service providers must meet these expectations if they want to
profit from faster market penetration.

by: Luis F. Ortiz, Director of Software Architecture


Brooktrout Technology

he absence of solutions ensuring However, perceptions drive decisions. voice quality called Mean Opinion Score

T Quality of Service (QoS) has been a


deterrent to the widespread adoption
of Voice over Internet Protocol (VoIP).
VoIP service providers must be extremely
sensitive to the perceptions of their cus-
tomers, because a decision to change ser-
(MOS). MOS ratings on a scale of 1 (bad)
to 5 (excellent) are derived by soliciting
perceptions from groups of real people to
Potential users often think that speech vice can be precipitated from such test messages.
quality won’t be as good as what they are negative perceptions as MOS ratings are meaningful because
accustomed to—the familiar public • When a user perceives unacceptable they accurately reflect the dynamics of
switched telephone network (PSTN). instantaneous quality, the user is likely how perceived quality is affected by trans-
Solutions that improve absolute voice to terminate the call prematurely. mission-channel changes. If channel
quality and that enable objective quality • If a user perceives overall poor quality quality changes abruptly during a call, for
measurements can easily be incorporated after completing a call, there is likely to example, the MOS rating recovers more
into Service Level Agreements (SLAs). be a harboring of residual dissatisfaction. slowly from a step improvement in quality
With these solutions in hand, the reluc- • If service providers achieve quality by (time constant about 15 seconds) than it
tance of potential users can begin to evap- overprovisioning their networks, the does from a step degradation (time con-
orate, and the adoption of VoIP will stand resulting high costs undermine the stant about 5 seconds).
poised for a period of explosive growth. user’s perception of value, despite excel- Another example is the so-called
lent voice quality. recency effect. If a period of bad channel
Voice Quality and MOS Ratings However, since a perception is of little quality comes near the end of a call, it
Voice quality is subjective because it’s use to a service provider unless it can be lowers the MOS rating for overall call
a measure of the intelligibility and clarity measured in some way, the industry has quality much more than if it had appeared
of speech as perceived by the listener. developed a numerical representation of earlier in the call. While MOS ratings are

As seen in July 2001


SpecialReport

R User Satisfaction MOS 40 40


100 G.711 w/o PLC
G.107 Very Satisfied
Default
94 4.4
30 30 G.711 w. PLC
90 4.3 (Random PL)
Satisfied G.711 G.711 w. PLC
80 4.0 R 20 G.729A
R 20 (Bursty PL)
Some Users Dissatisfied G.723.1 G.729A w. VAD

70 3.6 10 10 G.723.1 w. VAD


Many Users Dissatisfied
60
Nearly All Users Dissatisfied 3.1 0 0
0 100 200 300 400 500 0 2 4 6 8 10
50 2.6
Not Recommended
One-Way Delay (ms) Packet Loss %
0 1.0
Figure 1 Comparison of R-Value, MOS Figure 2 Delay degradation for three Figure 3 Packet loss degradation for
and user satisfaction. different codecs. three codecs showing the
effect of PLC on VoIP quality.

meaningful for statistically characterizing corresponds to a MOS of ~4.0, which is the Packet loss occurs as a result of
user reactions to test messages, they don’t nominal PSTN quality most users perceive random channel errors, excessive conges-
offer a real-time, non-intrusive tool that’s as satisfactory. R-Values below the mid-60s tion delays or from re-routing in the IP net-
suitable for IP Telephony. result in widespread user dissatisfaction. work. Lost packets typically occur in
bursts, and two or more missing packets in
R-Value Correlates to MOS Quality Impairments and R- a row are much more detrimental to speech
An emerging standards effort within Degradations quality than single missing packets distrib-
the International Telecommunication There are many factors that can impair uted randomly.
Union (ITU) is currently creating an voice quality, and each of these can be asso-
“E-Model” for estimating the voice quality ciated with an R-Degradation. Technical Endpoint Strategies for
of IP telephony. The output of the E-Model Services Bulletin TIA/TSB-116 of the Improving Voice Quality
is a scalar “Transmission Rating Factor” Telecommunications Industry Association Voice quality can be optimized by
called the R-Value (or simply “R”). The (TIA) contains a discussion of all such fac- making trade-offs aimed at producing the
importance of R is that it’s repeatable and tors, three of which will be discussed here: lowest possible sum of R-Degradations.
can be calculated in real time from mea- coding technique, delay and packet loss. We concentrate here on endpoint strategies
surable channel and equipment character- The coding technique has a profound while recognizing that network optimiza-
istics. With appropriate modeling, R can effect on the response to channel impair- tion techniques, such as bandwidth reser-
be correlated to MOS with remarkable ments and the ability to employ error miti- vation or the use of private vs. public
accuracy. This gives service providers a gation. In addition, a non-reducible resources, can help by improving the
mechanism for measuring the quality of R-Degradation is introduced by different underlying quality of the transmission
their service (R) and accurately relating it codecs, which effectively lowers the channel. Endpoint techniques for miti-
to the perceptions of their users (MOS). starting point for the overall R-Value. gating the effects of packet errors fall into
The idea is to start with a perfect score four classes—correction, distribution, con-
(R of 100) and then to quantify Codec R-Degradation Max. R-Value tainment and concealment.
“R-Degradations” that are modeled from G.711 0 94 Packet error correction or error distribu-
equipment and channel characteristics. G.729A 11 83 tion mechanisms can be employed at the
The sum of the R-Degradations is sub- G.723.1 15 79 cost of additional delay and/or bandwidth.
tracted from 100 to yield an overall R. The Error correction can be achieved using
lower the resultant R, the lower the quality. Figure 2 shows how one of the key fac- duplicate streams, Forward Error
Figure 1 illustrates the relationships tors in packetized voice—one-way delay— Correction (FEC) or jitter buffers to capture
between user satisfaction and MOS or correlates to R-Degradation for different or regenerate delayed, lost or corrupted
R-Values. With standard narrowband (300 speech compression techniques. The knee packets. Error distribution involves
to 3,400 Hz) telephony, the E-Model gener- in each curve illustrates how R-Degradation spreading out errors over a larger period of
ates a maximum attainable R of 94.15 cor- increases at a much faster rate after reaching time so that multiple consecutive packet
responding to a MOS of ~4.5. An R of 80 a one-way delay of around 175 ms. losses appear more randomly distributed.

As seen in July 2001


SpecialReport

These mechanisms must be judiciously codecs, with and without PLC. This model inal unimpaired audio file with the
employed, because using extra bandwidth shows that PLC can have a dramatic impaired file (strictly an off-line function),
increases transmission cost and adding extra impact on VoIP quality in the face of whereas VQmon operates in real time by
delay introduces a greater risk of passing the impairments. analyzing the pattern of transmission
knee of the Figure 2 delay curve where the Some PLC schemes, such as those channel events.
quality effects of delay are exaggerated. available from Global IP Sound, achieve VQmon computes the R quality metric
Error containment is largely coupled even better results than illustrated in at the conclusion of each call (and on an
with the choice of codec. High-compres- Figure 3 (while maintaining compatibility instantaneous basis, if desired), taking into
sion voice codecs such as G.729 and G.723 with existing codecs at the transmitting account dynamic user-perception effects
omit the transmission of redundant infor- end). Codecs using enhanced techniques such as recency and gradual recognition of
mation and transmit only changes from can sustain reasonable voice quality even quality changes. Presented at the conclu-
one sample to the next. This means that if through significantly impaired channels, sion of every call is a detailed record of
a frame is lost, the error persists into the which use a combination of error distribu- user-perceived voice quality for that call:
following frames because the reference tion, error containment and 3rd-Order
state of the decoder has been corrupted. error concealment. • Rend represents user’s memory of call
codecs that don’t depend on history, such quality, incorporating recency effect.
as G.711, have a higher base R-Value and Dynamic Measurement of • Rav represents the average quality for
are more amenable to error concealment. Voice Quality the specific call.
Packet Loss Concealment (PLC), The quality monitoring tools in wide- • Rmax represents the maximum quality
wherein lost packets are replaced with sub- spread use today don’t satisfy the needs of level experienced during the call.
stitutes, is the most productive error-miti- VoIP service providers. Some are based on • Rmin represents the minimum quality
gation technique for VoIP. There are four long-term averages of network statistics level experienced during the call.
classes of PLC: for jitter, packet loss and delay, which
don’t relate directly to how users perceive VoIP service providers can easily inte-
• 0th Order: substitute a constant packet quality on individual calls. Others are grate this information with their network
• 1st Order: substitute the last packet intrusive snapshots, which at best reflect or service management systems. They can
• 2nd Order: substitute an interpolated averages rather than experiences of indi- also include it into SLAs, because it docu-
packet vidual users. Examples of the latter are ments the real-time quality experiences of
• 3rd Order: substitute a modeled packet Perceptual Speech Quality Measure every call. VQmon is being integrated into
(PSQM) and Perceptual Analysis commercially available products such as
The 0th- and 1st-Order PLC methods Measurement System (PAMS) test calls, endpoint processing and interface modules
aren’t effective against the most important which actually impair quality by using from Brooktrout.
quality impairment, which is consecutive valuable network bandwidth. Recent technology advances have
packet losses. They’re therefore consid- The limitations of today’s quality mon- made possible a comprehensive frame-
ered primitive by modern standards. itoring systems are overcome with new work for addressing QoS in voice-over-
Substituting an interpolated packet tools based on emerging standards that packet services. Metrics based on the
(2nd Order) can be useful for both single extend the E-Model to include packet-loss standards-based E-Model can be used in
and double packet losses. Logically, this distribution and recency effects. A prime two key ways to overcome previous obsta-
involves computing the average of the example is VQmon, a product from cles to VoIP market acceptance: 1) to apply
neighboring packets and using this as an Telchemy, which models the effects of trade-offs that improve absolute quality
estimate for the missing packet(s). time-varying network impairments and and 2) to generate accurate estimates of
Third-order techniques are the most human short-term auditory memory. It’s a user-quality perception on a real-time, call-
advanced and are useful even in the face of non-intrusive, real-time monitoring tech- by-call basis.
triple packet losses. A 3rd-Order PLC nique that is implemented at the network This allows VoIP service providers to
engine monitors a stream of packets and endpoints, typically at the egress from a offer SLAs with the kind of meaningful
builds a dynamic model of the embedded VoIP service provider. voice-quality guarantees that current and
speech. When one or more packet losses In a recent subjective study comparing potential users have been demanding.
occur, the output of the model is inserted to VQmon with the widely used PSQM and Once users can secure the benefits of VoIP
replace the lost packets. These PLC sys- PAMS measurement algorithms, VQmon without concerns over quality, widespread
tems are good at tracking speech charac- was shown to be more accurate in pre- market acceptance will be the likely out-
teristics to which humans are sensitive, dicting the quality rank (MOS) given by come. Now is the time.
such as pitch or amplitude profiles. human listeners. This is particularly
Figure 3 illustrates the R-Degradation impressive when considering that PSQM
effects of packet loss using different and PAMS operate by comparing the orig-

As seen in July 2001


For more information please contact:
Brooktrout Technology
Needham, MA
Tel: (781) 449-4100 • Fax: (781) 449-9009
info@brooktrout.com
www.brooktrout.com

Brooktrout Technology Europe, Lid.


Hoeilaart Office Part
Vandanmmestraat 5, Box 2
1560 hoeilaart, Belgium
Tel: 32-2-658-0170 • Fax: 32-2-658-0180

As seen in July 2001

Anda mungkin juga menyukai