Wideband audio

From HandWiki
Short description: High-quality audio telephony
Audio bands in telephony[1]
Name Range (Hz)
Narrowband 300–3,400
Wideband 50–7,000
Superwideband 50–14,000
Fullband 20–20,000

Wideband audio, also known as wideband voice or HD voice, is high definition voice quality for telephony audio, contrasted with standard digital telephony "toll quality". It extends the frequency range of audio signals transmitted over telephone lines, resulting in higher quality speech. The range of the human voice extends from 100 Hz to 17 kHz[2] but traditional, voiceband or narrowband telephone calls limit audio frequencies to the range of 300 Hz to 3.4 kHz. Wideband audio relaxes the bandwidth limitation and transmits in the audio frequency range of 50 Hz to 7 kHz.[3][1] In addition, some wideband codecs may use a higher audio bit depth of 16 bits to encode samples, also resulting in much better voice quality.[citation needed]

Wideband codecs have a typical sample rate of 16 kHz. For superwideband codecs the typical value is 32 kHz.[1]

History

In 1987, the International Telecommunication Union (ITU) standardized a version of wideband audio known as G.722. Radio broadcasters began using G.722 over Integrated Services Digital Network (ISDN) to provide high-quality audio for remote broadcasts, such as commentary from sports venues. AMR-WB (G.722.2) was developed by Nokia and VoiceAge and it was first specified by 3GPP.

The traditional telephone network (PSTN) is generally limited to narrowband audio by the intrinsic nature of its transmission technology, TDM (time-division multiplexing), and by the analogue-to-digital converters used at the edge of the network, as well as the speakers, microphones and other elements in the endpoints themselves.

Wideband audio has been broadly deployed in conjunction with videoconferencing. Providers of this technology quickly discovered that despite the explicit emphasis on video transmission, the quality of the participant experience was significantly influenced by the fidelity of the associated audio signal.

Communications via Voice over Internet Protocol (VoIP) can readily employ wideband audio. When PC-to-PC calls are placed via VoIP services, such as Skype, and the participants use a high-quality headset, the resulting call quality can be noticeably superior to conventional PSTN calls. A number of audio codecs have emerged to support these services, supplementing G.722.

Manufacturers of audio conferencing equipment have introduced wideband-capable models that include support for G.722 over VoIP.

Conference calls are a direct beneficiary of the enhancements offered by wideband audio. Participants often struggle to figure out who is talking or to understand accented speakers. Misunderstandings are commonplace due primarily to generally poor audio quality and accumulation of background noise.

Some listener benefits cited of wideband audio compared to traditional (narrowband):

  • Clearer overall sound quality
  • Easier to recognize voices, distinguish confusing sounds and understand accented speakers
  • Ease of deciphering words that have the close sounds of ‘s’ and ‘f’ and others, often indistinguishable over telephone lines
  • Ability to hear faint talkers and to understand double-talk (when more than one person is speaking at the same time)
  • Reduced listening effort (decreased cognitive load), resulting in increased productivity and lessened listener fatigue
  • Better understanding amidst other impairments, such as when talkers are using a speakerphone or in the presence of background noise

Despite its reputation for poor audio quality, the mobile telephone industry has started to make some progress on wideband audio. The 3GPP standards group has designated G.722.2 as its wideband codec and calls it Adaptive Multirate – Wideband (AMR-WB). More than a hundred handsets have been introduced supporting this codec (for example, Apple, Google, HTC, Nokia, Samsung and Sony), and network demonstrations have been conducted.[citation needed]

Deployment

VoIP

As business telephone systems have adopted VoIP technology, support for wideband audio has grown rapidly. Telephone sets from Avaya, Cisco, NEC Unified Solutions, Grandstream, Gigaset, Panasonic (which brands wideband audio "HD Sonic"), Polycom (which brands wideband audio "HD Voice"), Snom, AudioCodes (which brands wideband audio "HDVoIP") and others now incorporate G.722, as well as varying degrees of higher-quality audio components.

Suppliers of integrated circuits for telephony equipment, including DSP Group, Broadcom, Infineon, and Texas Instruments, include wideband audio in their feature portfolios. There are audio conferencing service providers that support wideband connections from these and other VoIP endpoints, while also permitting PSTN participants to join the conference in narrowband. sipXtapi is an open-source solution for VoIP media processing engine supporting wideband and HD voice that provides RTP and codecs through a plugin framework for use with SIP and other VoIP protocols. Skype uses an audio codec called Silk which allows for extremely high-quality audio.

A number of carriers around the world have rolled out HD voice services based on the G.722 wideband standard. In North America, hosted service providers have recently[when?] deployed the Aastra Hi-Q upgrade to its installed user base and as of January 2010 claimed around 70,000 HD voice endpoints. The consumer service provider ooma has an estimated 25,000 HD voice endpoints deployed stemming from its rollout of its second-generation Telo hardware.

VoLTE

In cellular communication, "HD Voice" specifically refers to AMR-WB (G.722.2) in VoLTE, but AMR-WB in turn does not specify quality or bitrate. Likewise for HD Voice+ and AMR-WB+. GSMA has an HD trademark and runs two certification program around the HD and HD+ logos.[4]

AMR-WB is natively supported in Android since Android Gingerbread,[5] and in iOS since the iPhone 5.[6]

As of December 2015, a report announces 117 commercial mobile HD Voice networks launched in 76 countries.[7]

Many mobile networks including AT&T[8] and Verizon are discontinuing support for phones that don't support 4G and wideband audio.

Wideband audio coding standards

The following are wideband audio coding standards and audio codecs used in telecommunication.[9]

ITU-T

Year Wideband audio coding standard Wideband speech coding algorithm Ref
1988 G.722 SB-ADPCM [10]
1999 G.722.1 (Siren7) MDCT [11]
2003 G.722.2 (Adaptive Multi-Rate Wideband) ACELP [12]
2006 G.729.1 MDCT [13]
2008 G.711.1 MDCT [14]
G.718 MDCT [15]

GSMA

3GPP

Others

References

  1. 1.0 1.1 1.2 Cox, R. V.; Neto, S. F. De Campos; Lamblin, C.; Sherif, M. H. (October 2009). "ITU-T coders for wideband, superwideband, and fullband speech communication [Series Editorial"]. IEEE Communications Magazine 47 (10): 106–109. doi:10.1109/MCOM.2009.5273816. https://ieeexplore.ieee.org/document/5273816. 
  2. "Human Voice Frequency Range" (in en). 2020-06-14. https://seaindia.in/sea-online-journal/human-voice-frequency-range/. 
  3. "Answering the call of HD Voice". Global IP Sound. http://gipscorp.com/blog/2009/04/13/answering-the-call-of-hd-voice/. 
  4. "HD Voice". https://www.gsma.com/futurenetworks/ip_services/hd-voice/. 
  5. "MediaRecorder.AudioEncoder". https://developer.android.com/reference/android/media/MediaRecorder.AudioEncoder. 
  6. Buster Hein (13 September 2012). "The iPhone 5 Supports HD Voice, But You'll Never Get To Use It In The U.S.". http://www.cultofmac.com/190565/the-iphone-5-supports-hd-voice-but-youll-never-get-to-use-it-in-the-u-s/. 
  7. "Mobile HD Voice: Global Update report". GSA. 2014-09-19. http://www.gsacom.com/downloads/pdf/GSA_mobile_hd_voice_190914.php4. 
  8. "Get Ready, 3G is Going Away in 2022". https://www.att.com/support/article/wireless/KM1324171/. 
  9. "Which wideband codec to choose?". TMCnet. http://blog.tmcnet.com/voice-of-ip/2010/04/which_wideband_codec_to_choose.html. 
  10. ITU-T G.722 page ITU-T Recommendation G.722 (11/88), "7 kHz audio-coding within 64 kbit/s"
  11. Lutzky, Manfred; Schuller, Gerald; Gayer, Marc; Krämer, Ulrich; Wabnik, Stefan (May 2004). "A guideline to audio codec delay". 116th AES Convention. Audio Engineering Society. https://www.iis.fraunhofer.de/content/dam/iis/de/doc/ame/conference/AES-116-Convention_guideline-to-audio-codec-delay_AES116.pdf. Retrieved 24 October 2019. 
  12. ACELP map, VoiceAge Corporation
  13. Nagireddi, Sivannarayana (2008). VoIP Voice and Fax Signal Processing. John Wiley & Sons. p. 69. ISBN 9780470377864. https://books.google.com/books?id=5AneeZFE71MC&pg=PA69. 
  14. Sasaki, Shigeaki; Mori, Takeshi; Hiwasaki, Yusuke; Ohmuro, Hitoshi (August 2008). "Global Standard for Wideband Speech Coding: ITU-T G.711.1 (G.711 wideband extension)". NTT Technical Review (Nippon Telegraph and Telephone). https://www.ntt-review.jp/archive/ntttechnical.php?contents=ntr200808le1.html. 
  15. "ITU-T Work Programme". https://www.itu.int/ITU-T/workprog/wp_item.aspx?isn=344. 
  16. "HD Voice" (in en-US). https://www.gsma.com/futurenetworks/ip_services/hd-voice/. 
  17. "Enhanced Voice Services Codec for LTE". https://www.3gpp.org/news-events/3gpp-news/1639-evs_news. 

External links