Engineering:Ambisonic decoding

From HandWiki
Main page: Engineering:Ambisonics

This page focusses on decoding of classic first-order Ambisonics. Other relevant information is available on the Ambisonic reproduction systems page.

The Ambisonic B-format WXYZ signals define what the listener should hear. How these signals are presented to the listener by the speakers for best results, depends on the number of speakers and their location. Ambisonics treats directions where no speakers are placed with as much importance as speaker positions. It is undesirable for the listener to be conscious that the sound is coming from a discrete number of speakers. Some simple decoding equations are known to give good results for common speaker arrangements.

But Ambisonic Speaker Decoders can use much more information about the position of speakers, including their exact position and distance from the listener. Because human beings use different mechanisms to locate sound, Classic Ambisonic Decoders it is desirable to modify the speaker feeds at each frequency to present the best information using Shelf Filters.

Some views on the complexities of Shelf Filters and Distance Compensation are explained in "Ambisonic Surround Decoders"[1] and "SHELF FILTERS for Ambisonic Decoders".[2]

There are specialised decoders for large audiences in large spaces.

Hardware decoders have been commercially available since the late 1970s; currently, Ambisonics is standard in surround products offered by Meridian Audio, Ltd. Ad hoc software decoders are also available.

There are five main types of decoder:

Diametric decoders

This design is intended for a domestic, small room setting, and allows speakers to be arranged in diametrically opposed pairs.

Regular Polygon decoders

This design is intended for a domestic, small room setting. The speakers are equidistant from the listener and lie equally spaced on the circumference of a circle. The simplest Regular Polygon decoder is a Square with the listener in the centre. At least four speakers are required. Triangles do not work, exhibiting large "holes" between the speakers. Regular Hexagons perform better than Squares especially to the sides.

For the simplest (two dimensional) case (no height information), and spacing the loudspeakers equally in a circle, we derive the loudspeaker signals from the B-format W, X and Y channels:

[math]\displaystyle{ P_n = W + X \cos\theta_n + Y \sin\theta_n }[/math]

where [math]\displaystyle{ \theta_n }[/math] is the direction of the speaker under consideration.

The most useful of these is the Square 4.0 decoder.

The coordinate system used in Ambisonics follows the right hand rule convention with positive X pointing forwards, positive Y pointing to the left and positive Z pointing upwards. Horizontal angles run anticlockwise from due front and vertical angles are positive above the horizontal, negative below.

Auditorium decoders

This design is intended for a large, public space setting.

"Vienna" decoders

These are so named because the paper introducing them was presented at the 1992 AES conference held in Vienna. The design was covered by a 1998 patent.[3] from Trifield Productions. The technology provides one approach to the decoding of Ambisonic signals to irregular loudspeaker arrays (such as ITU) commonly used for 5.1 surround sound replay. A slight flaw in the 1992 published papers decoder coefficients, and the use of heuristic search algorithms in order to solve the set of non-linear simultaneous equations needed to generate the decoders was published by Wiggins et al. in 2003,[4] and later extended to higher order irregular decoders in 2004[5]

Parametric decoders

The idea behind parametric decoding is to treat the sound's direction of incidence as a parameter that can be estimated through time–frequency analysis. A large body of research into human spatial hearing[6][7] suggests that our auditory cortex applies similar techniques in its auditory scene analysis, which explains why these methods work.

The major benefits of parametric decoding is a greatly increased angular resolution and the separation of analysis and synthesis into separate processing steps. This separation allows B-format recordings to be rendered using any panning technique, including delay panning, VBAP[8] and HRTF-based synthesis.

Parametric decoding was pioneered by Lake DSP[9] in the late 1990s and independently suggested by Farina and Ugolotti in 1999.[10] Later work in this domain includes the DirAC method[11] and the Harpex method.[12]

Irregular Layout Decoders

The Rapture3D decoder from Blue Ripple Sound supports this and is already used in a number of computer games using OpenAL.

See also

References

  1. Lee, Richard (18 February 2007). "Ambisonic Surround Decoder". Ambisonia.com. http://www.ambisonia.com/Members/ricardo/ASD.htm. Retrieved 4 April 2009. 
  2. Lee, Richard (14 April 2007). "SHELF FILTERS for Ambisonic Decoders" (Zipped Microsoft Word document). Ambisonia.com. http://www.ambisonia.com/Members/ricardo/shelfs.zip/view. Retrieved 4 April 2009. 
  3. Gerzon, Michael Anthony & Geoffrey James Barton, "Surround sound apparatus", US patent 5757927, published 1998-05-26, assigned to Trifield Productions Ltd.
  4. Wiggins, Bruce; Paterson-Stephens, Iain; Lowndes, Val; Berry, Stuart (2003). "The Design and Optimisation of Surround Sound Decoders Using Heuristic Methods". Proceedings of UKSim 2003, Conference of the UK Simulation Society: 106–114. http://www.brucewiggins.co.uk/wp-content/plugins/download-monitor/download.php?id=7. 
  5. Wiggins, Bruce (2004). An Investigation into the Real-time Manipulation and Control of Three-dimensional Sound Fields (PhD). University of Derby. hdl:10545/217795.
  6. Blauert, Jens (1997). Spatial Hearing: The Psychophysics of Human Sound Localization (Revised ed.). Cambridge, MA: MIT Press. ISBN 978-0-262-02413-6. https://books.google.com/books?id=wBiEKPhw7r0C. 
  7. Bregman, Albert S. (29 September 1994). Auditory Scene Analysis: The Perceptual Organization of Sound. Bradford Books. Cambridge, MA: MIT Press. ISBN 978-0-262-52195-6. https://books.google.com/books?id=jI8muSpAC5AC&q=%22Auditory+scene+analysis%22+Bregman. 
  8. "Vector base amplitude panning". Research / Spatial sound. Otakaari, Finland: TKK Acoustics. 18 January 2006. http://www.acoustics.hut.fi/research/cat/vbap/. Retrieved 12 May 2012. 
  9. McGrath, David Stanley & Adam Richard McKeag, "Wavelet conversion of 3-D audio signals", US patent 6628787, published 2003-09-30, assigned to Lake Technology Ltd.
  10. Farina, Angelo; Ugolotti, Emanuele (April 1999). "Subjective Comparison Between Stereo Dipole and 3D Ambisonic Surround Systems for Automotive Applications". AES 16th International conference on Spatial Sound Reproduction. Rovaniemi, Finland: AES. s78357. http://pcfarina.eng.unipr.it/Public/Papers/126-AESC99.PDF. Retrieved 12 May 2012. 
  11. "Directional Audio Coding". Research / Spatial sound. Otakaari, Finland: TKK Acoustics. 23 May 2011. http://www.acoustics.hut.fi/research/cat/DirAC/. Retrieved 12 May 2012. 
  12. "Harpex". Oslo, Norway: Harpex Limited. 2011. http://harpex.net/. Retrieved 12 May 2012. 

External links