Adaptive Transform Acoustic Coding

From HandWiki
Revision as of 22:15, 6 March 2023 by LinuxGuru (talk | contribs) (add)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Adaptive Transform Acoustic Coding
Atrac.png
Filename extension.aa3
.oma
.at3
.at9
Developed bySony Corporation
Type of formatAudio file format

Adaptive Transform Acoustic Coding (ATRAC) is a family of proprietary audio compression algorithms developed by Sony. MiniDisc was the first commercial product to incorporate ATRAC in 1992. ATRAC allowed a relatively small disc like MiniDisc to have the same running time as CD while storing audio information with minimal loss in perceptible quality. Improvements to the codec in the form of ATRAC3, ATRAC3plus, and ATRAC Advanced Lossless followed in 1999, 2002, and 2006 respectively.[1]

Other MiniDisc manufacturers such as Sharp and Panasonic also implemented their own versions of the ATRAC codec.

On 31 March 2008 Sony all but dropped the ATRAC-related codecs in the USA and Europe, and in their SonicStage powered Connect Music Store (Sony's equivalent of iTunes and iTunes Music Store). In October 2009 Sony also announced that ATRAC would be discontinued in all remaining markets.[2]

General bitrate quality

ATRAC's original 292 kbit/s bitrate, as used on the original MiniDiscs, was designed to be close to CD quality acoustically. Years later ATRAC was improved and is generally considered better than earlier versions at similar bitrates. For purposes of comparison, CDs are encoded at 1411.2 kbit/s, and lossless encoders can encode most CDs below 1000 kbit/s, with significant bitrate reduction for easier-to-encode content such as voice.

Performance

According to ATRAC engineers, ATRAC algorithms were developed in close cooperation with LSI integrated circuit development engineers within Sony in order to deliver on a tangible product that could encode at high speeds and with minimal power consumption.[3] This is in contrast to other codecs developed on computers with no regard for the constraints of portable hardware. This is reflected in the design of the ATRAC codecs, which tend to emphasize processing smaller numbers of samples at a time to save memory at the cost of compression efficiency and additional multiplies. These trade-offs are entirely logical on DSP systems, where memory was often at a premium compared to multiplier performance.

Sony Walkmans offer better battery life when playing ATRAC files as compared to MP3 files. However, as Sony only pushed ATRAC compatibility in Sony Ericsson Walkman series phones in the Japanese market, it is not supported in GSM/UMTS market phones. Sony's Xplod series of car audio CD players support ATRAC CDs. Minidiscs with ATRAC format songs have, in the past, been supported on Eclipse brand car stereos.

ATRAC1

ATRAC1 was first used in Sony's own theater format SDDS system in the 1990s, and in this context is a direct competitor to Dolby Digital (AC3) and DTS. SDDS uses ATRAC1 with 8 channel encoding, and with a total encoding rate over all the channels of 1168 kbit/s.

Two stacked quadrature mirror filters split the signal into 3 parts:

  • 0 to 5.5125 kHz
  • 5.5125 to 11.025 kHz
  • 11.025 to 22.05 kHz

Full stereo (i.e., independent channel) encoding with a data rate of 292 kbit/s.

High-frequency lowpass depends on the complexity of the material; some encodings have content clear up to 22.05 kHz.

ATRAC1 can also be used in mono (one channel) mode, doubling recording time.

FFmpeg has an implementation[4] of an ATRAC1 decoder.

ATRAC3 (LP2 and LP4 Modes)

Like ATRAC1 and MP3, ATRAC3 is also a hybrid subband-MDCT encoder, but with several differences.

In ATRAC3, Three stacked QMF split the signal into 4 parts:

  • 0 to 2.75625 kHz (DC to f/16)
  • 2.75625 to 5.5125 kHz (f/16 to f/8)
  • 5.5125 to 11.025 kHz (f/8 to f/4)
  • 11.025 to 22.05 kHz (f/4 to f/2)

The four subbands are then MDCT encoded using a fixed-length transform. Unlike nearly all modern formats, the transform length cannot be varied to optimize coding transients. Instead, a simpler transient encoding technique called gain control is used, in which the gain of different subbands is varied during a transient prior to MDCT and then restored during decoding after the inverse MDCT to try to smooth over transients. Additionally, prior to quantization, tonal components are subtracted from the signal and independently quantized. During decoding, they are separately reconstructed and added back to reform the original MDCT coefficients.

Sony claims the major advantage of ATRAC3 is its coding efficiency, which was tuned for portable DSP which provides less computing power and battery life. However, as ATRAC is a hybrid subband-MDCT codec that is algorithmically very similar to MP3, any advantage is probably exaggerated. Furthermore, compared to newer formats such as Windows Media Audio which use a simple MDCT rather than a hybrid, ATRAC3 must perform an additional and computationally expensive inverse-QMF, although the hybrid system does significantly reduce memory usage, which was likely a factor given the limited memory available when ATRAC was originally developed.

LP2 Mode

This uses a 132 kbit/s data rate, the quality of which is advertised to be similar to that of MP3 encoded at a similar bit rate. However, in an independent double-blind test (2004/05) without format encoding parameters reference against Ogg Vorbis, AAC, and LAME VBR MP3, ATRAC3 came last.[5] Unfortunately, due to lack of transparency in ATRAC encoder versioning, it is not known if the ATRAC3 encoder tested was optimal, and subsequent investigation was inconclusive. It is possible that newer ATRAC3 encoders offer better performance.

LP4 Mode

This reduces the data rate to 66 kbit/s (half that of LP2), partly by using joint stereo coding and a lowpass filter around 13.5 kHz. It allows 324 minutes to be recorded on an 80-minute MiniDisc, with the same padding required as LP2.

Notes

FFmpeg has an implementation[6] of an ATRAC3 decoder, which was converted to fixed precision and implemented in the Rockbox series of firmware for ARM, Coldfire and MIPS processors. RealAudio8 is a high-bitrate implementation of ATRAC3 (up to 352.8kbit/s).

The PlayStation 3 video game Race Driver uses 224 simultaneous streams of ATRAC3 compressed audio, with between one and eight channels per stream at sample rates between 24 and 48 kHz, each filtered using 512 frequency bands of adaptive equalisation, routed via six reverb units running on the same SPU co-processor (one of eight on the PS3's Cell chip), alongside 7.1 channel hybrid third-order Ambisonic mixing.[7]

ATRAC3plus

ATRAC3plus CD playing on a Sony car stereo.

This codec is used in Sony Hi-MD Walkman devices (e.g., "Hi-LP and Hi-SP"), Network Walkman players, Memory Stick players, VAIO Pocket, PS3 and PSP console, and ATRAC CD players. It is a hybrid subband/MDCT codec based on a 16 channel QMF followed by a 128-point MDCT. Prior to MDCT coding, Generalized Harmonic Analysis (GHA) is used to extract tonal components, an improved version of the process used in ATRAC3. As in previous ATRAC versions, gain control is used to control preecho rather than variable sized transforms, although different MDCT windows are apparently possible.

SonicStage version 3.4, released in Feb 2006,[8] introduced ripping CDs in bitrates 320 and 352.[9] The available bitrates are: 48, 64, 96, 128, 160, 192, 256, 320 and 352 kbit/s. The newer bitrates are not always compatible with all older hardware decoders, however, some of the older hardware has been found to be compatible with certain newer ATRAC3plus bitrates.

MiniDiscs recorded in this format are incompatible with older players.

In a test conducted by an independent firm, but financed by Sony, it was concluded that ATRAC3plus at 64 kbit/s is equal in subjective sound quality to an obsolete MP3 encoder at 128 kbit/s.[10] Performance against modern high quality MP3 encoders was not evaluated.

ATRAC Advanced Lossless

ATRAC Advanced Lossless is a "scalable" lossless audio codec that records a lossy ATRAC3 or ATRAC3plus stream, and supplements it with a stream of correction information stored within the file itself that allows the original signal to be reproduced, if desired. A player/decoder can extract and use just the ATRAC3 or ATRAC3plus data, or it can combine that with the correction stream to perfectly reproduce the original audio information. This allows the file to be decoded as either lossless or lossy. It is implemented in such a way that allows the file size to be smaller than uncompressed or compressed versions of the same file. Compression is approximately 30–80% of the original file. Benefits of scalable compression include providing backward compatibility, such that older devices that are not AAL-aware can still have the ATRAC3 stream available for playback without understanding the AAL format, and faster transfer speed between portable audio devices and PC.[11]

ATRAC Advanced Lossless is widely supported in older Walkman players and SonicStage version 4 or later. SonicStage 4 allows download of ATRAC Advanced Lossless to MiniDisc Players, PlayStation Portable, and PlayStation 3. Recent Walkman players do not support ATRAC Advanced Lossless/ATRAC.

Comparable technology

AAL's use of a "core" (lossy) and "residual" (correction) stream is similar to the idea behind Opus, MPEG-4 SLS, DTS-HD Master Audio, Dolby TrueHD and Ogg Vorbis bitrate peeling. In fact, AAL was the first to be released in the commercial market with this scheme for backward compatibility.

WavPack hybrid mode and OptimFROG DualStream are in the same category, but store the correction stream in a separate file.

ATRAC9

According to Sony ATRAC9 is a high-compression audio codec optimized for games, offering low delay (granularity) and low CPU and memory usage. Used in PS4 and PS Vita consoles. Audio middleware such as FMOD and Audiokinetic Wwise supports it.[12][13]

See also

References

  1. "Siliconvalley.com". http://www.siliconvalley.com/news/ci_6759779. 
  2. "SonicStage In Its Death Throes, Replaced By New x-app Software (Japan Only)". Sony Insider. https://www.sonyinsider.com/2009/10/12/sonicstage-in-its-death-throes-replaced-by-new-x-app-software-japan-only/. 
  3. "Sony Global - ATRAC - Developer's Interview". https://www.sony.net/Products/ATRAC3/special/developers01.html. 
  4. "Source code for ATRAC decoder". http://git.ffmpeg.org/?p=ffmpeg;a=blob;f=libavcodec/atrac1.c;hb=HEAD. 
  5. "Results of Multiformat at 128kbit/s Listening Test". http://listening-tests.freetzi.com/html/Multiformat_128kbps_public_listening_test_results.htm. 
  6. "Source code for ATRAC3 decoder". http://git.ffmpeg.org/?p=ffmpeg;a=blob;f=libavcodec/atrac3.c;hb=HEAD. 
  7. "Develop - Issue 86 - August 2008". https://issuu.com/develop/docs/dev86_web. 
  8. "Overseas Model Customer Support". April 7, 2006. http://www.sony.jp:80/products/overseas/contents/support/download/index.html. 
  9. "Overseas Model Customer Support". February 5, 2006. https://www.sony.jp/products/overseas/contents/support/download/e_jeb_SS34ug.html. 
  10. "ITS test report". http://www.sony.net/Products/ATRAC3/tech/ITS_test_report.pdf. 
  11. "Sony Global - ATRAC - ATRAC Advanced Lossless". https://www.sony.net/Products/ATRAC3/tech/aal.html. 
  12. "FMOD sound formats". https://www.fmod.com/resources/documentation-api?page=content/generated/FMOD_SOUND_TYPE.html#/. 
  13. "include/AK/SoundEngine/Platforms/PS4/AkTypes.h File Reference". https://www.audiokinetic.com/library/edge/?source=SDK&id=include%2FAK%2FSoundEngine%2FPlatforms%2FPS4%2FAkTypes.h. 

External links

  • ATRACLife.com - This domain is now parked, forum content only available from google cache.

News portal, discussion forums and downloads related to ATRAC.