SSE5

The SSE5 (short for Streaming SIMD Extensions version 5) was a SIMD instruction set extension proposed by AMD on August 30, 2007 as a supplement to the 128-bit SSE core instructions in the AMD64 architecture.

AMD chose not to implement SSE5 as originally proposed. In May 2009, AMD replaced SSE5 with three smaller instruction set extensions named as XOP, FMA4, and F16C, which retain the proposed functionality of SSE5, but encode the instructions differently for better compatibility with Intel's proposed AVX instruction set.

The three SSE5-derived instruction sets were introduced in the Bulldozer processor core, released in October 2011 on a 32 nm process.^[1]

Compatibility

AMD's SSE5 extension bundle does not include the full set of Intel's SSE4 instructions, making it a competitor to SSE4 rather than a successor.

SSE5 enhancements

The proposed SSE5 instruction set consisted of 170 instructions (including 46 base instructions), many of which are designed to improve single-threaded performance. Some SSE5 instructions are 3-operand instructions, the use of which will increase the average number of instructions per cycle achievable by x86 code.^[2] Selected new instructions include:^[3]

Fused multiply–accumulate (FMACxx) instructions
Integer multiply–accumulate (IMAC, IMADC) instructions
Permutation (PPERM, PERMPx) and conditional move (PCMOV) instructions
Precision control, rounding, and conversion instructions

AMD claimed SSE5 would provide dramatic performance improvements, particularly in high-performance computing (HPC), multimedia, and computer security applications, including a 5x performance gain for AES encryption and a 30% performance gain for the discrete cosine transform (DCT) used for example in video processing.^[2]

2009 revision

The SSE5 specification included a proposed extension to the general coding scheme of x86 instructions in order to allow instructions to have more than two operands. In 2008, Intel announced their planned AVX instruction set which proposed a different way of coding instructions with more than two operands. The two proposed coding schemes, SSE5 and AVX, are mutually incompatible, although the AVX scheme has certain advantages over the SSE5 scheme: most importantly, AVX has plenty of space for future extensions, including larger vector sizes.

In May 2009, AMD published a revised specification for the planned future instructions. This revision changes the coding scheme to make it compatible with the AVX scheme, but with a differing prefix byte in order to avoid overlap between instructions introduced by AMD and instructions introduced by Intel.

The revised instruction set no longer carries the name SSE5, which has been criticized for being misleading, but most of the instructions in the new revision are functionally identical to the original SSE5 specification—only the way the instructions are coded differs. The planned additions to the AMD instruction set consists of three subsets:

XOP: Integer vector multiply–accumulate instructions, integer vector horizontal addition, integer vector compare, shift and rotate instructions, byte permutation and conditional move instructions, floating point fraction extraction.
FMA4: Floating-point vector multiply–accumulate.
F16C: Half-precision floating-point conversion.

Both XOP and FMA4 are removed in newer AMD processors using the Zen microarchitecture.^[4]

References

↑ Hruska, Joel (November 14, 2008). "AMD Fusion now pushed back to 2011". Ars Technica. https://arstechnica.com/news.ars/post/20081114-amd-fusion-now-pushed-back-to-2011.html.
↑ ^2.0 ^2.1 Vance, Ashlee (August 30, 2007). "AMD plots single thread boost with x86 extensions". The Register. https://www.theregister.co.uk/2007/08/30/amd_sse5/.
↑ "128-Bit SSE5 Instruction Set". AMD Developer Central. http://developer.amd.com/SSE5.
↑ Michael Larabel (March 3, 2017). "The Impact Of GCC Zen Compiler Tuning On AMD Ryzen Performance". http://www.phoronix.com/scan.php?page=article&item=amd-ryzen-znver1&num=1. "But with Zen being a clean-sheet design, there are some instruction set extensions found in Bulldozer processors not found in Zen/znver1. Those no longer present include FMA4 and XOP."

External links

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/SSE5. Read more

[1] Hruska, Joel (November 14, 2008). "AMD Fusion now pushed back to 2011". Ars Technica. https://arstechnica.com/news.ars/post/20081114-amd-fusion-now-pushed-back-to-2011.html.

[Reg1-2] 2.0 ^2.1 Vance, Ashlee (August 30, 2007). "AMD plots single thread boost with x86 extensions". The Register. https://www.theregister.co.uk/2007/08/30/amd_sse5/.

[3] "128-Bit SSE5 Instruction Set". AMD Developer Central. http://developer.amd.com/SSE5.

[4] Michael Larabel (March 3, 2017). "The Impact Of GCC Zen Compiler Tuning On AMD Ryzen Performance". http://www.phoronix.com/scan.php?page=article&item=amd-ryzen-znver1&num=1. "But with Zen being a clean-sheet design, there are some instruction set extensions found in Bulldozer processors not found in Zen/znver1. Those no longer present include FMA4 and XOP."

[1]

[2]

[3]

[4]

v t e AMD technology
Software	AMD Radeon Software AGESA AMDGPU
Platforms	Spider Dragon Horus
Technology	Cool'n'Quiet High Bandwidth Memory PowerNow! PowerPlay PowerTune Turbo Core ASTC AMD Wraith
Instructions	X86-64 3DNow! AVX XOP CVT16/F16C FMA FMA3 FMA4 BMI ABM BMI1 TBM SSE5 ASF AES

v t e Instruction set extensions
SIMD (RISC)	Alpha MVI ARM NEON SVE MIPS MDMX MIPS-3D MXU MIPS SIMD PA-RISC MAX Power ISA VMX SPARC VIS
SIMD (x86)	MMX (1996) 3DNow! (1998) SSE (1999) SSE2 (2001) SSE3 (2004) SSSE3 (2006) SSE4 (2006) SSE5 ~~(2007)~~ AVX (2008) F16C (2009) XOP (2009) FMA (FMA4: 2011, FMA3: 2012) AVX2 (2013) AVX-512 (2015)
Bit manipulation	BMI (ABM: 2007, BMI1: 2012, BMI2: 2013, TBM: 2012) ADX (2014)
Compressed instructions	Thumb MIPS16e ASE
Security and cryptography	AES-NI (2008); 32- and 64-bit ARMv8 also has AES instructions CLMUL (2010) RDRAND (2012) SHA (2013) MPX (2015) SGX (2015)
Transactional memory	TSX (2013) ASF
Virtualization	VT-x (2005) AMD-V (2006)
Suspended extensions' dates have been ~~struck through~~.

Anonymous

Search

SSE5

Namespaces

More

Page actions

Contents

Compatibility

SSE5 enhancements

2009 revision

See also

References

External links

Navigation

Navigation

Resources

Help

googletranslator

Navigation

Wiki tools

Wiki tools

Anonymous

Search

SSE5

Compatibility

SSE5 enhancements

2009 revision

See also

References

External links

Navigation

Wiki tools

Page tools

Other projects

Categories