Nvidia NVENC

From HandWiki
Short description: Feature of GPUs by Nvidia


Nvidia NVENC (short for Nvidia Encoder)[citation needed] is a feature in Nvidia graphics cards that performs video encoding, offloading this compute-intensive task from the CPU to a dedicated part of the GPU. It was introduced with the Kepler-based GeForce 600 series in March 2012 (GT 610,GT620 and GT630 is Fermi Architecture).[1][2]

The encoder is supported in many livestreaming and recording programs, such as vMix, Wirecast, Open Broadcaster Software (OBS) and Bandicam, as well as video editing apps, such as Adobe Premiere Pro or DaVinci Resolve. It also works with Share game capture, which is included in Nvidia's GeForce Experience software.[3][4][5]

Until March 2023 consumer-targeted GeForce graphics cards officially support no more than 3 simultaneously encoding video streams, regardless of the count of the cards installed, but this restriction can be circumvented on Linux and Windows systems by applying an unofficial patch to the drivers. Doing so also unlocks NVIDIA Frame Buffer Capture (NVFBC), a fast desktop capture API that uses the capabilities of the GPU and its driver to accelerate capture.[6] Professional cards support between 3 and unrestricted simultaneous streams per card, depending on card model and compression quality,[1] the restrictions were loosened in 2023 allowing up to 5 simultaneously encoding video streams.[7] From January 2024 onwards, 8 simultaneous encoding video streams became the baseline.[8]

Nvidia chips also feature an onboard decoder, NVDEC (short for Nvidia Decoder), to offload video decoding from the CPU to a dedicated part of the GPU.[1]

Versions

NVENC has undergone several hardware revisions since its introduction with the first Kepler GPU (GK104).[9]

NVENC summary[10][11]
GPU Hardware H.264 (AVC)

(In H.264, NVENC always has B Frame support, max 4096x4096 resolution, and max 8-bit depth)

H.265 (HEVC) AV1
NVENC Generation GPU Code Name NVENC per Chip Chroma Lossless Coding Chroma Lossless Coding Resolution Color Depth B Frames AV1
4:2:0 4:4:4 4:2:0 4:4:4
1st Gen GK110 1 Yes No No H.265 not supported AV1 not supported
GK107
GK106
GK104
2nd Gen GM107 1 Yes Yes Yes H.265 not supported AV1 not supported
3rd Gen GM206 1 Yes Yes Yes Yes Y?[11] N?[10] No 4096 x 4096 8-bit No
GM204 2 No
GM200
4th Gen GP108 0 No NVENC encoders available
GP107 1 Yes Yes Yes Yes Yes Yes 8192 x 8192 10-bit No AV1 not supported
GP106
GP104-2xx+ 2
GP104-1xx 1
GP102 2
GP100 3 4096 x 4096
5th Gen GV10x 8192 x 8192
TU117 1
6th Gen TU116 Yes
TU106
TU104
TU102
7th Gen GA107 1 Yes Yes Yes Yes Yes Yes 8192 x 8192 10-bit Yes
GA106
GA104
GA102
GA100[12] 0 No NVENC encoders available
8th Gen AD107 1 Yes Yes Yes Yes Yes Yes 8192 x 8192 10-bit Yes Yes
AD106
AD104-250
AD104-400 2
AD103
AD102
NVENC Generation GPU Code Name NVENC per Chip Chroma Lossless Coding Chroma Lossless Coding Resolution Color Depth B Frames AV1
4:2:0 4:4:4 4:2:0 4:4:4

First generation, Kepler GK1xx

The first generation of NVENC, which is shared by all Kepler-based GPUs, supports H.264 high-profile (YUV420, I/P/B frames, CAVLC/CABAC), H.264 SVC Temporal Encode VCE, and Display Encode Mode (DEM).

NVidia's documentation states a peak encoder throughput of 8× realtime at a resolution of 1920×1080 (where the baseline "1×" equals 30 Hz). Actual throughput varies on the selected preset, user-controlled parameters and settings, and the GPU/memory clock frequencies. The published 8× rating is achievable with the NVENC high-performance preset, which sacrifices compression efficiency and quality for encoder throughput. The high-quality preset is considerably slower but produces fewer compression artifacts

Second generation, Maxwell GM107

Introduced with the first-generation Maxwell architecture, second generation NVENC adds support for the high-performance HP444 profile (YUV4:4:4, predictive lossless encoding), and increases encoder throughput up to 16× realtime, which corresponds to about 1080p @ 480 Hz with the high-performance preset.

Maxwell GM108 does not have NVENC hardware encoder support.

Third generation, Maxwell GM20x

Introduced with the second-generation Maxwell architecture, third generation NVENC implements the video compression algorithm High Efficiency Video Coding (a.k.a. HEVC, H.265) and also increases the H.264 encoder's throughput to cover 4K-resolution at 60 Hz (2160p60). However, it does not support B-frames for HEVC encoding (just I and P frames). The maximum NVENC HEVC coding tree unit (CU) size is 32 (the HEVC standard allows a maximum of 64), and its minimum CU size is 8.

HEVC encoding also lacks Sample Adaptive Offset (SAO). Adaptive quantization, look-ahead rate control, adaptive B-frames (H.264 only) and adaptive GOP features were added with the release of Nvidia Video Codec SDK 7.[13] These features rely on CUDA cores for hardware acceleration.

SDK 7 supports two forms of adaptive quantization; Spatial AQ (H.264 and HEVC) and Temporal AQ (H.264 only).

Nvidia's consumer-grade (GeForce) cards and some of its lower-end professional Quadro cards are restricted to three simultaneous encoding jobs. Its higher-end Quadro cards do not have this restriction.

Fourth generation, Pascal GP10x

Fourth generation NVENC implements HEVC Main10 10-bit hardware encoding. It also doubles the encoding performance of 4K H.264 & HEVC when compared to previous generation NVENC. It supports HEVC 8K, 4:4:4 chroma subsampling, lossless encoding, and sample adaptive offset (SAO).

Nvidia Video Codec SDK 8 added Pascal exclusive Weighted Prediction feature (CUDA based). Weighted prediction is not supported if the encode session is configured with B frames (H.264).

There is no B-Frame support for HEVC encoding, and the maximum CU size is 32×32.

The NVIDIA GT 1030 and the Mobile Quadro P500 are GP108 chips that don't support the NVENC encoder.[10]

In laptop graphics, NVIDIA MX Graphics do not include NVENC as they are based on a Maxwell-generation GM108 or a Pascal-generation GP108 chip.[14] The GeForce MX350 is a GP107 chip whose NVENC encoder is disabled during manufacture.

Fifth generation, Volta GV10x/Turing TU117

Volta NVENC has similar performance as Pascal's NVENC. [1]

It does not offer support for HEVC B-Frames.

In mobile graphics, as with most other GeForce MX-series graphics, the GeForce MX450 does not support NVENC as it is a TU117 chip whose hardware encoder is permanently disabled in its manufacture. The GeForce MX550, however, does support NVENC as its hardware encoder remains enabled at manufacturing level.

Sixth generation, Turing TU10x/TU116

Sixth generation NVENC implements HEVC 8K encoding at 30 FPS, HEVC B-frames and HEVC B-frames as reference (with support for each and middle modes[15]) and Alpha HEVC[16] support and provides up to 25% bitrate savings for HEVC and up to 15% bitrate savings for H.264. The initial launch of the Nvidia GeForce GTX 1650 was exempt from this generation however, as it used Volta NVENC instead of Turing. Nvidia updated the NVENC encoder of the GTX 1650 cards in 2020 to also use the Turing engine.[17] The GTX 1650 Super uses the Turing NVENC engine as it is based on the TU116 rather than the TU117 used in the original GTX 1650.[18]

Seventh generation, Ampere GA10x

Ampere has essentially the same NVENC generation engine as Turing.[19] The only substantive difference is that NVDEC received support for AV1 decoding (with film grain).

In entry-level mobile graphics, the GA107-chip-based GeForce MX570 comes in two versions, one of which (the GeForce MX570 A) has the hardware decoder and encoder permanently disabled during manufacturing.

Eighth generation, Ada Lovelace AD10x

Nvidia announced the next-gen NVENC with 8K 10-bit 60FPS AV1 fixed function hardware encoder in Ada Lovelace GPUs.[20][21]

Operating system support

The Nvidia NVENC SIP core needs to be supported by the device driver. The driver provides one or more interfaces, (e.g. OpenMAX IL) to NVENC. The NVENC SIP core can only be accessed through the proprietary NVENC API (as opposed to the open-source VDPAU API).

It is bundled with Nvidia's GeForce driver.

NVENC is available for Windows and Linux operating systems.[1] The free and open-source nouveau device driver does not support Nvidia NVENC.[22]

Application software support

  • Adobe Premiere Pro added NVENC support in version 14.2 in May 2020[23]
  • AJA Bridge Live[24]
  • Avidemux has supported NVENC since at least 2016, in H.264 or H.265[25]
  • Bandicut
  • DaVinci Resolve Studio when exporting video in H.264 or H.265[26]
  • FFmpeg has supported NVENC since 2014,[27] and is supported in Nvidia drivers[28]
  • HandBrake has supported NVENC since version 1.2.0 (December 2018), for H.264 and H.265[29]
  • HitFilm express/pro when previewing video on the timeline or exporting video in H.264 or H.265[30]
  • Jellyfin has support for NVENC for realtime transcoding[31]
  • Logitech Capture
  • MacroSystem BogartSE supports NVENC export since v9.4 (October 2017) and UHD-Disc encoding since v11.3 (October 2019)[32]
  • MediaCoder
  • Open Broadcaster Software (OBS)
  • Shotcut
  • StaxRip
  • Turbo Play
  • VidCoder
  • VideoProc

GPU throughput

Comparison of Encode Throughput[33]

Streams H.264 Encode (1080p30)
GM204(Tesla M6) 18
GM107(Tesla M10) 28
GM200(Tesla M40) 18
GM204(Tesla M60) 36
GP104(Tesla P4) 24
TU104(Tesla T4) 32
P100 36
V100 36

See also

  • Intel Quick Sync Video, Intel's equivalent SIP core
  • Video Coding Engine, AMD's equivalent SIP core till 2017
  • Video Core Next, AMD's video core which combines the functionality of Video Coding Engine and Unified Video Decoder
  • Nvidia PureVideo, hardware video decoding
  • List of all Nvidia GPUs
  • Nvidia NVDEC

References

  1. 1.0 1.1 1.2 1.3 "NVIDIA VIDEO CODEC SDK". Nvidia. August 23, 2013. https://developer.nvidia.com/nvidia-video-codec-sdk. 
  2. "Maxwell's Feature Set: Kepler Refined". AnandTech. 2014-02-18. http://www.anandtech.com/show/7764/the-nvidia-geforce-gtx-750-ti-and-gtx-750-review-maxwell/2. 
  3. "System Requirements for NVIDIA GeForce Experience | GeForce | GeForce". https://www.geforce.com/geforce-experience/system-requirements. 
  4. "Wirecast Tech Specs". https://www.telestream.net/wirecast/tech-specs.htm. 
  5. "Open Broadcaster Software – Index". https://obsproject.com/. 
  6. "nvidia-patch". GitHub. https://github.com/keylase/nvidia-patch. 
  7. Shilov, Anton (March 24, 2023). "Nvidia Lifts Some Video Encoding Limitations from Consumer GPUs". https://www.tomshardware.com/news/nvidia-increases-concurrent-nvenc-sessions-on-consumer-gpus. 
  8. https://developer.nvidia.com/video-encode-and-decode-gpu-support-matrix-new
  9. "S5613 – High-Performance Video Encoding Using NVIDIA GPUs". Nvidia. http://on-demand.gputechconf.com/gtc/2015/video/S5613.html. 
  10. 10.0 10.1 10.2 "Video Encode and Decode GPU Support Matrix" (in en). 2016-11-09. https://developer.nvidia.com/video-encode-decode-gpu-support-matrix. 
  11. 11.0 11.1 "NVIDIA VIDEO CODEC SDK" (in en). 2016-11-09. https://developer.nvidia.com/nvidia-video-codec-sdk#NVENCFeatures. 
  12. "NVIDIA Ampere Architecture In-Depth" (in en-US). 2020-05-14. https://developer.nvidia.com/blog/nvidia-ampere-architecture-in-depth/. 
  13. http://on-demand.gputechconf.com/gtc/2016/presentation/s6226-abhijit-patait-high-performance-video.pdf[bare URL PDF]
  14. "NVIDIA GeForce MX250 i MX230 – dwie "nowe" grafiki do laptopów" (in pl). Dobre Programy. 2019-02-21. https://www.dobreprogramy.pl/nvidia-geforce-mx250-i-mx230-dwie-nowe-grafiki-do-laptopow,6628559174182529a. 
  15. "B-Frame reference mode flag in ffmpeg nvenc is bugged and doesn't work. · Issue #2374 · obsproject/obs-studio" (in en). https://github.com/obsproject/obs-studio/issues/2374. 
  16. Harrison, John (January 30, 2021). "johnhe4/nvenc_h265_transparency". https://github.com/johnhe4/nvenc_h265_transparency. 
  17. "NVIDIA GeForce GTX 1650 Graphics Card" (in en-us). https://www.nvidia.com/en-us/geforce/graphics-cards/gtx-1650/. 
  18. Walton, Jarred (2019-11-23). "Nvidia GeForce GTX 1650 Super review" (in en-US). https://www.pcgamer.com/nvidia-geforce-gtx-1650-super-review/. 
  19. "NVIDIA Ampere GA102 GPU Architecture: Second-Generation RTX". https://www.nvidia.com/content/dam/en-zz/Solutions/geforce/ampere/pdf/NVIDIA-ampere-GA102-GPU-Architecture-Whitepaper-V1.pdf. 
  20. Gray, Jon (September 20, 2022). "Creativity At The Speed of Light: GeForce RTX 40 Series Graphics Cards Unleash Up To 2X Performance in 3D Rendering, AI, and Video Exports For Gamers and Creators". https://www.nvidia.com/en-us/geforce/news/rtx-40-series-and-studio-updates-for-content-creation/. 
  21. "NVIDIA Video Codec SDK". August 23, 2013. https://developer.nvidia.com/nvidia-video-codec-sdk. 
  22. "Nouveau Feature Matrix". Freedesktop.org. https://nouveau.freedesktop.org/wiki/FeatureMatrix/. 
  23. Tack, Stanley (2020-05-19). "Cut to the Video: Adobe Premiere Pro Helps Content Creators Work Faster with GPU-Accelerated Exports | NVIDIA Blog" (in en-US). https://blogs.nvidia.com/blog/2020/05/19/gpu-acceleration-adobe-premiere-pro/. 
  24. "BRIDGE LIVE is a broadcast-quality, low-latency turnkey system". https://www.aja.com/products/bridge-live. 
  25. AVIDemux Changelog
  26. "Blackmagic Forum • View topic – Davinci studio 16, NVENC unavailable". https://forum.blackmagicdesign.com/viewtopic.php?f=21&t=98035#p543912/. 
  27. "nvenc.c H.264 hardware encoding using nvidia nvenc". https://ffmpeg.org/doxygen/trunk/nvenc_8c_source.html. 
  28. "QUADRO DESKTOP/QUADRO NOTEBOOK DRIVER RELEASE 375". http://www.nvidia.com/download/driverResults.aspx/109629/en-us. 
  29. HandBrake 1.2.0 released
  30. forum post by staff member saying there is an issue with the way the software is interacting with the encoder but at the same time saying it exsits
  31. "Transcoding | Jellyfin". https://jellyfin.org/docs/general/server/transcoding. 
  32. MacroSystem Arabesk 8 info from Casablanca Expert
  33. "NVIDIA vGPU Resources for Design & VIrtualization" (in en-US). https://www.nvidia.com/en-us/data-center/virtualization/resources/. 

External links