TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs. It includes support for 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada, and Blackwell GPUs, which provides better performance and lower memory utilization in both training and inference.
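
As a rough illustration of the memory side of that claim, the sketch below estimates how much storage model weights alone would need at different precisions. It is back-of-the-envelope arithmetic and does not use the TransformerEngine API; the names `BITS_PER_VALUE` and `weight_bytes` are hypothetical, and the estimate ignores scaling factors, activations, and optimizer state.

```python
# Bits used to store one value in each format (weights only).
BITS_PER_VALUE = {"FP32": 32, "BF16": 16, "FP8": 8, "FP4": 4}


def weight_bytes(num_params: int, fmt: str) -> int:
    """Return the bytes needed to store num_params values in format fmt.

    Hypothetical helper for illustration: it rounds up to whole bytes and
    ignores per-tensor or per-block scaling metadata.
    """
    bits = BITS_PER_VALUE[fmt]
    return (num_params * bits + 7) // 8


if __name__ == "__main__":
    n = 7_000_000_000  # assumed example size: a 7-billion-parameter model
    for fmt in BITS_PER_VALUE:
        print(f"{fmt}: {weight_bytes(n, fmt) / 2**30:.1f} GiB")
```

Under these assumptions, each halving of the bit width halves the weight storage, so FP8 needs a quarter of the FP32 footprint and FP4 an eighth. Real savings depend on which tensors are actually kept in low precision.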

Other | Emerging

Last push

5mo ago