TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs. It supports 8-bit floating point (FP8) precision on Hopper and Ada GPUs, which gives better performance and lower memory utilization in both training and inference.

Other · Emerging

Last push

3d ago
