LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Other

FasterTransformer

Transformer related optimization, including BERT, GPT

OtherEmerging

Stars

—

Forks

—

Contributors

8

Last push

36mo ago

Recent commits

Latest commits.

Support size_per_head=112 (#660)
7777ff1Daya Khudia36mo ago
fix: swap tensor bug (#683)
eb9b81bPerkz Zheng36mo ago
[bugfix] Fix 2-shot All Reduce correctness issue (indexing bug). (#672)
1cf9b51Rahul Kindi37mo ago
Fix/gpt early stop (#584)
c6e8f60byshiue38mo ago
perf(bloom): improve performance of huggingface_bloom_convert.py, decrease the time cost and the mem using (#568)
19b2956杨睿38mo ago

[Enhancement]create huggingface_gptneox_convert.py (#569)

3460e20ADLIBS38mo ago

Update unfused_attention_kernels.cu

d7ccf83byshiue39mo ago

fix overflow in softmax_kernel when process long seqlen and big batch_size (#524)

adb21c3zhangxin8139mo ago

Top contributors

Builders behind this project.