FasterTransformer
Transformer-related optimizations, including BERT and GPT.
Other
Emerging
GitHub
Stars: —
Forks: —
Contributors: 8
Last push: 36mo ago
Recent commits
Latest commits.
Support size_per_head=112 (#660) | 7777ff1 | Daya Khudia | 36mo ago
fix: swap tensor bug (#683) | eb9b81b | Perkz Zheng | 36mo ago
[bugfix] Fix 2-shot All Reduce correctness issue (indexing bug). (#672) | 1cf9b51 | Rahul Kindi | 37mo ago
Fix/gpt early stop (#584) | c6e8f60 | byshiue | 38mo ago
perf(bloom): improve performance of huggingface_bloom_convert.py, decrease the time cost and the mem using (#568) | 19b2956 | 杨睿 | 38mo ago
[Enhancement]create huggingface_gptneox_convert.py (#569) | 3460e20 | ADLIBS | 38mo ago
Update unfused_attention_kernels.cu | d7ccf83 | byshiue | 39mo ago
fix overflow in softmax_kernel when process long seqlen and big batch_size (#524) | adb21c3 | zhangxin81 | 39mo ago
Top contributors
Builders behind this project.
byshiue: 135 commits
daemyung: 7 commits
PerkzZheng: 4 commits
andabi: 3 commits
842974287: 3 commits
yuanzhedong: 3 commits
zhang-ge-hao: 3 commits
BestJuly: 2 commits