Loreon
Labs
Platform
Docs
Home
Ecosystems
C++
FasterTransformer
Transformer related optimization, including BERT, GPT
C++
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
38mo ago
Recent commits
Latest commits.
perf(bloom): improve performance of huggingface_bloom_convert.py, decrease the time cost and the mem using (#568)
19b2956
杨睿
38mo ago
[Enhancement]create huggingface_gptneox_convert.py (#569)
3460e20
ADLIBS
38mo ago
Update unfused_attention_kernels.cu
d7ccf83
byshiue
39mo ago
fix overflow in softmax_kernel when process long seqlen and big batch_size (#524)
adb21c3
zhangxin81
39mo ago
Update cublasMMWrapper.cc
c6ba315
byshiue
39mo ago
Update cublasMMWrapper.cc
a6ef7af
byshiue
39mo ago
[Enhancement]add pytorch backend support for gptneox (#550)
169b8df
ADLIBS
39mo ago
Update T5DecodingWeight.cc
0c12805
byshiue
39mo ago
Top contributors
Builders behind this project.
byshiue
134 commits
daemyung
7 commits
andabi
3 commits
842974287
3 commits
yuanzhedong
3 commits
zhang-ge-hao
3 commits
PerkzZheng
3 commits
BestJuly
2 commits