Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.
Latest commits.
Builders behind this project.