CUDA-MSST-Infer
HuanLinOTO/CUDA-MSST-Infer
Cuda
Emerging
Stars: 1
Forks: n/a
Contributors: 1
Last push: 2mo ago
Recent commits
Latest commits.
refactor: extract the Web UI from inline HTML into a standalone file; improve readability and the file-selection button
d96ad1c
HuanLinOTO
2mo ago
feat: add runtime controls for CUDA Graph, model persistence, and precision mode; improve memory management and localize web UI
fb89b9b
HuanLinOTO
2mo ago
perf: add CUDA Graph caching for attention, optimize single-chunk mode and band processing
8acee3c
HuanLinOTO
2mo ago
feat: enhance profiling capabilities with detailed timing statistics for attention and feedforward layers
34800bd
HuanLinOTO
2mo ago
perf: add profiling and runtime tuning controls
e524f5a
HuanLinOTO
2mo ago
feat: add embedded HTTP inference server mode
ee6acc1
HuanLinOTO
2mo ago
fix: align conversion and inference paths
50b01b5
HuanLinOTO
2mo ago
perf: drastically reduce binary/artifact size
ecc0735
HuanLinOTO
3mo ago
Top contributors
Builders behind this project.
HuanLinOTO
14 commits