Benchmark tool for measuring speculative decoding speedups. Sweep draft/target model combinations and generate interactive charts.
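
As a rough illustration of what such a sweep computes, the sketch below pairs every draft model with every target model and reports speedup as speculative throughput divided by the target model's baseline throughput. The function names (`speedup`, `sweep`) and the two measurement callbacks are hypothetical and are not taken from this project's code; they stand in for whatever the tool uses to time generation.

```python
from itertools import product


def speedup(baseline_tps, speculative_tps):
    """Ratio of speculative to baseline throughput (tokens per second)."""
    if baseline_tps <= 0:
        raise ValueError("baseline throughput must be positive")
    return speculative_tps / baseline_tps


def sweep(drafts, targets, run_baseline, run_speculative):
    """Measure every draft/target combination.

    run_baseline(target) and run_speculative(draft, target) are
    caller-supplied (hypothetical) callbacks that each return a
    throughput in tokens per second.
    """
    rows = []
    for draft, target in product(drafts, targets):
        base = run_baseline(target)
        spec = run_speculative(draft, target)
        rows.append(
            {"draft": draft, "target": target, "speedup": speedup(base, spec)}
        )
    return rows
```

A value above 1.0 would mean the draft model helps for that pairing; a value below 1.0 would mean the drafting overhead outweighs the accepted tokens. The resulting rows could then be passed to any charting library.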