An enhanced version of tattn's LocalLLMClient, with Qwen 3 VL support (both Llama Cpp and MLX), updated Llama Cpp, and Inference Performance tooling.
Latest commits.
Builders behind this project.