Batch-invariant operations for deterministic LLM inference on Apple Silicon using MLX
Latest commits.
Builders behind this project.