Distributed, secure multi-VM LLM inference system on AWS featuring a private subnet Python ML worker (Gemma-3), a public Bun API Gateway, and RPC orchestration via iii
Latest commits.
Builders behind this project.