LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Python

tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

PythonEmerging
GitHubWebsite
Stars
—
Forks
—
Contributors
8
Last push
8mo ago

Recent commits

Latest commits.

  • update dev-tools.py to fix --force in copy-test (#70)
    b6a3b00yangpeng9mo ago
  • Add tests for week 2, day 6 - continuous batching (#69)
    6635e4aEric Zhang9mo ago
  • Day 6, task 1 tests - RoPE with multiple offsets (#68)
    136ad7fEric Zhang9mo ago
  • CI workflow for pdm setup, build and testing refsol (#67)
    308388eEric Zhang9mo ago
  • Bump mlx to >=0.27 and fix build-ext from week 1, day 7 (#66)
    1f2ab12Eric Zhang9mo ago
update writeup progress
26aa2ffAlex Chi Z9mo ago
  • fix simple kv cache decoding (#65)
    1fc0752Alex Chi Z9mo ago
  • add chunked prefill and continuous batching writeup (#64)
    1449816Alex Chi Z9mo ago
  • Top contributors

    Builders behind this project.

    skyzh
    129 commits
    Connor1996
    9 commits
    ekzhang
    5 commits
    jiengup
    5 commits
    58191554
    4 commits
    KKKZOZ
    3 commits
    minatoaquaMK2
    1 commits
    magic3007
    1 commits