LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Other

tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

OtherEmerging
GitHubWebsite
Stars
—
Forks
—
Contributors
8
Last push
10mo ago

Recent commits

Latest commits.

  • clearify variables
    042acf5Alex Chi Z10mo ago
  • small fix about dim
    30b68a9Alex Chi Z10mo ago
  • update benches
    45cff24Alex Chi Z10mo ago
  • add week2day1 kv cache contents
    0b82b7fAlex Chi Z10mo ago
  • remove offset in week 1, not used
    1d7572fAlex Chi Z10mo ago
fix: resolve f-string syntax error in batch.py (#44)
4e1ccedEric Yue10mo ago
  • update readme
    cd87116Alex Chi Z10mo ago
  • qwen3 support
    ffbd15dAlex Chi Z10mo ago
  • Top contributors

    Builders behind this project.

    skyzh
    124 commits
    Connor1996
    9 commits
    minatoaquaMK2
    1 commits
    KKKZOZ
    1 commits
    magic3007
    1 commits
    shenxiangzhuang
    1 commits
    Phoenix500526
    1 commits
    shivangsharma1
    1 commits