A public, anonymous Cloudflare R2 mirror of MLX-quantised LLM weights — the same files rapid-mlx pull downloads under the hood. Browse the catalog, copy the install command, and have a model serving on your Mac in minutes.
$ rapid-mlx pull qwen3.5-4b-4bit
Not sure which model? Pick your Mac RAM tier; we'll surface the alias that gives the best single-user throughput at that footprint. The full catalog is below.
Every alias rapid-mlx ships with, joined against what is currently mirrored on R2. Files are immutable per HuggingFace revision and cached aggressively at the edge.