The canonical source for these docs lives in the repo. Start with:
- Getting started — install, first run, what to read next
- CLI reference — every subcommand and flag
- Architecture — how a
hipfire runbecomes tokens - Models — curated tags and BYO
- Quantize — HF / safetensors / GGUF → hipfire formats
- Serve — OpenAI-compatible HTTP API
- Benchmarks — measured perf per arch
- Multi-GPU — pipeline-parallel