What is Lemonade?
Lemonade is a local AI runtime that makes it easy to run models like gpt-oss on your own hardware with privacy by default. It is optimized for fast setup, OpenAI API compatibility, and practical performance across common local acceleration stacks.
Simple setup flow that gets the local stack running quickly.
Works with many apps out-of-box and integrates in minutes.
Configures dependencies for your GPU and NPU acceleration stack.
Works with llama.cpp, Ryzen AI SW, FastFlowLM, and more.
Run more than one model at the same time on a single machine.
A consistent experience across Windows, Linux, and macOS.
Choose your gpt-oss model
Use 20B for faster local responsiveness, or 120B for deeper reasoning quality.
Optimized for lower latency and local use cases.
Perfect for everyday tasks and quick responses.
Production-ready model for high reasoning tasks.
Ideal for complex reasoning and advanced applications.
Advanced features
Both models feature OpenAI's sliding window attention and attention sink mechanisms, allowing them to handle long conversations and contexts efficiently while maintaining response quality.
Install and run gpt-oss
Set up Lemonade, download your model, and start chatting locally in minutes.
Install Lemonade
Use these quick download links to get started:
| Operating System | Downloads |
|---|---|
| Windows | lemonade.msi |
| Ubuntu | lemonade-server_latest_amd64.deb |
| macOS (beta) | Lemonade-latest-Darwin.pkg |
Other platforms? See Installation Options for Docker, Snap, Arch, Fedora, and Debian.
Run gpt-oss models
Pull and run the 20B model:
lemonade-server pull gpt-oss-20b-GGUF
lemonade-server run gpt-oss-20b-GGUF
For higher reasoning quality, pull and run 120B:
lemonade-server pull gpt-oss-120b-GGUF
lemonade-server run gpt-oss-120b-GGUF
Tip: keep models pre-downloaded to avoid startup delays:
lemonade-server pull gpt-oss-20b-GGUF
lemonade-server pull gpt-oss-120b-GGUF
System requirements
Recommended memory guidance for each gpt-oss model.
gpt-oss-20b-GGUF
About 13GB RAM is recommended for optimal performance.
gpt-oss-120b-GGUF
Requires significantly more memory for optimal performance.
Ready to get started?
Install Lemonade and run gpt-oss locally with full privacy in just a few commands.