AnythingLLM is a versatile local LLM platform that allows you to chat with your documents and code using a variety of models. It supports the OpenAI-compatible API interface, allowing easy integration with local servers like Lemonade.
This guide will help you configure AnythingLLM to use Lemonade’s OpenAI-compatible server, and utilize the powerful @agent
capability to interact with documents, webpages, and more.
Lemonade integrates best with AnythingLLM when using models such as Qwen-1.5-7B-Chat-Hybrid
and Llama-3.2-1B-Instruct-Hybrid
, both of which support a context length of up to 3,000 tokens.
Keep in mind that when using the @agent
feature, multi-turn conversations can quickly consume available context. As a result, the number of back-and-forth turns in a single conversation may be limited due to the growing context size.
Setting | Value |
---|---|
Base URL | http://localhost:8000/api/v1 |
API Key | - |
Chat Model Name | Qwen-1.5-7B-Chat-Hybrid |
Token context window | 3000 |
Max Tokens | 3000 |
Agents are capable of scraping websites, listing and summarizing documents, searching the web, creating charts, and even saving files to your desktop or their own memory.
To start an agent session, simply go to any workspace and type @agent <your prompt>
. To exit the session, just type exit
.
You may turn on and off specific Agent Skills
by going to your Workspace Settings
→ Agent Configuration
→ Configure Agent Skills
.
Available agent skills include:
Here are some examples on how you can interact with Anything LLM agents:
@agent My name is Dr Lemon. Remember this in our next conversation
@agent What is my name according to your memory?
@agent Scrape this website and tell me what are the two ways of installing lemonade https://github.com/lemonade-sdk/lemonade/blob/main/docs/server/README.md
@agent Search the web for the best place to buy shoes
You can find more details about agent usage here.