Running agents locally with Lemonade and AnythingLLM

Overview

AnythingLLM is a versatile local LLM platform that allows you to chat with your documents and code using a variety of models. It supports the OpenAI-compatible API interface, allowing easy integration with local servers like Lemonade.

This guide will help you configure AnythingLLM to use Lemonade's OpenAI-compatible server, and utilize the powerful @agent capability to interact with documents, webpages, and more.

Expectations

Lemonade integrates best with AnythingLLM when using models such as Qwen-1.5-7B-Chat-Hybrid and Llama-3.2-1B-Instruct-Hybrid, both of which support a context length of up to 3,000 tokens.

Keep in mind that when using the @agent feature, multi-turn conversations can quickly consume available context. As a result, the number of back-and-forth turns in a single conversation may be limited due to the growing context size.

Setup

Prerequisites

Install Lemonade Server by following the Lemonade Server Instructions and using the installer .exe.
Install and set up AnythingLLM from their GitHub or website.

Configure AnythingLLM to Use Lemonade

In the bottom of the left menu, click on the wrench icon to "Open Settings".
Under the menu "AI Providers", click "LLM".
Select "Generic OpenAI" and enter the following info:

Setting Value

Base URL http://localhost:8000/api/v1

API Key -

Chat Model Name Qwen-1.5-7B-Chat-Hybrid

Token context window 3000

Max Tokens 3000
In the bottom left, click the back button to exit.
In the left menu, click "New Workspace" and give it a name.
Where you see your new workspace, click the gear icon to open the "Workspace Settings"
In the top menu of the window that opens, click on "Agent Configuration"
Under Chat Settings, select Generic OpenAI and click save.
Under Workspace Agent LLM Provider, select "Generic OpenAI" and click save.

Usage with @agent

Overview

Agents are capable of scraping websites, listing and summarizing documents, searching the web, creating charts, and even saving files to your desktop or their own memory.

To start an agent session, simply go to any workspace and type @agent <your prompt>. To exit the session, just type exit.

Agent Skills

You may turn on and off specific Agent Skills by going to your Workspace Settings → Agent Configuration → Configure Agent Skills.

Available agent skills include:

RAG & long-term memory
View and summarize documents
Scrape Websites
Generate & save files to browser
Generate Charts
Web Search
SQL Connector

Examples

Here are some examples on how you can interact with Anything LLM agents:

Rag & long-term memory
- @agent My name is Dr Lemon. Remember this in our next conversation
- Then, on a follow up chat you can ask @agent What is my name according to your memory?
Scrape Websites
- @agent Scrape this website and tell me what are the two ways of installing lemonade https://github.com/lemonade-sdk/lemonade/blob/main/docs/server/README.md
Web Search (enable skill before trying)
- @agent Search the web for the best place to buy shoes

You can find more details about agent usage here.

Setting	Value
Base URL	`http://localhost:8000/api/v1`
API Key	`-`
Chat Model Name	`Qwen-1.5-7B-Chat-Hybrid`
Token context window	`3000`
Max Tokens	`3000`

Running agents locally with Lemonade and AnythingLLM

Overview

Expectations

Setup

Prerequisites

Configure AnythingLLM to Use Lemonade

Usage with @agent

Overview

Agent Skills

Examples

Additional Resources