Microsoft AI Toolkit for VS Code

Overview

The AI Toolkit for Visual Studio Code is a VS Code extension that simplifies generative AI app development by bringing together cutting-edge AI development tools and models from various catalogs. It supports running AI models locally or connecting to remote models via API keys.

Demo Video

▶️ Watch on YouTube

Expectations

We have found that most LLMs work well with this application.

However, the Inference Parameters option is not fully supported, as Lemonade Server currently does not accept those as inputs (see server_spec.md for details).

Setup

Prerequisites

Install Lemonade Server by following the Lemonade Server Instructions and using the installer .exe.

Install AI Toolkit for VS Code

Open the Extensions tab in VS Code Activity Bar.
Search for "AI Toolkit for Visual Studio Code" in the Extensions Marketplace search bar.
Select the AI Toolkit extension and click install.

This will add an AI Toolkit icon to your VS Code Activity Bar.

Connect Lemonade to AI Toolkit

The AI Toolkit now supports "Bring Your Own Model" functionality, allowing you to connect to models served via the OpenAI API standard, which Lemonade uses.

Open the AI Toolkit tab in your VS Code Activity Bar.
In the right corner of the "My Models" section, click the "+" button to "Add model for remote inference".
Select "Add a custom model".
When prompted to "Enter OpenAI chat completion endpoint URL" enter:
```
http://localhost:8000/api/v1/chat/completions
```
When prompted to "Enter the exact model name as in the API" select a model (e.g., Phi-3-Mini-Instruct-Hybrid)
- Note: You can get a list of all models available here.
Select the same name as the display model name.
Skip the HTTP authentication step by pressing "Enter".

Usage

Once you've set up the Lemonade model in AI Toolkit, you can:

Use the AI Playground tool to directly interact with your added model.
Use the Prompt Builder tool to craft effective prompts for your AI models.
Use the Bulk Run tool to compute responses for custom datasets and easily visualize those responses on a table format.
Use the Evaluation tool to quickly assess your model's coherence, fluency, relevance, and similarity, as well as to compute BLEU, F1, GLEU, and Meteor scores.