Foundry Toolkit provides comprehensive support for a wide variety of generative AI models.
Within the model catalog, you can explore and utilize models from multiple hosting sources:
Deploy models directly to Foundry from within the model catalog, streamlining your workflow.
Use Microsoft Foundry, Foundry Local, and GitHub models added to Foundry Toolkit with GitHub Copilot. For more information, check out Changing the model for chat.

To find a model in the model catalog:
Select the Foundry Toolkit view in the Activity Bar
Select Developer Tools > Discover > Model Catalog to open the model catalog
Use the filters to reduce the list of available models.
Text Attachment, Image Attachment, Web Search, Structured Outputs, and more.Browse the models in different categories, such as:
Alternatively, use the search box to find a specific model by name or description
Depending on your starting point, there are different flows for adding a model to work with in Foundry Toolkit.
To add a model from the model catalog:
Locate the model you want to add in the model catalog
The flow for adding models is slightly different based on the providers:
Microsoft Foundry: Deploy a model in your Microsoft Foundry project. Refer to Deploy a model to Microsoft Foundry for detailed instructions.
Custom Model that is hosted remotely, requires an API key, and has an OpenAI chat completion compatible endpoint URL. Refer to the Add a custom model for detailed instructions.
Foundry Local: Foundry Local downloads and runs the model, which might take a few minutes depending on your internet speed. The model is available on a localhost page and added to Foundry Toolkit. Learn more in What is Foundry Local?.
Ollama: The model is downloaded from Ollama and added to Foundry Toolkit. Refer to the Add Ollama models for detailed instructions.
GitHub: Foundry Toolkit asks for your GitHub credentials to access the model repository. Once authenticated, the model is added directly into Foundry Toolkit.
Foundry Toolkit now supports GitHub pay-as-you-go models, so you can keep working after passing free tier limits.
ONNX: To add an ONNX model, first convert it to the Foundry Toolkit model format using the model conversion tool. After conversion, add the model to Foundry Toolkit.
Once a model is added, the model appears under MY RESOURCES/Models in the tree view, and you can use it in the Playground or Agent Builder.
Deploy a model to Microsoft Foundry directly from Foundry Toolkit. Run the model in the cloud and access it via an endpoint.
From the model catalog, select the model you want to deploy.
Select Deploy to Microsoft Foundry, either from the dropdown menu or directly from the Deploy to Microsoft Foundry button, as in the following screenshot:

In the model deployment tab, enter the required information, such as the model name, description, and any other settings, as in the following screenshot:

Select Deploy to Microsoft Foundry to start the deployment process.
Confirm the deployment by reviewing the details and selecting Deploy to proceed.
Once the deployment is complete, the model is available in the MY RESOURCES > Your project name > Models section of Foundry Toolkit, and you can use it in the playground or agent builder.
For self-hosted or deployed models accessible from the internet with an OpenAI compatible endpoint, add it to Foundry Toolkit for use in the playground.
There are two ways to add a custom model:
In the Model Catalog, choose the + Bring Your Own Model button

In the Model Catalog, scroll to the "Add Custom Models" section, and choose the Add a Custom Model button.

A dialog appears prompting you for the OpenAI compatible endpoint URL, model name, API key and other required information.

Ollama enables many popular genAI models to run locally with CPU via GGUF quantization. If Ollama is installed on your local machine with downloaded Ollama models, add them to Foundry Toolkit for use in the model playground.
Prerequisites for using Ollama models in the Microsoft Foundry Toolkit for Visual Studio Code extension is Ollama (Tested on Ollama v0.4.1).
To add local Ollama into the Foundry Toolkit
From one of the entrypoints mentioned previously, select Add Ollama Model.
MY RESOURCES > Local Resources, select the + button next to Models.

This opens the add model selector. Choose "Add Ollama Model".

In the Model Catalog, scroll down to the "Local Models" section, and select the "Ollama" tab. Choose the Add button next to a model listed there, or select Add your own model.

Select Continue after reading the acknowledgement that Ollama is a third-party model provider.

Next, select Select models from Ollama library. This displays the models you have installed in Ollama. Use the checkboxes to select the ones you want to use with Foundry Toolkit.

Foundry Toolkit only shows models that are already downloaded in Ollama and not yet added to Foundry Toolkit. To download a model from Ollama, you can run ollama pull <model-name>. To see the list of models supported by Ollama, see the Ollama library or refer to the Ollama documentation.
Or, if you start the Ollama runtime at a different endpoint, choose Provide custom Ollama endpoint to specify an Ollama endpoint.
You should now see one or more selected Ollama models in the list of models in the tree view.
Attachment isn't supported yet for Ollama models. Foundry Toolkit connects to Ollama using the OpenAI compatible endpoint and doesn't support attachments yet.
You can test a model in the playground for chat completions.
Use the actions on the model card in the model catalog:
You can manage your models in the MY RESOURCES/Models section of the Foundry Toolkit sidebar:
View the list of models added to Foundry Toolkit.
Right-click on a model to access options such as:
Right-click on ONNX section title to access options such as:
Some models require a publisher or hosting-service license and account to sign-in. In that case, before you can run the model in the model playground, you're prompted to provide this information.
In this article, you learned how to: