AI providers & models supported by GPT for Work

GPT for Work supports models from Anthropic, Azure, DeepSeek, Google, Mistral, OpenAI, OpenRouter, Perplexity, and xAI. GPT for Work also supports open-source models through Ollama and any OpenAI-compatible API endpoint. The tables below show which models you can use with and without an API key, in which GPT for Work add-ons, and at what price.

Models you can use without an API key

You can use models from OpenAI, Google, Anthropic, and Perplexity.

Models you can use with an API key

You can use models from OpenAI, Perplexity, Google, Anthropic, OpenRouter, DeepSeek, Mistral, Azure, xAI, and open-source models through Ollama and any OpenAI-compatible API endpoint.

You pay the API cost directly to the AI provider.

Models available through dedicated endpoints (Azure, Ollama, other local servers and cloud-based platforms) are currently free for personal use. Contact us to use GPT for Work with dedicated endpoints for professional use.

Notes

Legal considerations

Ensure the AI models you use comply with your local laws and suit your needs.

DeepSeek should not be used in regions where its use is prohibited. When its use is allowed, avoid sharing personal, confidential, or sensitive information.

Web search models

Some models can search the web for up-to-date information to use as context when generating responses. Moreover, some models allow you to choose how much context they gather. This is measured as context size, which ranges from low to high. Larger context sizes allow models to retrieve more information from each web source, which can produce richer and more nuanced responses.

In addition to the regular token cost, web search models incur a separate search cost when you do not use an API key. The search cost is billed per 1,000 searches. For models that allow you to choose the amount of context, larger context sizes consume more of your balance.

The following table lists the context size costs for each supported web search model when not using an API key, billed per 1,000 searches.

Provider	Model	Low	Medium	High
Google	gemini-2.5-flash	$35.00 (fixed context size)
Perplexity	sonar	$5.00	$8.00	$12.00
Perplexity	sonar-pro	$6.00	$10.00	$14.00

Reasoning models

Reasoning models split output tokens into completion tokens and reasoning tokens. Completion tokens are the tokens that make up the model's answer, while reasoning tokens are additional tokens generated during the model's reasoning process. You are billed for both types of tokens.

Custom endpoints

You can use any OpenAI-compatible API endpoint with GPT for Work. You can connect to two main types of services:

Cloud-based LLM platforms provide access to models over the internet with no software installation or setup required on your part. Popular examples include Anyscale, Fireworks AI, and Together AI. The available models vary from platform to platform.
Local LLM servers run on a local machine, such as your own computer or another computer on a local network. Popular examples include LM Studio, LocalAI, and Open WebUI. The available models depend on what's installed on the server you're using.

What's next

tip

The tables on this page were created with Awesome Table Apps.

Models you can use without an API key​

Models you can use with an API key​

Notes​

Legal considerations​

Web search models​

Reasoning models​

Custom endpoints​

What's next​