Skip to main content
Azure

Azure AI Foundry Models pricing

Empower enterprises to access top-tier models, deploy with confidence and optimize for performance

Azure AI Foundry is a unified platform to design, customize, manage, and support AI applications and agents. We have a wide choice of over 11,000+ models covering a broad range of foundation models, reasoning models, multimodal models, industry models and domain specific models integrated within Azure's enterprise-grade AI stack.

Explore pricing options

Apply filters to customize pricing options to your needs.

Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on the type of agreement entered with Microsoft, date of purchase, and the currency exchange rate. Prices are calculated based on US dollars and converted using London closing spot rates that are captured in the two business days prior to the last business day of the previous month end. If the two business days prior to the end of the month fall on a bank holiday in major markets, the rate setting day is generally the day immediately preceding the two business days. This rate applies to all transactions during the upcoming month. Sign in to the Azure pricing calculator to see pricing based on your current program/offer with Microsoft. Contact an Azure sales specialist for more information on pricing or to request a price quote. See frequently asked questions about Azure pricing.

Serverless Deployment

Azure AI Foundry Models continues to grow with cutting-edge additions—including models from OpenAI, DeepSeek, xAI’s Grok, Meta’s Llama, Mistral AI, FLUX by Black Forest Labs, and more. These models are fully hosted and managed on Azure, and available through both pay-as-you-go and provisioned throughput options. With Provisioned Throughput, you can even flex capacity across multiple models, such as OpenAI and DeepSeek, for greater efficiency and control.

Phi

Language models

Models Context Input (Per 1,000 tokens) Output (Per 1,000 tokens)
Phi-3-mini-4k-instruct 4K $- $-
Phi-3-mini-128k-instruct 128K $- $-
Phi-3.5-mini-instruct 128K $- $-
Phi-3-small-8k-instruct 8K $- $-
Phi-3-small-128k-instruct 128K $- $-
Phi-3-medium-4k-instruct 4K $- $-
Phi-3-medium-128k-instruct 128K $- $-
Phi-3.5-MoE-instruct 128K $- $-
Phi-4 128K $- $-
Phi-4-mini 128K $- $-
Phi-4-multimodal, text and image 128K $- $-
Phi-4-multimodal, audio 128K $- $-
Phi-4-mini-reasoning 128K $- $-
Phi-4-reasoning 32K $- $-
Phi-4-reasoning-plus 32K $- $-

Fine-tuning models

Models Training per 1,000 tokens Hosting per hour Input Usage per 1,000 tokens Output Usage per 1,000 tokens
Phi-3-mini-4k-instruct $- $- $- $-
Phi-3-mini-128k-instruct $- $- $- $-
Phi-3.5-mini-instruct $- $- $- $-
Phi-3-medium-4k-instruct $- $- $- $-
Phi-3-medium-128k-instruct $- $- $- $-
Phi-3.5-MoE-instruct $- $- $- $-
Phi-4 $- $- $- $-
Phi-4-mini $- $- $- $-

DeepSeek is a family of large language models (LLMs) developed by the DeepSeek AI research team. These models are designed to perform a wide range of natural language processing (NLP) and reasoning tasks, with a focus on open-source accessibility, performance optimization, and multi-modal capabilities.

Language models

Models Input (Per 1,000 tokens) Output (Per 1,000 tokens)
DeepSeek R1 Global $- $-
DeepSeek R1 DataZone $- $-
DeepSeek R1 Regional $- $-
DeepSeek V3 Global $- $-
DeepSeek V3 Regional $- $-
DeepSeek V3 0324 Global $- $-
DeepSeek V3 0324 Regional $- $-

Provisioned

Models Min PTU PTU Hourly Pricing PTU Monthly Reservation Pricing
DeepSeek R1 Global 100 $- $-
DeepSeek V3 Global 100 $- $-
DeepSeek V3 0324 Global 100 $- $-
Models Input (Per 1,000 tokens) Output (Per 1,000 tokens)
MAI-DS-R1 Global $- $-
MAI-DS-R1 Regional $- $-

Provisioned

Models Min PTU PTU Hourly Pricing PTU Monthly Reservation Pricing
MAI-DS-R1 Global 100 $- $-
Models Input (Per 1,000 tokens) Output (Per 1,000 tokens)
Grok-3 Global $- $-
Grok-3 DataZone $- $-
Grok-3 Regional $- $-
Grok-3 Mini Global $- $-
Grok-3 Mini DataZone $- $-
Grok-3 Mini Regional $- $-
Models Input (Per 1,000 tokens) Output (Per 1,000 tokens)
Llama 3.3 70B Datazone $- $-
Llama 3.3 70B Global $- $-
Llama 3.3 70B Regional $- $-
Llama4 Maverick 17B Datazone $- $-
Llama4 Mavrick 17B Global $- $-
Llama4 Maverick 17B Regional $- $-

Azure pricing and purchasing options

Connect with us directly

Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimization and request a custom proposal.

Talk to a sales specialist

See ways to purchase

Purchase Azure services through the Azure website, a Microsoft representative, or an Azure partner.

Explore your options

Additional resources

Azure AI Foundry Models

Learn more about Azure AI Foundry Models features and capabilities.

Pricing calculator

Estimate your expected monthly costs for using any combination of Azure products.

Documentation

Review technical tutorials, videos, and more Azure AI Foundry Models resources.

Talk to a sales specialist for a walk-through of Azure pricing. Understand pricing for your cloud solution.

Get free cloud services and a $200 credit to explore Azure for 30 days.

Added to estimate. Press 'v' to view on calculator
Can we help you?