Documentation Index
Fetch the complete documentation index at: https://mintlify.com/MicrosoftDocs/azure-ai-docs/llms.txt
Use this file to discover all available pages before exploring further.
Model Deployment
Deploy models in Microsoft Foundry using multiple options optimized for different scenarios.Deployment Methods
Serverless API Deployment
Characteristics:- Pay-per-token billing
- Microsoft-managed infrastructure
- Automatic scaling
- No capacity planning
Provisioned Throughput
Characteristics:- Reserved capacity (PTUs)
- Predictable cost and performance
- Dedicated resources
- Fungible across models
Managed Compute
Characteristics:- Deploy to Azure VMs
- Billed for VM hours
- Supports open-source models
- Full infrastructure control
Deployment Process
Regional Considerations
- Model availability varies by region
- Check Region Support
- Consider data residency requirements
- Evaluate latency for global users
Model Lifecycle
- GA: Full support and SLA
- Deprecation Notice: 6-12 months warning
- Deprecated: No new deployments
- Retired: Model unavailable