Get started with Avian

See details on API, Subscription and Finetune pricing.

Avian API Model Pricing

High-performance AI models at competitive prices

Model Name Est. Speed Context Length Tool Calling Input Priceper million tokens Output Priceper million tokens
Meta Llama 3.1 405B Instruct Enterprise ~ 130 tok/s 131,072 $3.00 $3.00
Meta Llama 3.3 70B Instruct Professional ~ 200 tok/s 131,072 $0.90 $0.90
Meta Llama 3.1 8B Instruct Starter ~ 450 tok/s 131,072 $0.20 $0.20

Enterprise-Grade Performance

Avian API offers competitive pricing for all models, at some of the highest speeds on the market by leveraging speculative decoding and running on the latest Nvidia H200 SXM GPUs. We have production grade capacity for all the models we serve, allowing usage with no rate limits to support you as you scale.