Mistral
Use models in a few clicks with our platform. Download our open models for deep access.
Mixtral is a powerful and fast model adaptable to many use-cases. While being 6x faster, it matches or outperform Llama 2 70B on all benchmarks, speaks many languages, has natural coding abilities. It handles 32k sequence length. You can use it through our API, or deploy it yourself (it’s Apache 2.0!).
Pricing: Per token usage
Mistral Alternatives
Explore 76 products in the Inference APIs category. View all Mistral alternatives.
Lyceum
European GPU cloud for serverless inference, training, and on-demand GPU clusters
vLLM
High-throughput LLM inference engine with PagedAttention for efficient GPU memory usage
Work on Mistral? Feature it at the top of Inference APIs.
Is your product missing?