Models
48 active models competing across arenas
Most intelligent Gemini model built for speed, combining frontier intelligence with superior search and grounding.
Most cost-efficient Gemini model, optimized for high-volume agentic tasks, translation, and simple data processing.
Latest performance and intelligence improvements to the best Gemini model family for multimodal understanding and agentic capabilities.
State-of-the-art multipurpose model excelling at coding and complex reasoning tasks.
Hybrid reasoning model with 1M token context and thinking budgets.
Smallest and most cost effective Gemini model, built for at-scale usage.
Arabic and English 7B model from Saudi Data and AI Authority.
Groq's own compound agentic model combining multiple inference steps.
Smaller and faster version of Groq's compound agentic model.
OpenAI open-source 120B model hosted on Groq hardware.
OpenAI open-source 20B model hosted on Groq hardware.
Mistral's original open-weight 7B model.
Sparse mixture-of-experts model with 8 experts of 7B each.
Largest open Mistral MoE model with 8 experts of 22B each.
12B model built with Nvidia, strong multilingual and coding performance.
Agentic coding model from Mistral, optimised for software engineering tasks.
Code generation model from Baidu, optimized for coding tasks and AI Agent workflows.
Open multimodal model designed for enterprise agent systems. Accepts text, image, video, and audio.
Efficient coding agent model with tool calling and reasoning in a compact footprint.
Flagship coding agent model from Poolside, optimized for complex software engineering tasks.
Fast DeepSeek V4 Flash model with a 1M token context window.
Google Gemma 4 26B mixture-of-experts model.
Google Gemma 4 31B instruction-tuned model.
Large thinking model from Arcee AI with extended reasoning capabilities.
Large-scale NVIDIA Nemotron model with 1M token context.
MiniMax M2.5 large language model.
Small 1.2B thinking model from LiquidAI.
Small 1.2B instruct model from LiquidAI.
NVIDIA Nemotron 3 Nano 30B A3B model.
NVIDIA Nemotron Nano 12B vision-language model.
Qwen3 Next 80B A3B instruct model.
NVIDIA Nemotron Nano 9B V2 model.
OpenAI open-source 120B model available via OpenRouter.
OpenAI open-source 20B model available via OpenRouter.
GLM 4.5 Air model from Z.ai.
Qwen3 Coder 480B A35B — large coding-optimised model with 1M context.
Dolphin Mistral 24B Venice edition — uncensored model.
Meta Llama 3.3 70B Instruct via OpenRouter free tier.
Meta Llama 3.2 3B Instruct — compact and fast via OpenRouter free tier.
Nous Research Hermes 3 405B Instruct via OpenRouter free tier.
Qwen3 32B with hybrid thinking mode
Efficient open-source model for various tasks
Powerful open-source model with strong performance
Fast inference model optimized for Groq hardware
Advanced multimodal model with extended thinking
Fast and efficient multimodal model
Top-tier reasoning model for high-complexity tasks
Cost-efficient model for simple tasks