免费AI模型

探索所有可在OpenRouter上免费使用的AI模型

所有免费模型 (28)

Baidu Qianfan: CoBuddy (free)

CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool...

openrouter

Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....

nvidia

NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

poolside

Poolside: Laguna XS.2 (free)

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...

poolside

Poolside: Laguna M.1 (free)

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K...

deepseek

DeepSeek: DeepSeek V4 Flash (free)

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

google

Google: Gemma 4 26B A4B (free)

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...

google

Google: Gemma 4 31B (free)

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

arcee-ai

Arcee AI: Trinity Large Thinking (free)

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...

google

Google: Lyria 3 Pro Preview

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...

google

Google: Lyria 3 Clip Preview

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...

nvidia

NVIDIA: Nemotron 3 Super (free)

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

minimax

MiniMax: MiniMax M2.5 (free)

MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...

openrouter

Free Models Router

The simplest way to get free inference. openrouter/free is a router that selects free models at random from the models available on OpenRouter. The router smartly filters for models that...

liquid

LiquidAI: LFM2.5-1.2B-Thinking (free)

LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks, data extraction, and RAG—while still running comfortably on edge devices. It supports long context (up to 32K tokens) and is...

liquid

LiquidAI: LFM2.5-1.2B-Instruct (free)

LFM2.5-1.2B-Instruct is a compact, high-performance instruction-tuned model built for fast on-device AI. It delivers strong chat quality in a 1.2B parameter footprint, with efficient edge inference and broad runtime support.

nvidia

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

nvidia

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

qwen

免费AI模型

所有免费模型 (28)

Baidu Qianfan: CoBuddy (free)

Owl Alpha

NVIDIA: Nemotron 3 Nano Omni (free)

Poolside: Laguna XS.2 (free)

Poolside: Laguna M.1 (free)

DeepSeek: DeepSeek V4 Flash (free)

Google: Gemma 4 26B A4B (free)

Google: Gemma 4 31B (free)

Arcee AI: Trinity Large Thinking (free)

Google: Lyria 3 Pro Preview

Google: Lyria 3 Clip Preview

NVIDIA: Nemotron 3 Super (free)

MiniMax: MiniMax M2.5 (free)

Free Models Router

LiquidAI: LFM2.5-1.2B-Thinking (free)

LiquidAI: LFM2.5-1.2B-Instruct (free)

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA: Nemotron Nano 12B 2 VL (free)

Qwen: Qwen3 Next 80B A3B Instruct (free)

NVIDIA: Nemotron Nano 9B V2 (free)

OpenAI: gpt-oss-120b (free)

OpenAI: gpt-oss-20b (free)

Z.ai: GLM 4.5 Air (free)

Qwen: Qwen3 Coder 480B A35B (free)

Venice: Uncensored (free)

Meta: Llama 3.3 70B Instruct (free)

Meta: Llama 3.2 3B Instruct (free)

Nous: Hermes 3 405B Instruct (free)