Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Cloud models · Ollama
Cloud models on Ollama.
  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools thinking audio cloud e2b e4b 26b 31b

    567.2K  Pulls 17  Tags Updated  10 hours ago

  • minimax-m2.7

    MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

    tools thinking cloud

    51.6K  Pulls 1  Tag Updated  2 weeks ago

  • qwen3.5

    Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

    vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

    4.8M  Pulls 58  Tags Updated  3 days ago

  • qwen3-coder-next

    Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

    tools cloud

    1M  Pulls 4  Tags Updated  1 month ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools thinking cloud 2b 4b 8b 30b 32b 235b

    2.9M  Pulls 59  Tags Updated  5 months ago

  • ministral-3

    The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

    vision tools cloud 3b 8b 14b

    847K  Pulls 16  Tags Updated  3 months ago

  • devstral-small-2

    24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

    vision tools cloud 24b

    742.2K  Pulls 6  Tags Updated  3 months ago

  • nemotron-3-super

    NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

    tools thinking cloud 120b

    186.4K  Pulls 7  Tags Updated  3 weeks ago

  • qwen3-next

    The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

    tools thinking cloud 80b

    483.8K  Pulls 10  Tags Updated  3 months ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

    vision tools thinking cloud

    219.9K  Pulls 1  Tag Updated  2 months ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    158.7K  Pulls 1  Tag Updated  1 month ago

  • rnj-1

    Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.

    tools cloud 8b

    421.6K  Pulls 6  Tags Updated  3 months ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    tools thinking cloud

    154.2K  Pulls 1  Tag Updated  1 month ago

  • nemotron-3-nano

    Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

    tools thinking cloud 4b 30b

    348.3K  Pulls 9  Tags Updated  2 weeks ago

  • devstral-2

    123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

    tools cloud 123b

    176.4K  Pulls 6  Tags Updated  3 months ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    vision tools thinking cloud

    118.7K  Pulls 2  Tags Updated  3 months ago

  • cogito-2.1

    The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

    cloud 671b

    151.8K  Pulls 6  Tags Updated  4 months ago

  • glm-4.7

    Advancing the Coding Capability

    tools thinking cloud

    82.9K  Pulls 1  Tag Updated  3 months ago

  • deepseek-v3.2

    DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

    tools thinking cloud

    69K  Pulls 1  Tag Updated  3 months ago

  • minimax-m2

    MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

    tools thinking cloud

    95.6K  Pulls 1  Tag Updated  5 months ago

© 2026 Ollama
Blog Contact