AI Models at Okara
Current as of Sep 10, 2025
Okara's comprehensive model directory: know exactly which AI models are available and what each can do.
| Model Name | Purpose | Company | Credit Usage | Chat |
|---|---|---|---|---|
| Qwen 2.5 | Qwen2.5 has demonstrated top-tier performance across a wide range of benchmarks evaluating language understanding, reasoning, mathematics, coding, and human preference alignment. | Alibaba | | |
| Qwen 3 | Qwen3 235B A22B Instruct 2507: a mixture-of-experts LLM with strong math and reasoning capabilities. | Alibaba | | |
| Qwen 3 Coder | Qwen3 Coder 480B is a specialized programming model designed for ultra-efficient agentic code generation, with long context and state-of-the-art performance. | Alibaba | | |
| Qwen 3 Next | A new generation of open-source, non-thinking-mode models powered by Qwen3. This version demonstrates superior Chinese text understanding, stronger logical reasoning, and enhanced text generation over the previous iteration (Qwen3-235B-A22B-Instruct-2507). | Alibaba | | |
| Qwen 3 Next Thinking | A new generation of Qwen3-based open-source thinking-mode models. This version offers improved instruction following and more concise summary responses than the previous iteration (Qwen3-235B-A22B-Thinking-2507). | Alibaba | | |
| Qwen 3 VL Instruct | The Qwen3 VL series has been comprehensively upgraded in areas such as visual coding and spatial perception. Its visual perception and recognition capabilities are significantly improved, it supports understanding of ultra-long videos, and its OCR has been substantially enhanced. | Alibaba | | |
| Qwen Image/Edit | Qwen image generation and editing model. | Alibaba | | |
| Claude Sonnet 4.5 | Anthropic's Claude Sonnet 4.5 model. | Anthropic | | |
| Flux 2 Dev | Flux 2 Dev image generation model. | Black Forest Labs | | |
| Deepseek Reasoner | DeepSeek-R1 is a state-of-the-art reasoning model optimized for general reasoning tasks, math, science, and code generation. | DeepSeek | | |
| Deepseek V3.1 | DeepSeek V3.1 is an open-source, hybrid Mixture-of-Experts (MoE) model from DeepSeek AI, featuring 671 billion total parameters, 37 billion active parameters per query, and a 128k token context window (see the note on total vs. active parameters after the table). | DeepSeek | | |
| Deepseek V3.2 | DeepSeek-V3.2: a state-of-the-art model optimized for general tasks, math, science, and code generation, featuring a 128k token context window. | DeepSeek | | |
| Deepseek V3.2 Speciale | DeepSeek-V3.2-Speciale: pushes the boundaries of reasoning capabilities (thinking mode only). | DeepSeek | | |
| Deepseek V3.2 Thinking | DeepSeek-V3.2-Thinking: a state-of-the-art model optimized for reasoning tasks, math, science, and code generation, featuring a 128k token context window. | DeepSeek | | |
| Llama 3.3 | An upgraded version of Llama 3.1 70B featuring enhanced reasoning, tool use, and multilingual abilities, along with a significantly expanded 128K context window. These improvements make it well suited for demanding tasks such as long-form summarization, multilingual conversation, and coding assistance. | Meta | | |
| Llama 4 Maverick | Llama 4 Maverick 17B-128E is Llama 4's largest and most capable model. It uses a Mixture-of-Experts (MoE) architecture and early fusion to provide coding, reasoning, and image capabilities. | Meta | | |
| Llama 4 Scout | Llama-4-Scout-17B-16E-Instruct is a state-of-the-art, instruction-tuned, multimodal model developed by Meta as part of the Llama 4 family. It handles both text and image inputs, making it suitable for a wide range of applications, including conversational AI, code generation, and visual reasoning. | Meta | | |
| Minimax M2 | MiniMax-M2 redefines efficiency for agents. It is a compact, fast, and cost-effective MoE model (230 billion total parameters with 10 billion active parameters) built for elite performance in coding and agentic tasks while maintaining strong general intelligence. | MiniMax | | |
| Ministral 3B | A compact, efficient model for on-device tasks like smart assistants and local analytics, offering low-latency performance. Part of the Mistral 3 family. | Mistral AI | | |
| Ministral 8B | A powerful small model with faster, memory-efficient inference, ideal for complex workflows and demanding edge applications. Part of the Mistral 3 family, with multimodal capabilities. | Mistral AI | | |
| Mistral Large 3 | Mistral's most capable model, with 41B active parameters (675B total) in a sparse mixture-of-experts architecture. Excels at multilingual conversation, image understanding, and general reasoning tasks. | Mistral AI | | |
| Mistral Small | Mistral-Small-3.2 is a 24-billion-parameter open-source language model and an incremental update to its predecessor, 3.1. It features improved instruction following, reduced repetitive outputs, and enhanced performance on coding and STEM tasks. | Mistral AI | | |
| Kimi K2 | Kimi K2 0905 has shown strong performance on agentic tasks thanks to its tool calling, reasoning abilities, and long-context handling. As a very large model (1T parameters), it is also resource-intensive; running it in production requires a highly optimized inference stack to avoid excessive latency. | Moonshot AI | | |
| Kimi K2 Thinking | Kimi K2 Thinking is an advanced open-source thinking model by Moonshot AI. It can execute up to 200–300 sequential tool calls without human intervention, reasoning coherently across hundreds of steps to solve complex problems. Built as a thinking agent, it reasons step by step while using tools, achieving state-of-the-art performance on Humanity's Last Exam (HLE), BrowseComp, and other benchmarks, with major gains in reasoning, agentic search, coding, writing, and general capabilities. | Moonshot AI | | |
| GPT-5.1 | OpenAI's GPT-5.1 model. | OpenAI | | |
| GPT-OSS 120B | Excels at efficient reasoning across science, math, and coding applications. Ideal for real-time coding assistance, processing large documents for Q&A and summarization, agentic research workflows, and regulated on-premises workloads. | OpenAI | | |
| GPT-OSS 20B | A compact, open-weight language model optimized for low-latency and resource-constrained environments, including local and edge deployments. | OpenAI | | |
| Diffusion 3.5 Large | Stability AI's Stable Diffusion 3.5 Large image generation model. | Stability AI | | |
| Z Image Turbo | Z Image Turbo image generation model. | Alibaba | | |
| intellect 3 | INTELLECT-3 delivers state-of-the-art performance for its size across math, code, and reasoning. | Unknown | | |
| GLM 4.5 Air | GLM-4.5-Air is built as a foundational model for agent-oriented applications, leveraging a Mixture-of-Experts (MoE) architecture with a streamlined design of 106B total parameters and 12B active parameters. | Zhipu AI | | |
| GLM 4.6 | GLM-4.6 achieves comprehensive enhancements across multiple domains, including real-world coding, long-context processing, reasoning, searching, writing, and agentic applications. | Zhipu AI | | |
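Several entries above quote both a total and an active parameter count. In a Mixture-of-Experts model, only a small subset of expert weights is routed to for each token, so per-token compute tracks the active count while memory footprint tracks the total count. The snippet below is an illustrative calculation (not an Okara feature) using only the figures quoted in the table:

```python
# Active-parameter fraction for the MoE models whose descriptions quote
# both figures above (values in billions of parameters).
moe_models = {
    "DeepSeek V3.1": (671, 37),
    "MiniMax M2": (230, 10),
    "Mistral Large 3": (675, 41),
    "GLM 4.5 Air": (106, 12),
}

for name, (total_b, active_b) in moe_models.items():
    # Only the active parameters participate in each forward pass;
    # the full set still has to be held in memory.
    print(f"{name}: {active_b}B of {total_b}B active per token "
          f"({active_b / total_b:.1%})")
```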
Showing 32 AI models
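The directory itself does not document how these models are invoked. As an illustration only, the sketch below assumes Okara exposes an OpenAI-compatible chat completions endpoint; the base URL, environment variables, endpoint path, and model identifier are hypothetical placeholders, not documented Okara values.

```python
# Hypothetical sketch: calling a chat-capable model from the directory through
# an OpenAI-compatible chat completions endpoint. The base URL, API key
# variable, and model identifier below are assumptions, not documented values.
import os

import requests

OKARA_BASE_URL = os.environ.get("OKARA_BASE_URL", "https://api.okara.example/v1")  # placeholder
OKARA_API_KEY = os.environ["OKARA_API_KEY"]  # placeholder credential


def chat(model: str, prompt: str) -> str:
    """Send a single-turn prompt to a chat model and return its reply text."""
    response = requests.post(
        f"{OKARA_BASE_URL}/chat/completions",
        headers={"Authorization": f"Bearer {OKARA_API_KEY}"},
        json={
            "model": model,
            "messages": [{"role": "user", "content": prompt}],
        },
        timeout=60,
    )
    response.raise_for_status()
    return response.json()["choices"][0]["message"]["content"]


if __name__ == "__main__":
    # "qwen-3-coder" is an illustrative identifier; actual model IDs may differ.
    print(chat("qwen-3-coder", "Write a function that reverses a string."))
```

Image models such as Flux 2 Dev or Stable Diffusion 3.5 Large would use a different endpoint and payload; the same caveat about hypothetical details applies.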