Ollama Cloud
activeOllama's hosted cloud inference — free tier includes $10/month in shared inference credits with access to 400+ models. OpenAI-compatible API, no credit card required.
https://ollama.com/v1
Avg Latency
—
90-day Uptime
—%
Rate Limits
— RPM / — RPD
Sign-up Required
Yes
Info
Base URL https://ollama.com/v1
Sign-up Required Yes
Credit Card Not required
Context Window 128K
Last Verified 2026-03-30
openai-compatiblevisionfunction-call
Models (9)
deepseek-v3.2
CTX: 128K
kimi-k2.5
CTX: 128K
gemma3:27b
CTX: 128K
gemma3:12b
CTX: 128K
gemma3:4b
CTX: 128K
mistral-large-3:675b
CTX: 128K
gpt-oss:120b
CTX: 128K
gemini-3-flash-preview
CTX: 1M
qwen3-vl:235b
CTX: 128K
Quick Start
export API_KEY="your_api_key_here"
curl https://ollama.com/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $API_KEY" \
-d '{
"model": "deepseek-v3.2",
"messages": [
{"role": "user", "content": "Hello! How are you?"}
]
}'