Small. Mobile. Free. UAE-built. Mobile-optimized LLM inference on real Snapdragon hardware.
✅ LIVE $0.001/1K tokens 10x cheaper than OpenAI
https://api.dispatchai.ai/v1
import openai
client = openai.OpenAI(base_url="https://api.dispatchai.ai/v1", api_key="da-demo-key-0001")
response = client.chat.completions.create(
model="dispatchAI/SmolLM2-135M-Instruct-mobile",
messages=[{"role": "user", "content": "What is the capital of France?"}]
)
print(response.choices[0].message.content)
# → "The capital of France is Paris."
| Model | Size | Phone Speed |
|---|---|---|
| dispatchAI/SmolLM2-135M-Instruct-mobile | 101MB | 46 t/s |
| dispatchAI/Qwen2.5-0.5B-Instruct-mobile-int4 | 469MB | 23 t/s |
| dispatchAI/Llama-3.2-1B-Instruct-Q4-mobile | 770MB | 5.4 t/s |
| Type | Price |
|---|---|
| Input | $0.001/1K tokens |
| Output | $0.002/1K tokens |
Dispatch AI (FZE) — Sharjah Free Zone, UAE — License No. 10818