🚀 dispatchAI Inference API

Small. Mobile. Free. UAE-built. Mobile-optimized LLM inference on real Snapdragon hardware.

✅ LIVE $0.001/1K tokens 10x cheaper than OpenAI

Base URL

https://api.dispatchai.ai/v1

Quick Start

import openai
client = openai.OpenAI(base_url="https://api.dispatchai.ai/v1", api_key="da-demo-key-0001")
response = client.chat.completions.create(
    model="dispatchAI/SmolLM2-135M-Instruct-mobile",
    messages=[{"role": "user", "content": "What is the capital of France?"}]
)
print(response.choices[0].message.content)
# → "The capital of France is Paris."

Available Models

Model	Size	Phone Speed
dispatchAI/SmolLM2-135M-Instruct-mobile	101MB	46 t/s
dispatchAI/Qwen2.5-0.5B-Instruct-mobile-int4	469MB	23 t/s
dispatchAI/Llama-3.2-1B-Instruct-Q4-mobile	770MB	5.4 t/s

Pricing

Type	Price
Input	$0.001/1K tokens
Output	$0.002/1K tokens

Dispatch AI (FZE) — Sharjah Free Zone, UAE — License No. 10818