All models
Arize AI Qwen 2 1.5B Instruct
arize-ai/qwen-2-1.5b-instruct
- Modalities
- TextText
- In / Out / 1M
- $0.12
- $0.12 out
- Context
- 33K
- tokens
- Released
- —
One model id, the best provider every call
Request arize-ai/qwen-2-1.5b-instruct and NeuralRouter scores every provider below on price, latency and health, then routes each request to the best fit — with automatic failover and unified billing.
Providers(1)
| Provider | Input /1M | Output /1M | Cache read /1M | TTFT | Throughput | Uptime | Status | |
|---|---|---|---|---|---|---|---|---|
Together AItogether/arize-ai/qwen-2-1.5b-instruct | $0.12 | $0.12 | —cached | 226 msp95 | 132 tok/s | 100.0%30d | healthy | paymentsCheapestboltFastest |
Effective pricing
Input
$0.12
Output
$0.12
Cache read
—
not offered
Best price shown (from Together AI). Each provider's full rate is in the table above — you always pay the provider the router picks.
Performance
Time to first tokenlower is better
Together AI
226 ms
Throughputhigher is better
Together AI
132 tok/s
Fastest provider responds in 226 ms to first token. Time-series charts build up as the probe runs on a schedule.
Benchmarks
No benchmark scores reported for this model by its providers.
Activity
Usage analytics appear here as requests flow through this model.
Call it from any OpenAI client
Point your base URL at NeuralRouter and request this model — we handle provider selection, failover and billing on every call.
# OpenAI-compatible — same SDK, smarter routing curl https://api.neuralrouter.ai/v1/chat/completions \ -H "Authorization: Bearer $NR_KEY" \ -d '{"model":"arize-ai/qwen-2-1.5b-instruct","messages":[{"role":"user","content":"Hello"}]}'