
Arize AI Qwen 2 1.5B Instruct

arize-ai/qwen-2-1.5b-instruct

Modalities: Text → Text
Price (input / output, per 1M tokens): $0.12 / $0.12
Context: 33K tokens

One model ID, the best provider on every call

Request arize-ai/qwen-2-1.5b-instruct and NeuralRouter scores every provider below on price, latency and health, then routes each request to the best fit — with automatic failover and unified billing.
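The page does not say how providers are scored, so the following is only a minimal sketch of the general idea: rank healthy providers by a weighted mix of price and latency, then fall back down the list when a call fails. The `Provider` fields, the weights, the `score` formula, and the `hypothetical-b` provider used in the usage note are all assumptions for illustration, not NeuralRouter's actual algorithm.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Provider:
    name: str
    price_per_1m: float  # USD per 1M tokens
    ttft_ms: float       # time to first token, in milliseconds
    healthy: bool


def score(p: Provider, w_price: float = 0.5, w_latency: float = 0.5) -> float:
    """Lower is better. Weights and scaling are illustrative assumptions."""
    return w_price * p.price_per_1m + w_latency * (p.ttft_ms / 1000.0)


def rank(providers: List[Provider]) -> List[Provider]:
    """Healthy providers, best score first; the tail is the failover order."""
    return sorted((p for p in providers if p.healthy), key=score)


def route(providers: List[Provider], call: Callable[[Provider], str]) -> Tuple[str, str]:
    """Try providers in ranked order, moving to the next one when a call raises."""
    last_error = None
    for p in rank(providers):
        try:
            return p.name, call(p)
        except Exception as exc:  # any provider failure triggers failover
            last_error = exc
    raise RuntimeError("no healthy provider succeeded") from last_error
```

With the single listed provider (Together AI at $0.12 and 226 ms) plus a made-up `hypothetical-b`, `route` would try Together AI first and only reach `hypothetical-b` if that call raised.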

Providers (1)

Provider: Together AI (together/arize-ai/qwen-2-1.5b-instruct)
Input /1M: $0.12
Output /1M: $0.12
Cache read /1M: cached
TTFT: 226 ms (p95)
Throughput: 132 tok/s
Uptime: 100.0% (30d)
Status: healthy
Badges: Cheapest, Fastest

Effective pricing

Input: $0.12
Output: $0.12
Cache read: not offered

Best price shown (from Together AI). Each provider's full rate is in the table above — you always pay the provider the router picks.
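As a rough illustration of what these rates mean per request, the sketch below converts token counts into dollars using the listed $0.12 per 1M tokens for both input and output. The function name and constants are illustrative, and the actual bill depends on the provider the router picks and on how that provider counts tokens.

```python
# Listed rates for arize-ai/qwen-2-1.5b-instruct (USD per 1M tokens).
INPUT_USD_PER_1M = 0.12
OUTPUT_USD_PER_1M = 0.12


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_1M + output_tokens * OUTPUT_USD_PER_1M
    return total / 1_000_000
```

For example, a request with 2,000 input tokens and 500 output tokens comes to about $0.0003 at these rates.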

Performance

Time to first token (lower is better)
Together AI: 226 ms
Throughput (higher is better)
Together AI: 132 tok/s

The fastest provider returns its first token in 226 ms. Time-series charts fill in as the probe runs on its schedule.

Benchmarks

No benchmark scores reported for this model by its providers.

Activity

Usage analytics appear here as requests flow through this model.

Call it from any OpenAI client

Point your base URL at NeuralRouter and request this model — we handle provider selection, failover and billing on every call.

# OpenAI-compatible — same SDK, smarter routing
curl https://api.neuralrouter.ai/v1/chat/completions \
  -H "Authorization: Bearer $NR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"arize-ai/qwen-2-1.5b-instruct","messages":[{"role":"user","content":"Hello"}]}'
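The same request can be sketched in Python using only the standard library. The endpoint, model ID, and bearer-token header come from the curl command above; the helper names `build_request` and `send` are illustrative, and the response parsing assumes the usual OpenAI-compatible shape (`choices[0].message.content`), which this page does not spell out.

```python
import json
import urllib.request

API_URL = "https://api.neuralrouter.ai/v1/chat/completions"
MODEL_ID = "arize-ai/qwen-2-1.5b-instruct"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the POST request that mirrors the curl command."""
    body = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(req: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the first reply's text.

    Assumes an OpenAI-compatible response body.
    """
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        payload = json.load(resp)
    return payload["choices"][0]["message"]["content"]
```

A typical call would be `send(build_request(os.environ["NR_KEY"], "Hello"))` after `import os`, with `NR_KEY` holding the API key as in the curl example.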