Arize AI Qwen 2 1.5B Instruct

arize-ai/qwen-2-1.5b-instruct

Modalities: Text → Text
Price per 1M tokens: $0.12 input; $0.12 output
Context: 33K tokens
Released: —

One model ID, the best provider on every call

Request arize-ai/qwen-2-1.5b-instruct and NeuralRouter scores every provider below on price, latency and health, then routes each request to the best fit — with automatic failover and unified billing.
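The page does not describe how the scoring works, so the following is only an illustrative sketch of what "score on price, latency and health, then pick the best fit" could look like. The `Provider` class, the `pick_provider` function, and the equal weights are hypothetical and are not NeuralRouter's actual algorithm.

```python
from dataclasses import dataclass


@dataclass
class Provider:
    name: str
    price_per_1m: float  # USD per 1M tokens
    ttft_ms: float       # time to first token, in milliseconds
    healthy: bool


def pick_provider(providers, price_weight=0.5, latency_weight=0.5):
    """Return the healthy provider with the lowest weighted score, or None.

    Hypothetical scoring: price and latency are each normalized by the
    largest value among healthy candidates, then combined with weights.
    """
    candidates = [p for p in providers if p.healthy]
    if not candidates:
        return None
    max_price = max(p.price_per_1m for p in candidates) or 1.0
    max_ttft = max(p.ttft_ms for p in candidates) or 1.0

    def score(p):
        return (price_weight * p.price_per_1m / max_price
                + latency_weight * p.ttft_ms / max_ttft)

    return min(candidates, key=score)


# The single provider listed on this page.
providers = [Provider("Together AI", 0.12, 226, True)]
best = pick_provider(providers)
print(best.name)
```

With only one provider listed, the choice is trivial; unhealthy providers are filtered out before scoring, which is one simple way to model failover.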

Providers (1)

Provider	Provider model ID	Input /1M	Output /1M	Cache read /1M	TTFT (p95)	Throughput	Uptime (30d)	Status	Tags
Together AI	together/arize-ai/qwen-2-1.5b-instruct	$0.12	$0.12	—	226 ms	132 tok/s	100.0%	healthy	Cheapest, Fastest

Effective pricing

Input: $0.12 per 1M tokens
Output: $0.12 per 1M tokens
Cache read: — (not offered)

Best price shown (from Together AI). Each provider's full rate is in the table above. You always pay the rate of the provider the router picks.
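As a rough illustration of what these rates mean per request, the sketch below estimates cost from token counts. The function name and the example token counts are assumptions chosen for illustration; only the $0.12 per 1M rates come from this page.

```python
# Rates from this page, in USD per 1M tokens.
INPUT_USD_PER_1M = 0.12
OUTPUT_USD_PER_1M = 0.12


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_1M + output_tokens * OUTPUT_USD_PER_1M
    return total / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
cost = estimate_cost_usd(2_000, 500)
print(f"${cost:.6f}")
```

Because input and output are priced the same, the estimate depends only on the total token count: 2,500 tokens at $0.12 per 1M comes to $0.0003.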

Performance

Time to first token (lower is better)
Together AI: 226 ms

Throughput (higher is better)
Together AI: 132 tok/s

The fastest provider returns its first token in 226 ms. Time-series charts fill in as the probe runs on its schedule.
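The two figures can be combined into a back-of-the-envelope estimate of total response time: time to first token plus output length divided by throughput. This is a simplification that assumes a steady generation rate and ignores network variance; the function name and the example output length are hypothetical.

```python
# Probe figures from this page for Together AI.
TTFT_SECONDS = 0.226
THROUGHPUT_TOK_PER_S = 132.0


def estimate_response_seconds(output_tokens):
    """Rough total time: wait for the first token, then stream the rest."""
    return TTFT_SECONDS + output_tokens / THROUGHPUT_TOK_PER_S


# Hypothetical completion of 264 tokens: 0.226 s + 264 / 132 s.
estimate = estimate_response_seconds(264)
print(f"{estimate:.3f} s")
```

For short completions the 226 ms time to first token dominates; for long ones the 132 tok/s throughput does.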

Benchmarks

No benchmark scores reported for this model by its providers.

Activity

Usage analytics appear here as requests flow through this model.

Call it from any OpenAI client

Point your base URL at NeuralRouter and request this model — we handle provider selection, failover and billing on every call.

# OpenAI-compatible: same SDK, smarter routing
curl https://api.neuralrouter.ai/v1/chat/completions \
  -H "Authorization: Bearer $NR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"arize-ai/qwen-2-1.5b-instruct","messages":[{"role":"user","content":"Hello"}]}'
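The same call can be sketched in Python using only the standard library. The helper names `build_chat_request` and `send` are hypothetical, and the response parsing assumes the standard OpenAI chat-completions shape (`choices[0].message.content`), which this page implies but does not show. The request is only sent when the `NR_KEY` environment variable is set.

```python
import json
import os
import urllib.request

BASE_URL = "https://api.neuralrouter.ai/v1"
MODEL = "arize-ai/qwen-2-1.5b-instruct"


def build_chat_request(prompt, api_key):
    """Build (but do not send) a chat-completions POST request."""
    body = json.dumps({
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the assistant's reply text.

    Assumes an OpenAI-style response body.
    """
    with urllib.request.urlopen(request, timeout=30) as response:
        payload = json.load(response)
    return payload["choices"][0]["message"]["content"]


if __name__ == "__main__" and os.environ.get("NR_KEY"):
    print(send(build_chat_request("Hello", os.environ["NR_KEY"])))
```

Separating request construction from sending keeps the payload easy to inspect and test without making a network call.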