Qwen: Qwen3 VL 32B Instruct Cost Calculator

Model: Qwen: Qwen3 VL 32B Instruct, Context: 262144, Cost: $0.35 per 1M input tokens, $1.1 per 1M output tokens

Note: Calculation is approximate based on public data. Prices may change, check official websites.

Welcome to the Qwen3 VL 32B Instruct Cost Calculator. This tool helps you estimate the expenses associated with utilizing the Qwen3-VL-32B-Instruct model for your projects.

The Qwen3-VL-32B-Instruct is a powerful multimodal vision-language model, leveraging 32 billion parameters to deliver high-precision understanding and reasoning across text, images, and video. It combines deep visual perception with advanced text comprehension, excelling in fine-grained spatial reasoning, comprehensive document and scene analysis, and long-horizon video understanding. This model features robust OCR across 32 languages and enhanced multimodal fusion through Interleaved-MRoPE and DeepStack architectures, making it optimized for agentic interaction and visual tool use to tackle complex real-world multimodal tasks with state-of-the-art performance.

The model has a context window of 262,144 tokens.

Current pricing for Qwen3 VL 32B Instruct is:

  • Input tokens: $0.35 per 1,000,000 tokens
  • Output tokens: $1.1 per 1,000,000 tokens

Calculation Formula:

The total cost is calculated using the following formula:

Total Cost = ((Number of Input Tokens / 1,000,000) × $0.35 + (Number of Output Tokens / 1,000,000) × $1.1) × Number of Requests

Example:

If you process 1,000,000 input tokens and generate 500,000 output tokens in a single request:

  • Input Cost = (1,000,000 / 1,000,000) × $0.35 = $0.35
  • Output Cost = (500,000 / 1,000,000) × $1.1 = $0.55
  • Total Cost = ($0.35 + $0.55) × 1 = $0.90

Use this calculator to quickly estimate your potential costs for various token usages.