Qwen 2 72B Instruct

qwen/qwen-2-72b-instruct

About Qwen 2 72B Instruct

Qwen2 72B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning.

It features SwiGLU activation, attention QKV bias, and grouped-query attention. It is pretrained on extensive data and then post-trained with supervised fine-tuning and direct preference optimization.
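To make two of the architecture terms above concrete, here is a minimal pure-Python sketch of SwiGLU gating and of how grouped-query attention maps query heads onto shared key/value heads. The function names and the head counts in the usage note are illustrative assumptions, not values taken from the Qwen2 implementation.

```python
import math


def silu(x: float) -> float:
    """SiLU (swish) activation: x * sigmoid(x)."""
    return x / (1.0 + math.exp(-x))


def swiglu(gate: list[float], up: list[float]) -> list[float]:
    """SwiGLU gating: SiLU(gate) multiplied elementwise with up.

    In a feed-forward block, `gate` and `up` would be two separate
    linear projections of the same input vector.
    """
    if len(gate) != len(up):
        raise ValueError("gate and up must have the same length")
    return [silu(g) * u for g, u in zip(gate, up)]


def kv_head_for_query_head(q_head: int, n_q_heads: int, n_kv_heads: int) -> int:
    """Grouped-query attention: consecutive query heads share one K/V head."""
    if n_q_heads % n_kv_heads != 0:
        raise ValueError("n_q_heads must be a multiple of n_kv_heads")
    if not 0 <= q_head < n_q_heads:
        raise ValueError("q_head out of range")
    group_size = n_q_heads // n_kv_heads
    return q_head // group_size
```

With a hypothetical layout of 8 query heads and 2 key/value heads, query heads 0 to 3 read from K/V head 0 and query heads 4 to 7 read from K/V head 1, which is what shrinks the key/value cache relative to standard multi-head attention.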

For more details, see the Qwen2 blog post and GitHub repo.

Usage of this model is subject to the Tongyi Qianwen LICENSE AGREEMENT.

Specifications

Context Length: 32,768 tokens

Tokenizer: Qwen
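As a rough illustration of what the context length means in practice, the sketch below computes how many completion tokens remain for a given prompt size. It assumes the 32,768-token window is shared between prompt and completion, which this page does not state explicitly, and that token counts come from the Qwen tokenizer. The function name is hypothetical.

```python
CONTEXT_LENGTH = 32_768  # from the Specifications above


def max_completion_tokens(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return the completion budget left after the prompt.

    ASSUMPTION: prompt and completion tokens share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(0, context_length - prompt_tokens)
```

For example, a 30,000-token prompt would leave room for at most 2,768 completion tokens under this assumption, and a prompt longer than the window leaves none.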

Pricing

Prompt: 0.899

Completion: 0.899

Image: 0

Request: 0
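The pricing figures can be turned into a simple cost estimate. This page does not state the unit behind the prices, so the sketch below assumes each value is a price per one million tokens; that assumption is exposed as the `tokens_per_unit` parameter so it can be changed. The function name is hypothetical.

```python
PROMPT_PRICE = 0.899      # from the Pricing section
COMPLETION_PRICE = 0.899  # from the Pricing section

# ASSUMPTION: prices are quoted per one million tokens; the page gives no unit.
TOKENS_PER_UNIT = 1_000_000


def estimate_cost(
    prompt_tokens: int,
    completion_tokens: int,
    tokens_per_unit: int = TOKENS_PER_UNIT,
) -> float:
    """Estimate the cost of one request from its token counts."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = prompt_tokens * PROMPT_PRICE + completion_tokens * COMPLETION_PRICE
    return total / tokens_per_unit
```

Because the prompt and completion prices are equal, only the total token count affects the estimate. The image and request prices are both 0, so they are left out of the calculation.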

Last updated: 4/11/2025