Qwen3.5-2B

Qwen3.5-2B is a 2-billion-parameter vision-language model built on a hybrid Gated DeltaNet architecture, which interleaves linear-attention layers with full softmax-attention layers at a 3:1 ratio. It has a native context length of 262K tokens and supports both thinking and non-thinking modes.
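
To make the 3:1 ratio concrete, here is a minimal sketch of how such a layer schedule could be laid out. It is an illustration only, not the model's actual configuration: the function name `build_layer_pattern`, the depth of 24 layers, the layer-type labels, and the choice to place the full-attention layer last in each group of four are all assumptions.

```python
from collections import Counter


def build_layer_pattern(num_layers: int, linear_per_full: int = 3) -> list[str]:
    """Return one attention-type label per layer (hypothetical helper).

    Within each group of (linear_per_full + 1) layers, the last layer uses
    full softmax attention and the others use linear attention.
    The position of the full-attention layer is an assumption.
    """
    period = linear_per_full + 1
    return [
        "full_attention" if (i + 1) % period == 0 else "linear_attention"
        for i in range(num_layers)
    ]


# 24 is a placeholder depth chosen for illustration, not the real layer count.
pattern = build_layer_pattern(num_layers=24)
counts = Counter(pattern)

print(pattern[:4])
print(counts["linear_attention"], counts["full_attention"])
```

With any depth that is a multiple of four, this schedule yields three linear-attention layers for every full-attention layer, so only a quarter of the layers pay the quadratic cost of softmax attention over the full context.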