qwen_3_5
Text-only Qwen 3.5 with hybrid (full + linear) attention layers.
Vision keys in the HF checkpoint are skipped; the loader tolerates them
by using strict=False on roundtrip.
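
As a rough illustration of what tolerant loading means here, the sketch below mimics strict=False behavior in plain Python. It is not this module's loader: the function `load_tolerant`, the dict-based "model", and key names such as `visual.patch_embed.weight` are all hypothetical, chosen only to show how checkpoint keys with no text-only counterpart can be reported and skipped rather than raising.

```python
def load_tolerant(model_params, checkpoint, strict=False):
    """Copy checkpoint entries whose names exist in model_params.

    Returns (missing, unexpected) key lists. With strict=True any
    mismatch raises; with strict=False mismatches are only reported.
    Hypothetical helper, for illustration only.
    """
    unexpected = sorted(k for k in checkpoint if k not in model_params)
    missing = sorted(k for k in model_params if k not in checkpoint)
    if strict and (missing or unexpected):
        raise KeyError(f"missing={missing}, unexpected={unexpected}")
    for name in model_params:
        if name in checkpoint:
            model_params[name] = checkpoint[name]
    return missing, unexpected


# Hypothetical key names; the real checkpoint layout may differ.
text_model = {"embed.weight": None, "layers.0.attn.weight": None}
hf_checkpoint = {
    "embed.weight": [0.1, 0.2],
    "layers.0.attn.weight": [0.3],
    "visual.patch_embed.weight": [0.4],  # vision key with no text-only slot
}

missing, unexpected = load_tolerant(text_model, hf_checkpoint)
print(missing)     # []
print(unexpected)  # ['visual.patch_embed.weight']
```

With strict=True the same call would raise because of the extra vision key, which is the failure that the tolerant setting avoids.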