可控开源社区

michaelcjl/qwen3.5_0.8b

关注 1点赞 1复刻(Fork)

目录

michael

modify qwen3.5-0.8b

3个月前3次提交

1.jpgmodify qwen3.5-0.8b3个月前
Makefilemodify qwen3.5-0.8b3个月前
README.mdqwen3.5-0.8b3个月前
chat_template.jinjaqwen3.5-0.8b3个月前
config.jsonqwen3.5-0.8b3个月前
convert_qwen3_5_fp32.pymodify qwen3.5-0.8b3个月前
merges.txtqwen3.5-0.8b3个月前
model.safetensors.index.jsonqwen3.5-0.8b3个月前
preprocessor_config.jsonqwen3.5-0.8b3个月前
qwen3.5_run.cmodify qwen3.5-0.8b3个月前
qwen3.5_run_backup.cmodify qwen3.5-0.8b3个月前
requirements.txtqwen3.5-0.8b3个月前
run_qwen3.5.pyqwen3.5-0.8b3个月前
run_qwen3.5_0.8b.pymodify qwen3.5-0.8b3个月前
run_qwen3.5_0.8b_bak.pymodify qwen3.5-0.8b3个月前
test.pymodify qwen3.5-0.8b3个月前
test_02.pymodify qwen3.5-0.8b3个月前
tokenizer.jsonqwen3.5-0.8b3个月前
tokenizer_bpe.cqwen3.5-0.8b3个月前
tokenizer_bpe.hqwen3.5-0.8b3个月前
tokenizer_config.jsonqwen3.5-0.8b3个月前
video.mp4modify qwen3.5-0.8b3个月前
video_preprocessor_config.jsonqwen3.5-0.8b3个月前
vocab.jsonqwen3.5-0.8b3个月前

Qwen3.5-0.8B

This project provides inference implementation for Qwen3.5-0.8B model with multimodal support for images and videos.

Installation

pip install -r requirements.txt

Usage

Run the inference script:

python run_qwen3.5.py -p "Hello, introduce yourself." -n 100

Or build and run the C version:

make
./build/qwen3.5_run -m qwen3.5-0.8b.bin -p "你好，请介绍一下你自己。"

For multimodal input with images:

./build/qwen3.5_run -m qwen3.5-0.8b.bin --image image.jpg -p "Describe this image."

Architecture

Hybrid attention: Linear attention and full attention layers alternating
Gated Delta Networks for linear attention
Multimodal support with SigLIP vision encoder
24 layers, 1024 hidden size, 1280 intermediate size
Vision: 1176 hidden size, 27 blocks, patch size 14x14

Building the C version

make

Features

Text generation with Qwen3.5-0.8B
Image understanding (vision encoder integrated)
CPU inference in fp32
Hybrid attention architecture implementation
BPE tokenization

Convert safetensors to qwen3.5.bin

pip install torch transformers
python convert_qwen3_5_fp32.py --model . --out qwen3.5-0.8b.bin --verbose

然后运行 C 程序：

make
./build/qwen3.5_run -m qwen3.5-0.8b.bin -p "你好，请介绍一下你自己。"

Reference

Based on qwen3-0.6b implementation and Hugging Face Qwen3.5-0.8B model.

关于

22.8 MB

邀请码

Gitlink（确实开源）

加入我们
官网邮箱：gitlink@ccf.org.cn

QQ群

QQ群

公众号

公众号

版权所有：中国计算机学会技术支持：开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802047560号