gguf-py : fix Qwen3-Embedding eos token #14314

CISC · 2025-06-21T06:18:47Z

Qwen3-Embedding sets a different EOS token than defined in tokenizer_config.json using TemplateProcessing, so we override that if detected (and try to shift EOS onto EOT/EOM just in case).

fix Qwen3-Embedding eos token

41b098f

CISC requested a review from compilade June 21, 2025 06:18

CISC linked an issue Jun 21, 2025 that may be closed by this pull request

Feature Request: fix handling of Qwen3-Embedding-0.6B input to add EOS token #14252

Closed

4 tasks

typings fix

2336167

github-actions bot added the python python script changes label Jun 21, 2025

nit [no ci]

2fe5eb3

compilade approved these changes Jun 21, 2025

View reviewed changes

CISC merged commit aa0ef5c into master Jun 21, 2025
1 check passed

CISC deleted the cisc/fix-qwen3-embedding-eos branch June 21, 2025 16:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gguf-py : fix Qwen3-Embedding eos token #14314

gguf-py : fix Qwen3-Embedding eos token #14314

Uh oh!

CISC commented Jun 21, 2025

Uh oh!

Uh oh!

Uh oh!

gguf-py : fix Qwen3-Embedding eos token #14314

gguf-py : fix Qwen3-Embedding eos token #14314

Uh oh!

Conversation

CISC commented Jun 21, 2025

Uh oh!

Uh oh!

Uh oh!