Chinese replies randomly contain Thai/Russian/Arabic characters when serving Qwen3-30B-A3B-Instruct (AWQ 4bit) with vLLM

#2
by monsterbeasts - opened

When calling the model via the OpenAI Chat Completions API—even with prompts that require Chinese-only output or emoji-only—the response intermittently inserts snippets of Thai, Russian, Arabic, etc., within sentences or after emojis (e.g., “你好…(ธนาคาร остальн)… الأساسية”). The main Chinese text is correct but mixed with foreign scripts, appearing like garbled output.
image.png

image.png

Sign up or log in to comment