
Fix Qwen3.5-0.8B WebGPU optimization config#388

Open
xiaofeihan1 wants to merge 4 commits into microsoft:main from xiaofeihan1:fix/qwen3.5-webgpu-config

Conversation

@xiaofeihan1

No description provided.

Copilot AI review requested due to automatic review settings April 29, 2026 08:13
@xiaofeihan1
Author

Hi @apsonawane PTAL, thanks!

Contributor

Copilot AI left a comment


Pull request overview

This PR updates the Qwen3.5-0.8B “builtin” WebGPU Olive workflow configs to explicitly target a WebGPU-enabled local system, and adjusts the optimization script’s genai_config patching.

Changes:

  • Add systems.local_system (WebGpuExecutionProvider) and set "target": "local_system" in the WebGPU embedding/text/vision Olive JSON configs.
  • Update optimize.py to use vision_session_options when setting the decoder session_options.
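The first bullet corresponds to an Olive config block along these lines (a sketch based on Olive's documented system schema; the exact keys in the PR's diff are not shown on this page):

```json
{
  "systems": {
    "local_system": {
      "type": "LocalSystem",
      "accelerators": [
        { "device": "gpu", "execution_providers": ["WebGpuExecutionProvider"] }
      ]
    }
  },
  "target": "local_system"
}
```

With `"target": "local_system"`, the workflow runs against the locally defined WebGPU system instead of Olive's default target.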

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File | Description
Qwen-Qwen3.5-0.8B/builtin/webgpu/vision.json | Adds WebGPU system definition and selects it via target.
Qwen-Qwen3.5-0.8B/builtin/webgpu/text.json | Adds WebGPU system definition and selects it via target.
Qwen-Qwen3.5-0.8B/builtin/webgpu/embedding.json | Adds WebGPU system definition and selects it via target.
Qwen-Qwen3.5-0.8B/builtin/optimize.py | Changes which session options are written into genai_config.json for the decoder.


Comment on lines 56 to +58
  vision_session_options = {"log_id": "onnxruntime-genai", "provider_options": vision_provider_options}

- config["model"]["decoder"]["session_options"] = session_options
+ config["model"]["decoder"]["session_options"] = vision_session_options

Copilot AI Apr 29, 2026


decoder.session_options is being set to vision_session_options. For device == "gpu", vision_provider_options explicitly disables CUDA graph capture, so this change unintentionally applies the vision/embedding workaround to the decoder too (and also hides the fact that session_options is otherwise undefined). Create a separate session_options based on provider_options and assign that to the decoder, keeping vision_session_options only for vision/embedding.
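Copilot's suggestion can be sketched as follows. This is a minimal illustration, not the PR's actual code: the contents of `provider_options` and `vision_provider_options` and the surrounding `config` structure are assumed, and the shown EP options are hypothetical placeholders.

```python
# Hypothetical provider options; the real script builds these per device
# (and for device == "gpu", vision_provider_options disables CUDA graph capture).
provider_options = [{"WebGpuExecutionProvider": {}}]
vision_provider_options = [{"WebGpuExecutionProvider": {}}]

# Give the decoder its own session options built from provider_options...
session_options = {"log_id": "onnxruntime-genai", "provider_options": provider_options}
# ...and keep vision_session_options scoped to the vision/embedding models only.
vision_session_options = {"log_id": "onnxruntime-genai", "provider_options": vision_provider_options}

config = {"model": {"decoder": {}, "vision": {}, "embedding": {}}}
config["model"]["decoder"]["session_options"] = session_options
config["model"]["vision"]["session_options"] = vision_session_options
config["model"]["embedding"]["session_options"] = vision_session_options
```

This way the vision/embedding workaround never leaks into the decoder's entry in genai_config.json.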

@apsonawane
Contributor

Why is this change required only for qwen3.5-0.8b model?

@xiaofeihan1
Author

> Why is this change required only for qwen3.5-0.8b model?

Thanks @apsonawane. Applied to the other Qwen3.5 models as well. A follow-up question:
why does Qwen3.5-2B use k_quant_mixed while Qwen3.5-4B uses rtn?

@apsonawane
Contributor

@xiaofeihan1 We tried a few combinations of quantization and selected the one that gives the best accuracy and performance; that's why there is a difference.
2B and 0.8B are very sensitive to int4 quantization, so we kept them mixed, while 4B and 9B perform better with int4 and give better performance.

@apsonawane apsonawane requested a review from xiaoyu-work May 4, 2026 19:29
