Test CUDA Builds

[executorch][nvidia][tensorrt][15/n] Add linear model support with addmm and permute_copy converters #10605

Triggered via pull request March 5, 2026 17:59

shoumikhin opened #17926 (branch gh/shoumikhin/40/head)

Status Failure

Total duration 52m 25s

Artifacts 16

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Waiting for pending jobs

Matrix: test-model-cuda-e2e

Waiting for pending jobs

check-all-cuda-builds

Annotations

2 errors and 2 warnings

export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job

Process completed with exit code 1.

export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job

Process completed with exit code 1.

export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job

No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.

export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job

No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.

Artifacts

Produced during runtime

Name	Size	Digest
Qwen-Qwen3-0.6B-cuda-non-quantized	1.1 GB	`sha256:6130eed5bb3c7f39b6ca1425a7887f5fb45061392babe7ef6364ac184426e506`
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only	1.1 GB	`sha256:ee4df2535920c1eb974771636481a461122d4224a6bee7d36ec90a0b9eaa4d73`
google-gemma-3-4b-it-cuda-non-quantized	7.22 GB	`sha256:8bc695303c560e6e33a4c7286a47b63826f3b72eb70d6ac1c18725343f03619a`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed	3.36 GB	`sha256:3aa085bf9a87c6e77e0d95ae8c2bf80b52b826a03f21df944301fcaa16ea2ca3`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed	2.8 GB	`sha256:13e65cb8177b9e560730a0c7491d4006fe96b46edd9158a21bc51ccb00646b97`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only	6.14 GB	`sha256:db6aef59790e858a855ef439132e901ebdd2e9e64f1c88307f5abdb591df998a`
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed	15.5 GB	`sha256:b08f996ed4bf03b2ec19e6459ee513651e4ef6a567ab2ce2a6d0546044f60c51`
nvidia-parakeet-tdt-cuda-non-quantized	952 MB	`sha256:588570860d12e00231a74e3a289d911fb71854f9f375f99e21734f1aec8d908c`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed	443 MB	`sha256:d9682b0bb8b559456029120168e29481ec6281ffa9bb70188eb03590a7c7d784`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only	430 MB	`sha256:f19ad82b617758edd5eb7f9160579590513b988ed96ebf341678199cc194074b`
openai-whisper-large-v3-turbo-cuda-non-quantized	1.18 GB	`sha256:d18bb5edd7e1e89708eca74ad72467cc27cacc705092cf2d229f0749d895d727`
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed	491 MB	`sha256:3b4ecf955fa0163ef6f76b2267ce4e3ed90e31956a7c8d3165f65b656942d56d`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only	485 MB	`sha256:03bafb2ee4e5cc6c5982829645de6559349e0b8f54a18bd97e3ddc422c2888d4`
openai-whisper-small-cuda-non-quantized	361 MB	`sha256:0055da5af398ec422f1fdacbada62c5e20d38d0818a377efdbf715601dc4219f`
openai-whisper-small-cuda-quantized-int4-tile-packed	172 MB	`sha256:559510f5db84bcfed365535dd0778efffb031bc0230d390c581814a49c008f57`
openai-whisper-small-cuda-quantized-int4-weight-only	270 MB	`sha256:83a089867fae0da3c5b110ac2c8f62e392f0d2965504e4bda045f6799d8157ed`
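
The digests in the table above can be checked against a downloaded artifact. The following is a minimal sketch, assuming the listed digest is the SHA-256 of the artifact archive as downloaded (a zip file); the helper names `sha256_digest` and `matches_digest` and the example file name are hypothetical and not part of the workflow.

```python
import hashlib


def sha256_digest(path, chunk_size=1 << 20):
    """Return 'sha256:<hex>' for the file at path, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks so multi-GB artifacts are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return "sha256:" + h.hexdigest()


def matches_digest(path, expected):
    """Compare a file's digest with a table entry such as 'sha256:0055da5a...'."""
    # Tolerate surrounding whitespace, backticks, and uppercase hex in the pasted value.
    normalized = expected.strip().strip("`").lower()
    return sha256_digest(path) == normalized
```

For example, `matches_digest("openai-whisper-small-cuda-non-quantized.zip", "sha256:0055da5af398ec422f1fdacbada62c5e20d38d0818a377efdbf715601dc4219f")` would return `True` only if the local file (a hypothetical download name) hashes to the value shown in the table.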