[executorch][nvidia][tensorrt][15/n] Add linear model support with addmm and permute_copy converters #10605
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
27m 17s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Waiting for pending jobs
Matrix: test-model-cuda-e2e
Waiting for pending jobs
check-all-cuda-builds
2s
Annotations
2 errors and 2 warnings
|
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
Process completed with exit code 1.
|
|
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Process completed with exit code 1.
|
|
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
|
|
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
Qwen-Qwen3-0.6B-cuda-non-quantized
|
1.1 GB |
sha256:6130eed5bb3c7f39b6ca1425a7887f5fb45061392babe7ef6364ac184426e506
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
|
1.1 GB |
sha256:ee4df2535920c1eb974771636481a461122d4224a6bee7d36ec90a0b9eaa4d73
|
|
|
google-gemma-3-4b-it-cuda-non-quantized
|
7.22 GB |
sha256:8bc695303c560e6e33a4c7286a47b63826f3b72eb70d6ac1c18725343f03619a
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
|
3.36 GB |
sha256:3aa085bf9a87c6e77e0d95ae8c2bf80b52b826a03f21df944301fcaa16ea2ca3
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
|
2.8 GB |
sha256:13e65cb8177b9e560730a0c7491d4006fe96b46edd9158a21bc51ccb00646b97
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
|
6.14 GB |
sha256:db6aef59790e858a855ef439132e901ebdd2e9e64f1c88307f5abdb591df998a
|
|
|
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
|
15.5 GB |
sha256:b08f996ed4bf03b2ec19e6459ee513651e4ef6a567ab2ce2a6d0546044f60c51
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
|
952 MB |
sha256:588570860d12e00231a74e3a289d911fb71854f9f375f99e21734f1aec8d908c
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
|
443 MB |
sha256:d9682b0bb8b559456029120168e29481ec6281ffa9bb70188eb03590a7c7d784
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
|
430 MB |
sha256:f19ad82b617758edd5eb7f9160579590513b988ed96ebf341678199cc194074b
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
|
1.18 GB |
sha256:d18bb5edd7e1e89708eca74ad72467cc27cacc705092cf2d229f0749d895d727
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
|
491 MB |
sha256:3b4ecf955fa0163ef6f76b2267ce4e3ed90e31956a7c8d3165f65b656942d56d
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
|
485 MB |
sha256:03bafb2ee4e5cc6c5982829645de6559349e0b8f54a18bd97e3ddc422c2888d4
|
|
|
openai-whisper-small-cuda-non-quantized
|
361 MB |
sha256:0055da5af398ec422f1fdacbada62c5e20d38d0818a377efdbf715601dc4219f
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
|
172 MB |
sha256:559510f5db84bcfed365535dd0778efffb031bc0230d390c581814a49c008f57
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
|
270 MB |
sha256:83a089867fae0da3c5b110ac2c8f62e392f0d2965504e4bda045f6799d8157ed
|
|