mirror of
https://github.com/microsoft/onnxruntime.git
synced 2026-04-07 18:57:13 +00:00
## Summary

This PR improves CUDA Plugin EP development and validation in three areas:

- Fixes the Windows CUDA Plugin EP build so the plugin can be compiled successfully with MSVC.
- Adds dedicated Windows and Linux GitHub Actions workflows for building and testing the CUDA Plugin EP.
- Expands the quick start documentation with instructions for running the CUDA Plugin EP Python tests locally.

## Changes

### Windows build fixes

- Update the CUDA plugin CMake configuration to use the correct forced-include flags on Windows/MSVC.
- Keep the existing forced-include behavior for non-MSVC toolchains.
- Add the missing `GetEnvironmentVar(const std::string&)` forward declaration needed by plugin builds on Windows.

### CI coverage for CUDA Plugin EP

Add a Windows CUDA Plugin EP workflow that:

- builds ONNX Runtime with `onnxruntime_BUILD_CUDA_EP_AS_PLUGIN=ON`
- uploads build artifacts
- installs the built wheel
- sets `ORT_CUDA_PLUGIN_PATH`
- runs `test_cuda_plugin_ep.py`

Add a similar Linux CUDA Plugin EP workflow.

### Documentation updates

- Add a Running Tests section to the CUDA Plugin EP quick start.
- Document test prerequisites, dependency installation, and `ORT_CUDA_PLUGIN_PATH`.
- Clarify that CPU-only PyTorch is sufficient for `test_cuda_plugin_ep.py` because it is used for CPU-side reference computations.
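
### Example: running the plugin tests locally (illustrative sketch)

The local test flow described above (set `ORT_CUDA_PLUGIN_PATH`, then run `test_cuda_plugin_ep.py`) could be scripted roughly as follows. This is a minimal sketch, not the workflow's actual steps: the helper names (`build_test_env`, `build_test_command`, `run_plugin_tests`) are hypothetical, the plugin library path is a placeholder to be supplied by the caller, and it assumes the test script can be invoked directly with the Python interpreter after the built wheel has been installed.

```python
import os
import subprocess
import sys
from pathlib import Path


def build_test_env(plugin_path, base_env=None):
    """Return a copy of the environment with ORT_CUDA_PLUGIN_PATH set.

    `base_env` defaults to the current process environment; passing a dict
    makes the helper easy to check in isolation.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["ORT_CUDA_PLUGIN_PATH"] = str(plugin_path)
    return env


def build_test_command(test_script, python=sys.executable):
    """Build the command line that runs the test script.

    Assumption: the script is runnable as `python test_cuda_plugin_ep.py`.
    """
    return [python, str(test_script)]


def run_plugin_tests(plugin_path, test_script):
    """Run the CUDA Plugin EP tests and return the process exit code."""
    if not Path(plugin_path).is_file():
        # Fail early with a clear message instead of a confusing load error.
        raise FileNotFoundError(f"CUDA plugin library not found: {plugin_path}")
    result = subprocess.run(
        build_test_command(test_script),
        env=build_test_env(plugin_path),
        check=False,
    )
    return result.returncode
```

A caller would pass the path of the plugin library produced by the build and the path to `test_cuda_plugin_ep.py`, and treat a nonzero return code as a test failure. Checking that the library exists before launching the tests is a design choice in this sketch only; it is not something the PR states.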