CUTLASS CUDA C++ and Python DSL for Blackwell, Rubin, and future architectures. Optimize kernels for peak throughput... compiler, CUDA library, and DL frameworks teams to ensure fast, functional, and timely kernel delivery to customers...
. Qualifications Strong engineering skills, especially any of: CUDA/Triton/Pallas/CuTe DSL kernel development, lower-level PyTorch.../JAX/XLA development, CUDA Graphs, FPGA/ASIC experience Must have two or more years work experience building deep learning...
Hudson River Trading ⚡⚡ Wed, 03 Jun 2026 01:44:58 GMT
SW/FW GA release including firmware bundles, DGX BaseOS, GPU drivers, CUDA toolkit, DCGM, and DOCA/OFED. For DGX Station...: Validate the full NVIDIA AI software stack on DGX Station: CUDA toolkit, cuDNN, TensorRT, NCCL, Triton Inference Server, DCGM...
, and telemetry through the host OS, GPU and CPU drivers, and CUDA — to deliver a production-ready inference platform that operates... center stack, BMC firmware, BIOS, host OS, GPU/CPU drivers, CUDA, DCGM, and manageability telemetry as a single integrated...
Senior Software Development Engineer in Test
Generation, Reflex, CUDA, G-Sync, or related areas. Strong knowledge of PC gaming ecosystems, including launchers, overlays...
Nvidia ⚡ ⚡ Wed, 03 Jun 2026 05:12:40 GMT