: Apply roofline analysis and systematic profiling to decompose bottlenecks across CUDA kernels, frameworks, and serving layers... programming experience (CUDA). Track record of leading ambiguous, high-impact technical programs across multiple teams...
problems, diving into MLIR/ONNX and CUDA/TensorRT internals, and mentoring others on performance engineering. We value clear... and optimizing ONNX‑based model export and deployment pipelines GPU programming (CUDA) and familiarity with ML SW stack (e.g...
for you. What you’ll be doing (Responsibilities) Design, implement, benchmark, and iterate on CUDA-based kernels and custom operators... that make it easier to profile, debug, and validate CUDA kernels and accelerator-backend code across the AV stack. Partner...
and acceleration structure construction through GPU ray tracing and sensor-specific post-processing. Our stack includes C++, CUDA.... - Improve ray tracing performance through BVH optimization, shader efficiency, and pipeline profiling using CUDA and OptiX...
problems, and diving into MLIR/ONNX and CUDA/TensorRT internals. We value clear thinking, strong engineering fundamentals...) Experience building and optimizing ONNX‑based model export and deployment pipelines GPU programming (CUDA) and familiarity...
Senior Software Engineer, L4 - Autonomous Vehicles
To Stand Out From The Crowd: PhD in a relevant field or related research experience Knowledge of CUDA is a plus NVIDIA...
Nvidia ⚡ ⚡ Wed, 11 Mar 2026 04:08:20 GMT