Overview We are looking for a GPU Performance Engineer to build highly optimized CUDA kernels for low-latency...-efficient code. What you'll do Design, implement, and optimize custom CUDA kernels for latency-critical inference workloads...
interaction with software components. Maintain and update hardware designs as needed. CUDA (Compute Unified Device Architecture...) /OpenCL (Open Computing Language) Programming: Develop and optimize applications using CUDA or OpenCL, harnessing the full...
Xcelerate Solutions ⚡⚡ Sun, 29 Mar 2026 02:49:56 GMT
or PBS distributed parallel compute workloads utilizing MPI or OpenMP GPU-enabled compute resources supporting CUDA-based... environments and CUDA frameworks is desirable. Performance Troubleshooting Ability to identify issues affecting workload...
HPC Systems Engineer
GPU-enabled computer environments and CUDA workloads Experience with configuration management tools such as Ansible...
Parsons ⚡ $103500 - 181100 per year ⚡ Mon, 30 Mar 2026 04:11:06 GMT