) to identify bottlenecks in compute, memory bandwidth, and latency. Write and optimize high-performance kernels using CUDA, HIP...+ years in performance optimization, parallel computing, and low‑level systems using C++ and GPGPU frameworks (CUDA preferred...
-performance heterogeneous compute environments - Deploy signal processing algorithms through CUDA-based pipelines to meet project... requirements - Utilize expertise in digital signal processing, C++, & CUDA to optimize performance for applications...
Lockheed Martin ⚡ $91000 - 181113 per year ⚡ Thu, 14 May 2026 03:37:09 GMT
Member of Technical Staff, AI Engineering
) to identify bottlenecks in compute, memory bandwidth, and latency. Write and optimize high-performance kernels using CUDA, HIP...+ years in performance optimization, parallel computing, and low‑level systems using C++ and GPGPU frameworks (CUDA preferred...
Micron ⚡ ⚡ Sat, 16 May 2026 04:42:10 GMT