software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative AI models... in developing inference backends and compilers for GPUs. Knowledge of Machine Learning techniques and GPU programming with CUDA...
the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic... and optimize custom CUDA kernels and TensorRT Plugins to maximize memory bandwidth and minimize latency on AI accelerators...
training workloads or parallel applications. Proven understanding of CPU and GPU architectures, CUDA, parallel filesystems.../SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g., DPU, RoCE, InfiniBand), and/or ARM CPU solutions Hands...
be doing: Design, build, and harden containers for NIM runtimes, inference backends; enable reproducible, multi-arch, CUDA... and running GPU workloads in k8s, including NVIDIA device plugin, MIG, CUDA drivers/runtime, and resource isolation. Excellent...
Senior Robotics Engineer
Qualifications Experience with NVIDIA Jetson, Isaac SDK, Isaac Sim, Omniverse, CUDA, or GPU-accelerated robotics workloads...
Prime Robotics ⚡ $150000 per year ⚡ Tue, 14 Apr 2026 22:33:46 GMT