Developer Technology Engineer - AI job opportunity at NVIDIA.



Date posted: 11 days ago
NVIDIA Developer Technology Engineer - AI
Experience: General
Employment type: Full-time

Degree: PhD
Location: Beijing, China

NVIDIA is looking for a passionate, world-class computer scientist to work in its Compute Developer Technology (DevTech) team. In this role, you will research and develop techniques to GPU-accelerate leading applications in high-performance computing fields within machine and deep learning, scientific computing, and data processing, performing in-depth analysis and optimization to ensure the best possible performance on current- and next-generation GPU architectures.

What you will be doing:

Working directly with key application developers (especially for LLMs) to understand the current and future problems they are solving, creating and optimizing core parallel algorithms and data structures to provide the best solutions using GPUs, through both library development and direct contribution to the applications. This includes training and inference optimization for large language models, directly contributing to frameworks such as Megatron, TRTLLM, SGLang, vLLM...

Collaborating closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models, including by investigating the impact on application performance and developer productivity.

Engaging in deep optimization of high-performance operators, including but not limited to deep CUDA optimization and instruction and compiler optimization. These optimizations will directly support customers or be integrated into products like cuDNN, cuBLAS, and CUTLASS...

Some travel is required for conferences and for on-site visits with developers.

What we need to see:

A university degree in an engineering or computer science related discipline (BS; MS or PhD preferred).

2+ years of working experience is required.

Strong knowledge of C/C++ and/or Fortran.

Deep knowledge of software design, programming techniques, and algorithms.

Expert knowledge of LLM training/inference optimization, including but not limited to development and optimization experience in distributed training/inference, NCCL, NVSHMEM, IB, RoCE, etc.

Strong mathematical fundamentals, including linear algebra and numerical methods.

Experience with parallel programming, ideally CUDA C/C++ and OpenACC.

Good communication and organization skills, with a logical approach to problem solving, good time management, and task prioritization skills.
