Senior Site Reliability Engineer, BCM - DGX Cloud job opportunity at NVIDIA.



NVIDIA Senior Site Reliability Engineer, BCM - DGX Cloud
Requires: 8 years of experience
Employment type: full-time
Bachelor's (B.A.)
Santa Clara, California, United States of America

Contributing to deployments and daily operations of large-scale, next-generation GPU platforms.
Handling incidents in GPU clusters, bridging the gap between cluster operations and development.
Designing and implementing small features in the Base Command Manager product to become intimately familiar with the workings of the product.
Validating complex cluster configurations, including Slurm and Kubernetes orchestrators, for performance, scalability, and resilience, ensuring they meet real-world customer scenarios.

Other AI Matches

Senior Front-End Power Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Software Engineer, SPE Applicants are expected to have a solid experience in handling Marketing related tasks
Senior Technical Program Manager - Silicon Solutions Applicants are expected to have a solid experience in handling Technical Support Engineering related tasks
ASIC Design Engineer - Security Subsystem Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Research Scientist - Digital Biology Applicants are expected to have a solid experience in handling Scientists related tasks
Senior QA Engineer, DPU Firmware Applicants are expected to have a solid experience in handling Engineer related tasks
GPU C++ Modeling Engineer - New College Grad 2026 Applicants are expected to have a solid experience in handling Engineering related tasks
Developer Technology Engineer Intern, HPC - 2026 Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Software Architect - Deep Learning and HPC Communications Applicants are expected to have a solid experience in handling Software related tasks
Distinguished Software Engineer - NVLink Fusion Software Applicants are expected to have a solid experience in handling engineering related tasks
Engineering Manager, AI Developer Technology Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Infrastructure Security Engineer - DGX Cloud Applicants are expected to have a solid experience in handling Damilola related tasks
Senior Site Reliability Engineer, BCM - DGX Cloud Applicants are expected to have a solid experience in handling Engineer related tasks
PhD Research Intern, AI for Climate and Weather Simulation - 2026 Applicants are expected to have a solid experience in handling Intern related tasks
Developer Relations Manager, Quantum Computing Applicants are expected to have a solid experience in handling Development related tasks
Senior Software Engineer - Isaac for Healthcare Applicants are expected to have a solid experience in handling Driver related tasks
Windows Driver Verification Engineer Applicants are expected to have a solid experience in handling Administration related tasks
SWAQ Tools Development Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Deep Learning Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Product Program Manager Applicants are expected to have a solid experience in handling Manager related tasks
Senior Switch Firmware Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Architect, C2C Applicants are expected to have a solid experience in handling Architect related tasks
Senior Package Layout Engineer - Hardware Applicants are expected to have a solid experience in handling Engineering related tasks