How do we ensure that the world’s most powerful AI supercomputers never drop a packet?
At NVIDIA, we are looking for a proactive Senior Network System Validation Engineer to join our Ethernet QA Team! In this role, we lead the AI and Data Center revolution by qualifying the high-speed routing and traffic engineering solutions that power the globe’s most advanced computing fabrics!
A Note on the Role’s Scope
This position is a low-level systems engineering role focused on the intersection of hardware and software. Our work is distinct from “classic” QA (Web, UI, or Mobile testing). We focus on deep-tier routing stacks, hardware-software integration, and sophisticated traffic engineering. If your passion lies in network protocols and system-level validation, we want to hear from you!
What You’ll Be Doing
* Architecture & Design: Reviewing architectural specifications for new high-speed networking features to ensure they meet the demands of modern data center environments.
* Protocol Validation: Inventing and implementing end-to-end verification strategies for advanced Layer 3 routing, underlay and overlay technologies.
* Traffic Engineering: Qualifying sophisticated QoS features such as PFC, ECN, and RoCE to ensure both lossless and lossy traffic behave efficiently.
* High-Performance Triage: Reproducing and debugging intricate system-level issues using packet captures and system logs to identify root causes within the networking stack.
* Automation Infrastructure: Architecting and maintaining scalable automation frameworks in Python to support our validation standards.
What We Need To See
* Professional Foundation: 5+ years of hands-on experience in Network Engineering, System Validation, or Protocol Testing.
* Deep Networking Knowledge: A level of expertise equivalent to CCNP, JNCIP-ENT, or HCIP-Datacom certifications, demonstrating a deep mastery of networking theory and production-level practice.
* L2/L3 Protocol & Fabric Mastery: Expert-level understanding of control and data planes. You should be highly confident in configuring and troubleshooting BGP (v4/v6) and OSPF, as well as modern fabric architectures such as EVPN-VXLAN and MLAG or Dual-ToR.
* Traffic Management: A strong background in lossy and lossless networking, including the qualification of QoS mechanisms, congestion control (PFC, ECN, WRED), and RDMA-based technologies (e.g., RoCE) on high-speed platforms.
Ways To Stand Out From The Crowd
* Systems-Level Automation: Advanced Python programming skills, specifically for architecting frameworks that interact with the Linux networking stack, Bash, and Network Operating Systems (NOS).
* Linux Internals: In-depth knowledge of the Linux kernel networking stack (LPIC-2 level or similar).
* Performance Tooling: Hands-on experience with traffic generators like IXIA, Spirent, or TRex to benchmark throughput and latency.
* Virtualization: Experience with network virtualization and containerized environments (Linux KVM, ESXi, Docker).
What We Offer:
* 36 days of paid vacation a year, days off on all public holidays, paid sick leave, and company-paid parental leave for mothers and fathers.
* The most competitive salary on the market.
* NVIDIA stock.
* Premium medical insurance for employees and their children/spouses.
* Life insurance.
* Professional courses at Ivy League universities.
* Training sessions and lectures.
* English or other language classes.
* Personalized career development plan.
* Wellbeing programs.