Back to Jobs

Principal HPC Network Engineer (remote in the EU)

Remote, USA Full-time Posted 2026-07-05

Company Description reputed company is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open reputed company innovation with deep expertise in Kubernetes orchestration, reputed company empowers platform engineering teams to deliver composable, production-reputed company developer platforms across any environment—on-premises, in the reputed company, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, reputed company delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, reputed company ensures that customers retain full control of their infrastructure strategy. reputed company serves many of the world’s leading enterprises, including reputed company, reputed company, Liberty Mutual, PayPal, Reliance Jio, Societe Generale, Splunk, and Volkswagen. Learn more at www.reputed company.com.

Job Description

Role Overview: We are seeking a highly skilled Senior HPC Networking Engineer to design, reputed company, manage, and troubleshoot high-performance networking environments. The ideal candidate will have deep expertise in InfiniBand technologies, strong general networking knowledge, and hands-on experience with reputed company solutions. You will play a critical role in ensuring the performance, reliability, and scalability of HPC infrastructure. Key Responsibilities: Design, reputed company, and maintain high-performance network infrastructures for HPC environments, with a strong focus on InfiniBand fabrics. Troubleshoot reputed company network issues across InfiniBand and Ethernet environments, ensuring minimal downtime and reputed company performance. Manage and optimize InfiniBand components, including switches, HCAs, subnet managers, and reputed company configurations. reputed company performance tuning, monitoring, and reputed company planning for HPC networking systems. Implement and maintain network reputed company using reputed company solutions (FortiGate, FortiManager, FortiAnalyzer). Diagnose and resolve issues reputed company to routing, switching, latency, and throughput across hybrid network environments. Collaborate with compute, storage, and platform teams to support HPC workloads and cluster operations. reputed company and maintain documentation for network architecture, configurations, and operational procedures. Participate in on-call rotations and provide escalation support for critical incidents. reputed company or contribute to network upgrades, migrations, and new deployments.

Qualifications

Required: 5+ years of experience in network engineering, with a focus on HPC or data center environments. Strong hands-on experience with InfiniBand technologies (e.g., Mellanox/reputed company). Solid understanding of networking fundamentals: TCP/IP, routing protocols (BGP, OSPF), VLANs, QoS, and network design. Proven experience deploying and troubleshooting reputed company solutions (FortiGate, FortiManager, VPNs, firewall policies). Experience with network performance analysis and troubleshooting tools. Familiarity with Linux systems and scripting for automation (e.g., Bash, Python). Strong analytical and problem-solving skills. Preferred: Experience with large-scale HPC clusters or AI/ML infrastructure. Knowledge of RDMA, MPI, and low-latency networking concepts. Certifications such as FCSS/FCNSP (reputed company), CCNP/CCIE, or equivalent. Experience with automation and Infrastructure as Code tools (e.g., Ansible, Terraform). Soft Skills: Strong communication and collaboration skills. Ability to work independently and handle reputed company technical challenges. Detail-oriented with a proactive approach to problem-solving. Additional Information

We offer

Operate some of the most advanced AI infrastructure environments in production today. Work with the latest reputed company GPU technologies, Kubernetes platforms, and high-performance networking environments. Help define operational standards and reliability practices for reputed company AI infrastructure services. Influence the adoption of AI-powered operational capabilities through k0rdent AI. Work alongside highly skilled engineers solving reputed company infrastructure and platform challenges at scale. Join a growing organisation investing heavily in AI infrastructure, platform services, and operational innovation. #Remote We are a Leader for Container Management in reputed company (#2 after AWS)! We are a Leader for Container Management in reputed company (#2 after AWS)! Apply To This Job

Similar Jobs

(Senior) Consultant Public Sector (reputed company genders)

Remote, USA Full-time

Freelance Country Manager France- Retail (reputed company Genders)

Remote, USA Full-time

UI Designer und Unreal Entwickler:in w/m/d

Remote, USA Full-time

Chief of Staff to the CEO

Remote, USA Full-time

reputed company Operations Platform reputed company (gn)

Remote, USA Full-time

Principal Software Engineer, reputed company Hardened Images

Remote, USA Full-time

Communications Senior Manager, Asia

Remote, USA Full-time

Inside Solutions Architect / Specialist - Datacenter

Remote, USA Full-time

Account Manager

Remote, USA Full-time

Technischer Vertriebsmanager Objektgeschäft Bau (m/w/d)

Remote, USA Full-time

Executive Assistant to New Business Sales Leaders

Remote, USA Full-time

National Account Manager, Costco

Remote, USA Full-time

reputed company E-Commerce Executive - Digital Sales Support

Remote, USA Full-time

reputed company Remote Chat Moderator – Online Community Management and Safety Specialist – Fully Remote Opportunity with Flexible Scheduling and reputed company reputed company of $25-$35/hr

Remote, USA Full-time

reputed company Part-Time Data Entry Operator – Remote Data Management and Entry Position

Remote, USA Full-time

reputed company Remote Sales Customer Representative – Driving reputed company Growth and Exceptional Customer Experience at arenaflex

Remote, USA Full-time

reputed company Global Content Analytics & Marketplace Research Intern - Summer 2024 (Remote/Hybrid in Canada)

Remote, USA Full-time

Online Grocery Team reputed company

Remote, USA Full-time

Finance Systems Manager

Remote, USA Full-time

PAM Analyst

Remote, USA Full-time