About the Role
We are seeking a Mid-Senior Network Engineer - Datacenter Operations to join our rapidly scaling AI infrastructure company. This Network Engineer remote position offers an exciting opportunity to work with cutting-edge technology and play a crucial role in supporting the backbone of AI research and development.
What You'll Do
- Own network operations for an assigned datacenter region, supporting deployments, turn-ups, and expansions.
- Act as Tier 2/3 escalation point for network incidents, ensuring swift resolution of issues.
- Troubleshoot complex L1–L3 and fabric-level issues to maintain optimal network performance.
- Coordinate network break-fix activities with onsite teams and vendors to ensure minimal downtime.
- Manage RMAs and vendor escalations effectively.
- Build and maintain regional/network observability dashboards to monitor performance metrics.
- Validate production readiness and operational handover for new deployments.
Requirements
- 4+ years of experience in network engineering with a strong focus on production operations.
- Proven experience in running and troubleshooting live datacenter networks.
- Strong incident response and outage leadership experience.
- Hands-on experience with EVPN/VXLAN, BGP, CLOS, and high-radix switching.
- Confident in troubleshooting L2/L3, routing, fabric, and physical faults.
- Experience with SQL-backed dashboards (Grafana, Tableau, or similar).
- Working knowledge of Python for operations, analysis, or scripting.
- Willingness to travel approximately 30-40% of the time.
Nice to Have
- Experience in AI/ML or HPC network operations (RDMA, RoCEv2, lossless Ethernet).
- Previous site, campus, or regional operations ownership.
- Hands-on hardware break-fix and RMA coordination experience.
- Familiarity with network monitoring, alerting, and telemetry tools.
- Experience in follow-the-sun or globally distributed operations.
What We Offer
- Competitive salary range of $150,000 - $250,000, commensurate with experience.
- Meaningful equity opportunities to share in our success.
- Generous PTO policy to support work-life balance.
- Remote flexibility with an emphasis on in-office presence when necessary.
- Opportunity to work with a talented team in a high-growth environment.
This role offers a unique opportunity to work in a high-growth AI infrastructure company, with competitive compensation and the chance to make a significant impact.
About Realm
Explore Realm careers in 2026 and discover exciting job opportunities across remote, hybrid, and office roles. Our platform offers advanced filters, application tracking, and valuable company insights to help you find the perfect position at Realm. Stay informed with the latest industry news while tailoring your resume to increase your chances of landing your dream job. Start your journey today!
Who Will Succeed Here
Proficient in advanced network protocols including EVPN and VXLAN, with hands-on experience in configuring BGP for efficient data routing and managing both Layer 2 and Layer 3 networking environments.
Self-motivated and disciplined, capable of working effectively in a fully remote setting while managing multiple tasks and projects, demonstrating strong time management and organizational skills.
Analytical mindset with a strong understanding of Python scripting to automate network tasks and proficient in using monitoring tools like Grafana and Tableau for data visualization and performance metrics.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months