Remote Senior Staff Software Engineer - Reliability Engineering
About the Role
We are seeking a Remote Senior Staff Software Engineer to join NMI's Reliability Engineering team. This role is crucial in evolving our product-oriented approach to reliability, focusing on enhancing the performance and operational maturity of our critical platform services. As a Remote Senior Staff Software Engineer, you will play a pivotal role in transitioning our engineering organization from reactive incident response to intentional, engineered reliability through robust systems, tooling, and standards.
What You'll Do
- Design and build reliability-focused frameworks, tooling, and standards that enhance platform uptime, performance, and operational confidence.
- Drive initiatives that shift reliability from reactive response to proactive engineering, emphasizing prevention, early detection, and fast recovery.
- Collaborate with engineering teams to integrate reliability into system design, development practices, and deployment workflows.
- Establish and refine observability practices, including metrics, logging, alerting, and dashboards that provide clear operational insights.
- Identify systemic risks and failure patterns, leading efforts to address them through automation, architectural improvements, and process refinement.
- Contribute hands-on to production codebases, internal tools, and platform services with a focus on long-term maintainability.
- Influence technical direction across teams through design reviews, technical proposals, and clear written communication.
- Enhance operational maturity through improved incident practices, post-incident learning, and continuous improvement loops.
- Mentor engineers by modeling strong ownership, technical judgment, and disciplined delivery.
- Participate in on-call rotations with a clear mandate to reduce operational load over time through engineering.
Requirements
- 8+ years of experience building and operating production-grade software systems in complex environments.
- Strong experience in reliability engineering practices and principles.
- Proficiency in designing and implementing observability frameworks.
- Hands-on experience with cloud platforms and microservices architecture.
- Excellent communication skills for cross-team collaboration and mentorship.
Nice to Have
- Experience with automation tools and CI/CD pipelines.
- Familiarity with incident management and post-mortem processes.
- Knowledge of programming languages such as Python, Go, or Java.
What We Offer
- Competitive salary ranging from $140,000 to $180,000 per year.
- Flexible remote work environment with a focus on work-life balance.
- Opportunities for professional development and mentorship.
- Health and wellness benefits.
- Collaborative and innovative team culture.
This Remote Senior Staff Software Engineer position at NMI offers a unique opportunity to lead reliability engineering initiatives in a flexible work environment, with a competitive salary and a focus on professional growth.
Generating success profile...
Analyzing job requirements and market data
Loading market overview...
Analyzing market trends and skill demands
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months