Affirm01.03.26
AI SCORE 8.5

Director of Software Engineering - Site Reliability Engineering (Remote)

$267K–$360K/year

About the Role

Affirm is seeking a talented and experienced Director of Software Engineering - Site Reliability Engineering (Remote) to lead our Site Reliability Engineering team. In this role, you will be responsible for ensuring the reliability, availability, and performance of our services while working remotely from the United States. As a key member of our engineering leadership team, you will drive initiatives that enhance our systems and processes, ensuring a seamless experience for our users.

What You'll Do

  • Lead and mentor a team of Site Reliability Engineers, fostering a culture of excellence and continuous improvement.
  • Develop and implement strategies to enhance system reliability, performance, and scalability.
  • Collaborate with cross-functional teams to design and deploy robust infrastructure solutions.
  • Oversee incident management processes, ensuring rapid response and resolution of service disruptions.
  • Utilize monitoring and observability tools to proactively identify and address potential issues.
  • Drive automation initiatives to improve operational efficiency and reduce manual intervention.
  • Participate in architectural discussions and contribute to the overall technical direction of the organization.
  • Engage with stakeholders to align engineering efforts with business objectives and user needs.

Requirements

  • 10+ years of experience in software engineering, with at least 5 years in a leadership role focused on Site Reliability Engineering.
  • Strong expertise in cloud platforms (AWS, GCP, Azure) and container orchestration technologies (Kubernetes, Docker).
  • Proficient in programming languages such as Python, Go, or Java.
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack).
  • Excellent problem-solving skills and a proactive approach to system reliability.
  • Strong communication and interpersonal skills to collaborate effectively with diverse teams.
  • Experience in incident response and disaster recovery planning.
  • Ability to work independently in a remote environment while managing multiple priorities.

Nice to Have

  • Experience in FinTech or related industries.
  • Familiarity with DevOps practices and CI/CD pipelines.
  • Knowledge of security best practices in cloud environments.

What We Offer

  • Competitive salary ranging from $267,000 to $360,000, commensurate with experience.
  • Remote work flexibility, allowing you to work from anywhere in the United States.
  • Comprehensive health benefits, including medical, dental, and vision coverage.
  • Generous paid time off and holidays to support work-life balance.
  • Opportunities for professional development and continuous learning.
  • Collaborative and inclusive company culture that values diversity.
  • Equity options to share in the company's success.
  • Access to cutting-edge technologies and tools to enhance your work experience.
Why This Job8.5 of 10

This Director of Software Engineering role at Affirm offers a unique opportunity to lead a talented team in a remote setting, with a competitive salary and a focus on system reliability.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

Who Will Succeed Here

Deep expertise in cloud platforms such as AWS, GCP, or Azure, with hands-on experience deploying and managing applications in a cloud-native environment, particularly using Kubernetes and Docker for container orchestration.

Strong analytical mindset with a proactive approach to incident management and performance optimization, utilizing tools like Prometheus for monitoring and alerting to ensure system reliability and uptime.

Proven leadership experience in remote settings, demonstrating the ability to mentor and guide teams in adopting best practices in Site Reliability Engineering while fostering a culture of continuous improvement and accountability.

Learning Resources

Site Reliability Engineering: How Google Runs Production Systemsbook

Career Path

Director of Software Engineering - Site Reliability Engineering(Now)Senior Director of Engineering(1-2 years)VP of Engineering or Chief Technology Officer(3-5 years)

Market Overview

Market Size 2024
$10.5B
Annual Growth
23.4%
AI Adoption in SRE
45%
Investment in SRE Tools
+150%
Labour Demand for SRE Roles
+35%
Avg Salary for SRE Directors
$180K

Skills & Requirements

Required
Site Reliability EngineeringAWSGCP
Growing in Demand
Infrastructure as Code (IaC)Machine Learning Operations (MLOps)Chaos Engineering
Declining
Traditional IT Operations ManagementOn-Premise Server Management

Domain Trends

Increase in Cloud-Native Adoption
By 2025, 85% of organizations are expected to operate on a cloud-native architecture, driving demand for SRE practices.
Rise of Observability Tools
The observability tools market is projected to grow by 30% annually, with solutions like Prometheus gaining significant traction in SRE teams.
Focus on Automation and AIOps
AIOps is being adopted by 60% of SRE teams to enhance incident response times, with automation becoming a key component of reliability strategies.

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.