Remote Systems Reliability Engineer (SRE) - Cloud Infrastructure Focus
About the Role
We are seeking a talented Remote Systems Reliability Engineer (SRE) to join our dynamic team. In this role, you will be responsible for ensuring the reliability and performance of our cloud infrastructure. As a Systems Reliability Engineer, you will work closely with development teams to implement best practices in reliability, automation, and monitoring. This is a fantastic opportunity to make an impact while working in a mission-driven company.
What You'll Do
- Design and implement reliable cloud infrastructure solutions using AWS, Azure, and Kubernetes.
- Develop and maintain Infrastructure-as-Code (IaC) using Terraform and CloudFormation.
- Monitor system performance and troubleshoot issues to ensure optimal uptime and reliability.
- Collaborate with cross-functional teams to enhance observability and performance tuning.
- Automate operational tasks to improve efficiency and reduce manual intervention.
Requirements
- 3+ years of experience as a Systems Reliability Engineer or similar role.
- Strong proficiency in cloud platforms such as AWS and Azure.
- Experience with scripting languages like Python or Go.
- Familiarity with CI/CD pipelines and DevOps practices.
- Knowledge of compliance frameworks like NIST SP-800 53 and FISMA is a plus.
Nice to Have
- Experience with observability platforms and monitoring tools.
- Familiarity with database scripting and performance tuning.
- Knowledge of robotics and automation technologies.
What We Offer
- Rich medical, dental, vision, and EAP benefits.
- Caregiver leave for new parents.
- Reimbursements for wellness and learning development.
- Company-provided laptop and home-office stipend.
- Enrollment in a 401k program.
This Remote Systems Reliability Engineer position offers a competitive salary and a chance to work in a mission-driven environment. Enjoy rich benefits and equity options.
Who Will Succeed Here
Proficient in AWS and Azure cloud services, with hands-on experience in building and managing infrastructure using Terraform and CloudFormation to ensure scalable and reliable deployments.
Strong problem-solving mindset with a focus on automation using Python and Go, enabling efficient CI/CD pipelines and reducing manual intervention in deployment processes.
Self-motivated and disciplined, thriving in a remote work environment, capable of managing time effectively and collaborating asynchronously with cross-functional teams.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months