About the Role
We’re on the lookout for a Site Reliability Engineer to join our innovative team at Agentero. This remote Site Reliability Engineer position is perfect for professionals based in Latin America who are ready to make a significant impact in the insurance technology sector.
What You'll Do
- Design and implement observability solutions that alert on symptoms rather than outages, ensuring early warnings before customer impact.
- Create and maintain runbooks that document actions, transforming findings into repeatable processes and automation.
- Build and maintain cloud infrastructure using Infrastructure-as-Code principles, collaborating with backend engineers to enhance service reliability.
- Participate in a business-hours on-call rotation aligned with US time zones, enabling a follow-the-sun model with no midnight pages.
- Champion a culture of reliability, focusing on automation, documentation, and continuous improvement.
Requirements
- At least 4 years of relevant experience in SRE, DevOps, or Infrastructure roles.
- Proficiency with Infrastructure-as-Code tools, preferably Terraform.
- Experience with cloud platforms such as AWS or GCP.
- Strong Linux systems administration and troubleshooting skills.
- Programming ability in Go, Python, or similar languages.
- Familiarity with observability and monitoring tools like Datadog, Prometheus, or Grafana.
- Excellent communication skills in English.
Nice-to-Haves
- Experience with GCP, particularly Cloud Run and Cloud Monitoring.
- Background in incident management and writing effective runbooks.
- Experience with CI/CD pipelines and deployment automation.
What We Offer
- Competitive salary of 45-65K EUR plus equity.
- Remote-first work environment, allowing you to work from anywhere in Latin America.
- Home office setup budget and training development budget.
- Business-hours on-call with a follow-the-sun model.
- Collaborate with an international team across Spain and the US.
- Opportunities for team offsites in exciting locations.
If you’re ready to join a dynamic team and help shape the future of insurance technology, apply today for this remote Site Reliability Engineer position!
Join Agentero as a Site Reliability Engineer and work remotely from LATAM. This role offers a competitive salary, equity options, and the chance to innovate in the InsurTech industry.
Who Will Succeed Here
Proficient in using Terraform for infrastructure as code, with a strong understanding of AWS and GCP services to automate deployments and optimize cloud resources.
Self-motivated and disciplined in a remote work environment, capable of managing time effectively and prioritizing tasks to ensure proactive monitoring and incident response with tools like DataDog and Prometheus.
Hands-on experience with programming in Go and Python, enabling the development of custom monitoring solutions and automation scripts to enhance system reliability and performance.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months