Site Reliability Engineer - Remote Opportunity at Upsun
About the Role
We are looking for a Site Reliability Engineer to join our remote team at Upsun. As a Site Reliability Engineer, you will play a critical role in enhancing our cloud application platform, ensuring reliability and scalability while automating operational tasks. This Site Reliability Engineer remote position is perfect for those who thrive in a collaborative environment and are eager to drive continuous improvements.
What You'll Do
- Refine monitoring and observability using tools like Prometheus, Grafana, and ELK Stack to ensure system visibility.
- Automate deployments and workflows by transitioning manual processes to automated solutions using Infrastructure as Code (IaC) tools such as Terraform and Ansible.
- Optimize CI/CD pipelines to ensure fast, reliable releases and scalability.
- Manage cloud infrastructure on platforms like AWS, GCP, and Azure while minimizing technical debt.
- Support incident management and lead post-mortem analysis for continuous improvement.
- Collaborate with cross-functional teams to integrate reliability practices into the development lifecycle.
- Drive technical innovation by introducing new tools and technologies that enhance system reliability.
Requirements
- Solid understanding of DevOps, Cloud Operations, or SRE principles with a focus on reliability and scalability.
- Hands-on experience with Linux systems, including performance tuning and troubleshooting.
- Proficiency in programming languages such as Go (preferred) or Python.
- Strong scripting skills in languages like Python, Bash, or Go for automating workflows.
- Extensive experience with cloud platforms like AWS, GCP, and Azure.
- Experience with containerization technologies like Docker and Kubernetes is a plus.
- Strong problem-solving skills and the ability to collaborate effectively across teams.
Nice to Have
- Experience with monitoring/logging frameworks and CI/CD pipelines.
- Knowledge of security best practices in cloud environments.
What We Offer
- A product you can believe in - Join us in transforming how businesses build and manage web applications.
- An award-winning workplace recognized by Forbes’ Top 30 Companies for Remote Jobs.
- A culture that values your voice in an inclusive work environment.
- Flexible PTO and comprehensive healthcare coverage.
- Company stock options and a professional development budget.
- Annual team gatherings and internet reimbursement.
- Inclusive parental leave and a remote work travel program.
At Upsun, we celebrate diversity and are committed to fostering an inclusive workplace where everyone can thrive. If you’re ready to make a positive impact as a Site Reliability Engineer remote, we want to hear from you!
This Site Reliability Engineer role at Upsun offers a unique opportunity to work remotely while enhancing system reliability and scalability. With a strong focus on innovation and collaboration, you'll be part of a diverse global team.
Generating success profile...
Analyzing job requirements and market data
Loading market overview...
Analyzing market trends and skill demands
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months