Site Reliability Engineer (SRE) - Remote Role for Public Sector
About the Role
We are seeking a talented Site Reliability Engineer (SRE) to join our innovative technology company focused on critical systems for government security agencies. In this remote role, you will ensure the stability, security, and auditability of complex on-premises infrastructure running in German and European data centers. If you are passionate about operational excellence and thrive in environments where reliability is paramount, this position is a strong fit for you.
What You'll Do
- Manage technical operations and platform responsibilities, ensuring high availability of systems.
- Install, operate, and harden Kubernetes clusters in government data centers.
- Build and maintain GitOps pipelines using Argo CD/Flux, Helm/Kustomize, and Artifact Registries.
- Implement observability stacks (Prometheus, Grafana, Loki, OpenTelemetry) including SLA-compliant dashboards and alerting.
- Act as Incident Commander during on-site incidents and security-critical situations.
- Coordinate with IT Security, internal engineering teams, and legal entities.
- Conduct root cause analyses and prepare legally compliant incident reports.
- Ensure compliance with security, audit, and regulatory requirements, including network segmentation, TLS/mTLS, and vulnerability management.
Requirements
- 5-8 years of experience as an SRE/DevOps Engineer, preferably with on-call responsibilities.
- Deep knowledge of Kubernetes (on-prem/hybrid), GitOps, Helm/Kustomize, and automation tools like Ansible and Terraform.
- Proficiency in observability, incident response, and security architectures.
- Experience in regulated environments (public sector, finance, healthcare, etc.).
- Strong scripting skills (Bash, Python) and familiarity with CI/CD in restricted networks.
- Excellent knowledge of IAM, secrets management, TLS/mTLS, and SIEM integration.
Nice to Have
- Familiarity with governmental IT structures.
- Certifications such as CKA/CKAD, ISO 27001, CISSP, or GDPR Practitioner.
- Experience with digital evidence and logging systems.
What We Offer
- Remote-first work environment in Germany with regular team events in Berlin.
- Home office budget and top-notch equipment.
- 30 days of vacation for genuine relaxation.
- A mission-driven role with high societal relevance.
- Stable, long-term partnerships in the public sector.
- Access to modern tooling landscapes (Kubernetes, GitOps, observability).
- Field scholarships, high-quality equipment, and compliance training.
- A significant impact on the stability, security, and quality of critical systems.
If you are ready to take on operational responsibilities and be part of a team that ensures the stability, security, and auditability of critical infrastructures for government agencies, we look forward to your application!
This Site Reliability Engineer role offers a unique opportunity to work remotely while ensuring the stability and security of critical public sector systems. With a competitive salary and impactful work, it's a great fit for experienced engineers.
About ZABEL
Explore ZABEL careers in 2026 and discover exciting job opportunities across remote, hybrid, and office roles. Utilize our advanced filters to refine your search, track your applications effortlessly, and gain valuable insights about our company culture. Whether you're looking for entry-level positions or experienced roles, ZABEL offers a diverse range of career pathways tailored just for you. Start your journey today!