About the Role
We are seeking a Site Reliability Engineer (SRE) End User Services to join our dynamic team at Leidos. This remote SRE role focuses on ensuring the reliability, performance, and scalability of complex distributed systems. As part of the NMCI Service Management Integration and Transport (SMIT) group, you will play a crucial role in supporting the Navy-Marine Corps Intranet, which includes cybersecurity services, network operations, and data transport.
What You'll Do
- Proactively manage incidents using metrics and tools like Aternity to monitor end-user performance and identify potential issues.
- Lead software deployment planning, coordination, and execution across end-user devices, ensuring minimal disruption.
- Analyze service performance metrics to identify areas for improvement and advocate for automation and best practices.
- Define and maintain a product vision and roadmap for End User Services, translating business requirements into actionable features.
- Engage with stakeholders to create user stories and acceptance criteria that communicate their needs effectively.
- Document product requirements, progress, and updates for stakeholders, ensuring clear communication.
- Utilize scripting languages like PowerShell or Python for automation to improve site performance and reliability.
- Collaborate with engineering and operations teams to implement automated solutions for incident prevention.
Requirements
- Bachelor’s degree with 5+ years of relevant experience in Site Reliability Engineering.
- Active DoD Secret security clearance is mandatory.
- Experience with proactive incident management using performance monitoring tools.
- Strong leadership skills in managing software deployments and end-user services.
- Exceptional communication skills, both written and oral, including technical analysis and executive-level briefings.
- Hands-on experience with Agile and DevSecOps concepts.
- Proficiency in scripting languages for automation.
- Familiarity with ITIL processes and service quality improvement methodologies.
Nice to Have
- Certified Scrum Product Owner (CSPO) certification.
- ITILv4 and Agile SAFe certifications.
- Previous experience with NGEN-NMCI or similar programs.
- Advanced vendor certifications (e.g., Azure, Aternity).
- Experience with Risk Management Framework (RMF) and DISA STIGs.
What We Offer
- Competitive salary ranging from $92,300 to $166,850.
- Comprehensive health and wellness programs.
- Income protection and paid leave.
- Retirement plans and additional benefits.
- Opportunities for professional development and growth.
- Flexible remote work environment.
- Engaging company culture focused on innovation and disruption.
- Commitment to diversity and inclusion in the workplace.
This remote Site Reliability Engineer role at Leidos offers a competitive salary and the opportunity to work on innovative projects that impact national security.
Generating success profile...
Analyzing job requirements and market data
Loading market overview...
Analyzing market trends and skill demands
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months