Data Center Incident Program Manager - Remote
About the Role
OpenAI is seeking a dedicated Data Center Incident Program Manager to join our team remotely from the United States. In this role, you will oversee incident management processes within our data centers, ensuring operational excellence and swift resolution of incidents. You will play a crucial role in maintaining the integrity and reliability of our AI systems.
What You'll Do
- Lead incident management processes and ensure timely resolution of issues.
- Collaborate with cross-functional teams to enhance incident response strategies.
- Develop and implement best practices for incident management in data centers.
- Monitor incident trends and provide insights for continuous improvement.
- Train and mentor team members on incident management protocols.
Requirements
- 3-5 years of experience in incident management or a related field.
- Strong understanding of data center operations and incident response.
- Excellent communication and leadership skills.
- Experience with incident management tools and methodologies.
- Ability to work effectively in a remote environment.
Nice to Have
- Experience in the AI or tech industry.
- Familiarity with cloud services and infrastructure.
- Project management certification.
What We Offer
- Competitive salary ranging from $125,600 to $228,000.
- Flexible remote work environment.
- Opportunities for professional growth and development.
- Health, dental, and vision insurance.
- Generous paid time off and holidays.
This role offers a unique opportunity to manage incident processes at a leading AI company, with a competitive salary and remote flexibility.
Who Will Succeed Here
- Proficient in incident management frameworks such as ITIL or COBIT, with experience using tools like ServiceNow to track and resolve incidents in data center operations.
- Strong project management skills, demonstrated through experience with Agile methodologies, that enable efficient handling of multiple incidents and projects concurrently in a remote environment.
- A proactive problem solver focused on continuous improvement, capable of analyzing incident trends and implementing strategic changes to enhance operational reliability.