Sanofi logo
Incident Management Reliability Engineer
full-timeHyderabad

Summary

Location

Hyderabad

Type

full-time

Explore Jobs

About this role

About the job 

Our Team: 

Service Quality cultivates a culture of service excellence where quality is more than a benchmark – it's a shared purpose. Through synergistic collaboration, advanced monitoring, and empathetic customer advocacy, we strive to elevate every interaction and transform challenges into opportunities for growth. 

 

Main responsibilities: 

The Incident Management Reliability Engineer is responsible for ensuring the stability, resilience, and reliability of critical IT services. This role combines strong incident management expertise with reliability engineering principles to minimize disruptions, drive rapid recovery from major incidents, and continuously improve system performance and availability. 

 

  • Incident Management 

  • Lead the end-to-end management of Major Incidents (P1/P2), ensuring timely resolution and effective stakeholder communication. 

  • Act as command centre lead during critical outages, coordinating across technical and business teams. 

  • Ensure accurate and detailed incident documentation, including root cause, timeline and resolution steps. 

  • Drive post-incident-reviews and ensure action items are implemented to prevent recurrence. 

  • Maintain consistent communication and escalation processes aligned with ITSM best practices (e.g. ITIL) 

  • Reliability Engineering 

  • Collaborate with service owners and platform teams to enhance service reliability, observability, and fault tolerance. 

  • Implement proactive monitoring, alerting, and automated recovery mechanisms. 

  • Analyse incident trends and develop reliability improvement plans. 

  • Participate in capacity planning, change reviews, and failure mode analysis to anticipate and mitigate risks. 

  • Develop and track SLOs/SLIs/SLAs to measure service health and performance. 

  • Continuous Improvemen

  • Partner with problem management to identify recurring issues and lead root cause elimination initiatives. 

  • Automate operational tasks and enhance service recovery using scripts, runbooks, and AIOps tools. 

  • Contribute to the evolution of the Major Incident Process, ensuring best practices are embedded across the organization. 

  • Key Performance Indicators 

  • Mean Time to Resolve (MTTR) and Mean Time to Detect (MTTD). 

  • Reduction in number and impact of recurring incidents. 

  • Adherence to SLA/SLO targets. 

  • Completion rate of post-incident actions. 

  • Stakeholder satisfaction and transparency during incidents. 

 

About you 

  • Experience

  • 10+ years' experience. 

  • Preferred Certifications: 

  • ITIL v4 or Service Operations certification. 

  • SRE Foundation / Practitioner certification. 

  • Cloud certifications (AWS, Azure, or GCP). 

  • Incident Command System (ICS) or equivalent leadership training in crisis response. 

  • Soft skills

  • Communication (verbal and written). 

  • Technical skills

  • Virtualization 

  • Cloud Technologies 

  • Database 

  • Networking 

  • Containerization 

  • Automation 

  • Middleware/Scheduling 

  • Infrastructure as code 

  • Languages:  

  • English 

 

Pursue progress, discover extraordinary 

 

Better is out there. Better medications, better outcomes, better science. But progress doesn’t happen without people – people from different backgrounds, in different locations, doing different roles, all united by one thing: a desire to make miracles happen. So, let’s be those people.  

At Sanofi, we provide equal opportunities to all regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, or gender identity.  

Watch our ALL IN video and check out our Diversity Equity and Inclusion actions at sanofi.com!  

Pursue progress, discover extraordinary

Better is out there. Better medications, better outcomes, better science. But progress doesn’t happen without people – people from different backgrounds, in different locations, doing different roles, all united by one thing: a desire to make miracles happen. So, let’s be those people.

At Sanofi, we provide equal opportunities to all regardless of race, colour, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, ability or gender identity.

Watch our ALL IN video and check out our Diversity Equity and Inclusion actions at sanofi.com!

Other facts

Tech stack
Incident Management,Reliability Engineering,Monitoring,Automation,Cloud Technologies,Database,Networking,Containerization,Infrastructure As Code,Communication,Problem Management,SLOs,SLIs,SLAs,AIOps,Capacity Planning

About Sanofi

Headquartered in Paris, France, it is the fifth largest pharmaceutical company in the world [3] and is dedicated to the research, development, production and marketing of pharmaceutical products in seven main areas: cardiovascular disease, thrombosis, oncology, diabetes, central nervous system, internal medicine and vaccines.

Team size: 10,001+ employees
LinkedIn: Visit
Industry: Pharmaceutical Manufacturing

What you'll do

  • The Incident Management Reliability Engineer is responsible for ensuring the stability, resilience, and reliability of critical IT services. This includes leading the management of major incidents and collaborating with teams to enhance service reliability and performance.

Ready to join Sanofi?

Take the next step in your career journey

Frequently Asked Questions

What does a Incident Management Reliability Engineer do at Sanofi?

As a Incident Management Reliability Engineer at Sanofi, you will: the Incident Management Reliability Engineer is responsible for ensuring the stability, resilience, and reliability of critical IT services. This includes leading the management of major incidents and collaborating with teams to enhance service reliability and performance..

Why join Sanofi as a Incident Management Reliability Engineer?

Sanofi is a leading Pharmaceutical Manufacturing company.

Is the Incident Management Reliability Engineer position at Sanofi remote?

The Incident Management Reliability Engineer position at Sanofi is based in Hyderabad, India. Contact the company through Clera for specific work arrangement details.

How do I apply for the Incident Management Reliability Engineer position at Sanofi?

You can apply for the Incident Management Reliability Engineer position at Sanofi directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Sanofi on their website.