MSD logo
Manager, Scientific Data Engineering
full-timeIndia

Summary

Location

India

Type

full-time

Explore Jobs

About this role

Job Description

Manager, Scientific Data Engineering

The Opportunity

  • Based in Hyderabad, join a global healthcare biopharma company and be part of a 130-year legacy of success backed by ethical integrity, forward momentum, and an inspiring mission to achieve new milestones in global healthcare.
  • Be part of an organisation driven by digital technology and data-backed approaches that support a diversified portfolio of prescription medicines, vaccines, and animal health products.
  • Drive innovation and execution excellence. Join a team that is passionate about using data, analytics, and insights to drive decision-making and create custom software, allowing us to tackle some of the world's greatest health threats.

Our Technology Centers focus on creating a space where teams can come together to deliver business solutions that save and improve lives. An integral part of our company's IT operating model, Tech Centers are globally distributed locations where each IT division has employees to enable our digital transformation journey and drive business outcomes. These locations, in addition to the other sites, are essential to supporting our business and strategy.

A focused group of leaders in each Tech Center helps ensure we can manage and improve each location, from investing in the growth, success, and well-being of our people to making sure colleagues from each IT division feel a sense of belonging, to managing critical emergencies. Together, we must leverage the strength of our team to collaborate globally to optimize connections and share best practices across the Tech Centers.

Role Overview

  • Design, develop, and maintain data pipelines to extract data from various sources and populate a data lake and data warehouse. 
  • Work closely with data scientists, analysts, and business teams to understand data requirements and deliver solutions aligned with business goals.
  • Build and maintain platforms that support data ingestion, transformation, and orchestration across various data sources, both internal and external.
  • Use data orchestration, logging, and monitoring tools to build resilient pipelines. 
  • Automate data flows and pipeline monitoring to ensure scalability, performance, and resilience of the platform.
  • Monitor, troubleshoot, and resolve issues related to the data integration platform, ensuring uptime and reliability.
  • Maintain thorough documentation for integration processes, configurations, and code to ensure easy onboarding for new team members and future scalability.
  • Develop pipelines to ingest data into cloud data warehouses.
  • Establish, modify and maintain data structures and associated components.
  • Create and deliver standard reports in accordance with stakeholder needs and conforming to agreed standards.
  • Work within a matrix organizational structure, reporting to both the functional manager and the project manager.
  • Participate in project planning, execution, and delivery, ensuring alignment with both functional and project goals.

What should you have

  • Bachelor’s degree in information technology, Computer Science or any Technology stream.
  • 3+ years of developing data pipelines & data infrastructure, ideally within a drug development or life sciences context.
  • Demonstrated expertise in delivering large-scale information management technology solutions encompassing data integration and self-service analytics enablement.
  • Experienced in software/data engineering practices (including versioning, release management, deployment of datasets, agile & related software tools).
  • Ability to design, build and unit test applications on Spark framework on Python.
  • Build PySpark based applications for both batch and streaming requirements, which will require in-depth knowledge on Databricks/ Hadoop.
  • Experience working with storage frameworks like Delta Lake/ Iceberg
  • Experience working with MPP Datawarehouse’s like Redshift
  • Cloud-native, ideally AWS certified.
  • Strong working knowledge of at least one Reporting/Insight generation technology
  • Good interpersonal and communication skills (verbal and written).
  • Proven record of delivering high-quality results.
  • Product and customer-centric approach.
  • Innovative thinking, experimental mindset. 

Mandatory Skills

Skill Category

Skills

Foundational Data Concepts

SQL (Intermediate / Advanced)

Python (Intermediate)

Cloud Fundamentals (AWS Focus)

AWS Console, IAM roles, regions, concept of cloud computing

AWS S3

Data Processing & Transformation

Apache Spark (Concepts & Usage)

Databricks (Platform Usage), Unity Catalog, Delta Lake

ETL & Orchestration

AWS Glue (ETL, Catalog), Lambda

Apache Airflow (DAGs and Orchestration)
or other orchestration tool

dbt (Data Build Tool)

Matillion (or similar ETL tool)

Data Storage & Querying

Amazon Redshift / Azure Synapse

Trino / Equivalent

AWS Athena / Query Federation

Data Quality & Governance

Data Quality Concepts / Implementation

Data Observability Concepts

Collibra / equivalent tool

Real-time / Streaming

Apache Kafka (Concepts & Usage)

DevOps & Automation

CI / CD concepts, Pipelines
(GitHub Actions / Jenkins / Azure DevOps)

Our technology teams operate as business partners, proposing ideas and innovative solutions that enable new organizational capabilities. We collaborate internationally to deliver services and solutions that help everyone be more productive and enable innovation.

Who we are:

We are known organization Rahway, New Jersey, USA in the United States and Canada and MSD everywhere else. For more than a century, we have been bringing forward medicines and vaccines for many of the world's most challenging diseases. Today, our company continues to be at the forefront of research to deliver innovative health solutions and advance the prevention and treatment of diseases that threaten people and animals around the world.

What we look for:

Imagine getting up in the morning for a job as important as helping to save and improve lives around the world. Here, you have that opportunity. You can put your empathy, creativity, digital mastery, or scientific genius to work in collaboration with a diverse group of colleagues who pursue and bring hope to countless people who are battling some of the most challenging diseases of our time. Our team is constantly evolving, so if you are among the intellectually curious, join us—and start making your impact today.

#HYDIT2026

Required Skills:

Business Intelligence (BI), Database Administration, Data Engineering, Data Management, Data Modeling, Data Visualization, Design Applications, Information Management, Software Development, Software Development Life Cycle (SDLC), System Designs

Preferred Skills:

Current Employees apply HERE

Current Contingent Workers apply HERE

Search Firm Representatives Please Read Carefully 
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company.  No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails. 

Employee Status:

Regular

Relocation:

VISA Sponsorship:

Travel Requirements:

Flexible Work Arrangements:

Hybrid

Shift:

Valid Driving License:

Hazardous Material(s):

Job Posting End Date:

01/30/2026

*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.

Other facts

Tech stack
Data Engineering,Data Integration,Python,Apache Spark,AWS,ETL,Data Warehousing,Data Quality,DevOps,Data Processing,Data Transformation,Data Storage,Data Visualization,Agile,Collibra,Apache Airflow

About MSD

At MSD, known as Merck & Co., Inc., Rahway, NJ, USA in the United States and Canada, we are unified around our purpose: We use the power of leading-edge science to save and improve lives around the world. For more than 130 years, we have brought hope to humanity through the development of important medicines and vaccines. We aspire to be the premier research-intensive biopharmaceutical company in the world – and today, we are at the forefront of research to deliver innovative health solutions that advance the prevention and treatment of diseases in people and animals. We foster a diverse and inclusive global workforce and operate responsibly every day to enable a safe, sustainable and healthy future for all people and communities. For more information, visit www.msd.com and connect with us on Facebook, Instagram, Twitter, and YouTube.

Team size: 10,001+ employees
LinkedIn: Visit
Industry: Pharmaceutical Manufacturing

What you'll do

  • Design, develop, and maintain data pipelines to extract data from various sources and populate a data lake and data warehouse. Work closely with data scientists, analysts, and business teams to understand data requirements and deliver solutions aligned with business goals.

Ready to join MSD?

Take the next step in your career journey

Frequently Asked Questions

What does a Manager, Scientific Data Engineering do at MSD?

As a Manager, Scientific Data Engineering at MSD, you will: design, develop, and maintain data pipelines to extract data from various sources and populate a data lake and data warehouse. Work closely with data scientists, analysts, and business teams to understand data requirements and deliver solutions aligned with business goals..

Why join MSD as a Manager, Scientific Data Engineering?

MSD is a leading Pharmaceutical Manufacturing company.

Is the Manager, Scientific Data Engineering position at MSD remote?

The Manager, Scientific Data Engineering position at MSD is based in India, India. Contact the company through Clera for specific work arrangement details.

How do I apply for the Manager, Scientific Data Engineering position at MSD?

You can apply for the Manager, Scientific Data Engineering position at MSD directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about MSD on their website.