UPS logo
GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD
full-timeIndia

Summary

Location

India

Type

full-time

Explore Jobs

About this role

Before you apply to a job, select your language preference from the options available at the top right of this page.

Explore your next opportunity at a Fortune Global 500 organization. Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better every day. We know what it takes to lead UPS into tomorrow—people with a unique combination of skill + passion. If you have the qualities and drive to lead yourself or teams, there are roles ready to cultivate your skills and take you to the next level.

Job Description:

Job Summary:

We are seeking a highly skilled GCP Infrastructure Engineer to design, build, and manage the cloud infrastructure that powers Generative AI (GenAI) applications at scale. In this role, you will leverage Google Cloud Platform (GCP) Vertex AI, IBM Watsonx, and containerization technologies such as Docker and Kubernetes (GKE) to deliver secure, scalable, and high-performance AI solutions. You will own the end-to-end infrastructure lifecycle — from design and provisioning to automation, monitoring, and optimization — while enabling data scientists and ML engineers to seamlessly deploy and operate GenAI workloads.

Key Responsibilities:

Cloud Infrastructure & Platform Engineering

  • Design, provision, and maintain scalable, secure, and cost-efficient infrastructure for GenAI applications on GCP.

  • Deploy and manage containerized workloads using Docker and Kubernetes (GKE).

  • Configure and optimize Vertex AI and IBM Watsonx platforms for training, fine-tuning, and serving LLMs and other generative models.

  • Implement high-performance GPU/TPU clusters to support distributed training and large-scale inference.

  • Ensure business continuity through backup, disaster recovery, and multi-region deployments.

Automation & Reliability

  • Develop and maintain Infrastructure as Code (IaC) templates with Terraform, or Cloud Deployment Manager.

  • Adopt GitOps practices (Flux) for infrastructure lifecycle management.

  • Build and optimize CI/CD pipelines for data pipelines, model workflows, and GenAI applications.

  • Apply SRE principles (SLIs, SLOs, SLAs) to guarantee platform reliability and uptime.

Security, Governance & Compliance

  • Embed DevSecOps best practices across the infrastructure lifecycle, including policy-as-code, vulnerability scanning, and secrets management.

  • Enforce identity and access management (IAM), network segmentation, and data encryption in compliance with standards (HIPAA, SOX, GDPR, FedRAMP).

  • Collaborate with enterprise security and compliance teams to implement governance frameworks for GenAI platforms.

Monitoring, Observability & Cost Optimization

  • Implement observability stacks (Prometheus, Grafana, Cloud Monitoring, Datadog) for both infra health and ML-specific metrics (model drift, data anomalies).

  • Define KPIs to monitor system health, performance, and adoption across AI workloads.

  • Optimize cloud cost efficiency for GPU/TPU-intensive workloads using autoscaling, preemptible instances, and utilization monitoring.

Collaboration & Enablement

  • Partner with data scientists, ML engineers, and software teams to streamline GenAI application development and deployment.

  • Provide onboarding, documentation, and reusable templates to enable faster adoption of AI infrastructure.

  • Stay current with the latest advancements in GenAI, cloud-native infrastructure, and container orchestration.

Required Qualifications

Education

Bachelor's or master’s degree in computer science, Software Engineering, or a related field.

Experience

  • 5+ years of experience in cloud infrastructure engineering, DevOps, or platform engineering.

  • Experience with GenAI use cases (chatbots, content generation, code assistants, etc.).

  • Strong hands-on expertise with Google Cloud Platform (GCP), especially Vertex AI.

  • Experience with IBM Watsonx for AI application deployment and management.

  • Proven skills in Docker, Kubernetes (GKE), and container orchestration at scale.

  • Proficiency in Python, Bash, or other relevant scripting languages.

  • Strong understanding of cloud networking, IAM, and security best practices.

  • Experience with CI/CD tools (GitHub Actions, GitLab CI, Jenkins) and IaC tools (Terraform, Pulumi, Ansible, Deployment Manager).

  • Familiarity with data pipelines and integration tools (Dataflow, Apache Beam, Pub/Sub, Kafka).

  • Excellent problem-solving, debugging, and communication skills.

Preferred Experience

  • Experience in MLOps practices for model deployment, monitoring, and retraining.

  • Exposure to multi-cloud or hybrid cloud environments (GCP, AWS, Azure, on-prem).

  • Hands-on experience with feature stores (Vertex AI Feature Store, Feast) and ML observability tools (EvidentlyAI, Fiddler).

  • Knowledge of distributed training frameworks (Horovod, DeepSpeed, PyTorch Distributed).

  • Contributions to open-source projects in infrastructure, MLOps, or GenAI.

  • Experience managing infrastructure in regulated industries.

Preferred Certifications:

  • Google Cloud Certified - Professional Cloud Architect

  • Google Cloud Certified - Machine Learning Engineer

  • Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)

  • IBM Certified Watsonx Generative AI Engineer – Associate

  • IBM Certified Solution Architect - Cloud Pak for Data

  • Other relevant certifications in AI, Machine Learning, or Cloud-Native technologies.


Employee Type:
 

Permanent


UPS is committed to providing a workplace free of discrimination, harassment, and retaliation.

Other facts

Tech stack
GCP,Terraform,Python,Bash,GKE,CI/CD,Docker,Kubernetes,Vertex AI,IBM Watsonx,DevSecOps,IaC,MLOps,Cloud Networking,Security Best Practices,Monitoring

About UPS

Operating in more than 200 countries and territories, we’re committed to moving our world forward by delivering what matters. Beginning as a small messenger service, UPS was started by two enterprising teenagers and a $100 loan. Now, we’re almost 500,000 UPSers strong, with operations around the globe.

As a transportation and logistics leader, we are proud to offer innovative solutions to our customers—both big and small. We also support the communities we serve. Just take a look at The UPS Foundation’s social impact report!

Headquartered in Atlanta, we can be found on the web at ups.com and about.ups.com. Job seekers can visit upsjobs.com to learn more. Our active social media channels include Facebook, Instagram, Twitter, YouTube, and TikTok.

Facebook: www.facebook.com/ups
Instagram: www.instagram.com/ups/
Twitter: www.twitter.com/ups
TikTok: UPS
YouTube: www.youtube.com/ups

Website
https://about.ups.com/
The UPS Foundation’s social impact report:
https://about.ups.com/us/en/social-impact/reporting/the-ups-foundations-social-impact-report.html
Career Site
upsjobs.com

Team size: 10,001+ employees
LinkedIn: Visit
Industry: Truck Transportation
Founding Year: 1907

What you'll do

  • The GCP Infrastructure Engineer will design, build, and manage cloud infrastructure for Generative AI applications, ensuring scalability, security, and performance. Responsibilities include deploying containerized workloads, implementing automation, and collaborating with data scientists and ML engineers.

Ready to join UPS?

Take the next step in your career journey

Frequently Asked Questions

What does a GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD do at UPS?

As a GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD at UPS, you will: the GCP Infrastructure Engineer will design, build, and manage cloud infrastructure for Generative AI applications, ensuring scalability, security, and performance. Responsibilities include deploying containerized workloads, implementing automation, and collaborating with data scientists and ML engineers..

Why join UPS as a GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD?

UPS is a leading Truck Transportation company.

Is the GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD position at UPS remote?

The GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD position at UPS is based in India, India. Contact the company through Clera for specific work arrangement details.

How do I apply for the GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD position at UPS?

You can apply for the GCP Infrastructure Engineer - Google Cloud, Terraform, Python, Bash, GKE, CI/CD position at UPS directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about UPS on their website.