Cognizant logo
Site Reliability Engineer (L3 support)
full-time$65k - $117k

Summary

Salary

$65k - $117k

Type

full-time

Explore Jobs

About this role

We are seeking a highly motivated and technically proficient Site Reliability Engineer (L3 support) to join our team. The ideal candidate will be responsible for the stability, performance, and availability of our critical, GCP deployed applications. This role requires a strong blend of software development, systems administration, and operational expertise across GCP, Java/Spring Boot microservices, and container orchestration environments.

In this role, you will:

  • Provide expert-level (L3) support for complex, high-priority incidents, ensuring timely resolution and root cause analysis (RCA).
  • Participate in a 24/7 on-call rotation using PagerDuty to respond to and mitigate critical alerts and system issues.
  • Utilize JIRA Service Desk for tracking, prioritizing, and managing incident, problem, and service requests.
  • Troubleshoot and debug Java-based microservices built with Spring Boot and exposed via RestAPI. Analyze logs, trace transactions, and identify code-level issues.
  • Manage, monitor, and support applications deployed on Google Cloud Platform (GCP), specifically within GKE (Google Kubernetes Engine) and related container/serverless environments (e.g., Cloud Functions/Knative, often shortened as KF).
  • Maintain and support ETL/workflow jobs orchestrated by Apache Airflow.
  • Familiarity with Managed File Transfer (MFT) solutions like IBM Sterling MFT and concepts of secure file transfer (e.g., EDE/EDI) is required for supporting relevant data pipelines.
  • Implement and manage end-to-end monitoring using Observability/ELK (Elasticsearch, Logstash, Kibana) or similar platforms to ensure proactive alerting and operational visibility.
  • Use UpTrends or similar synthetic monitoring tools to validate end-user application performance and availability.
  • Strictly adhere to the formal Change Management process for all production deployments and modifications.
  • Plan, document, and participate in Disaster Recovery (DR) testing and execution to ensure business continuity.
  • Utilize Postman for API validation, testing, and troubleshooting integration issues.

Work model

We believe hybrid work is the way forward as we strive to provide flexibility wherever possible. Based on this role’s business requirements, this is a hybrid position requiring 2–3 days a week in a client or Cognizant office in Irving - TX. Regardless of your working arrangement, we are here to support a healthy work-life balance through our various wellbeing programs.

The working arrangements for this role are accurate as of the date of posting. This may change based on the project you’re engaged in, as well as business and client requirements. Rest assured; we will always be clear about role expectations.

Please note:      A few of our roles may require in-person interviews at Cognizant offices or client locations, depending on project or client needs.

What you need to have to be considered 

  • Minimum 8+ years of experience in a Production Support, SRE, or L3 Application Support role, supporting Cloud-Native environments.
  • Deep hands-on experience with Java and Spring Boot for developing or supporting production-grade microservices.
  • Proven experience supporting applications deployed on Google Cloud Platform (GCP), especially GKE (Kubernetes).
  • Strong knowledge of Linux operating systems and shell scripting.
  • Familiarity or experience with IBM Sterling MFT or other secure Managed File Transfer solutions.
  • Familiarity with incident management tools like JIRA and on-call rotation platforms like PagerDuty.
  • Experience with Change Management best practices and Disaster Recovery procedures.

Salary and Other Compensation:

The annual salary for this position is between $ 65,447 to $ 117,000 depending on experience and other qualifications of the successful candidate. This position is also eligible for Cognizant’s discretionary annual incentive program, based on performance and subject to the terms of Cognizant’s applicable plans.

Benefits: Cognizant offers the following benefits for this position, subject to applicable eligibility requirements :

  • Medical/Dental/Vision/Life Insurance
  • Paid holidays plus Paid Time Off
  • 401(k) plan and contributions
  • Long-term/Short-term Disability
  • Paid Parental Leave
  • Employee Stock Purchase Plan

Disclaimer: The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.

Other facts

Tech stack
Site Reliability Engineering,L3 Support,GCP,Java,Spring Boot,Microservices,Container Orchestration,Incident Management,JIRA,Apache Airflow,Managed File Transfer,Disaster Recovery,Observability,ELK,API Validation,Shell Scripting

About Cognizant

Cognizant (Nasdaq-100: CTSH) engineers modern businesses. We help our clients modernize technology, reimagine processes and transform experiences so they can stay ahead in our fast-changing world. Together, we’re improving everyday life. See how at www.cognizant.com or @cognizant.

Team size: 10,001+ employees
LinkedIn: Visit
Industry: IT Services and IT Consulting

What you'll do

  • The Site Reliability Engineer will provide expert-level support for complex incidents and ensure the stability and performance of GCP deployed applications. Responsibilities include troubleshooting Java-based microservices, managing applications on GCP, and participating in on-call rotations.

Ready to join Cognizant?

Take the next step in your career journey

Frequently Asked Questions

What does Cognizant pay for a Site Reliability Engineer (L3 support)?

Cognizant offers a competitive compensation package for the Site Reliability Engineer (L3 support) role. The salary range is USD 65k - 117k per year. Apply through Clera to learn more about the full compensation details.

What does a Site Reliability Engineer (L3 support) do at Cognizant?

As a Site Reliability Engineer (L3 support) at Cognizant, you will: the Site Reliability Engineer will provide expert-level support for complex incidents and ensure the stability and performance of GCP deployed applications. Responsibilities include troubleshooting Java-based microservices, managing applications on GCP, and participating in on-call rotations..

Why join Cognizant as a Site Reliability Engineer (L3 support)?

Cognizant is a leading IT Services and IT Consulting company. The Site Reliability Engineer (L3 support) role offers competitive compensation.

How do I apply for the Site Reliability Engineer (L3 support) position at Cognizant?

You can apply for the Site Reliability Engineer (L3 support) position at Cognizant directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Cognizant on their website.