Reliability Engineer (Remote)
Full-time, Tallahassee

Summary

Location

Tallahassee

Type

full-time

About this role

Role Specific Information

Job Description

About the Role

As a Reliability Engineer, you will ensure the resilience and availability of Kohl’s systems and applications. You will collaborate closely with development teams to review designs, conduct risk assessments and implement robust monitoring and failover mechanisms.

What You’ll Do

  • Drive incident response efforts, perform root cause analysis and implement preventative measures to enhance system reliability

  • Establish consistent practices that elevate Kohl’s operational excellence through automation and process improvements

  • Follow the software lifecycle and drive reliability, observability and efficiency across product teams within an assigned domain

  • Identify repeated toil and find opportunities for automation and risk reduction

  • Participate in an on-call rotation to respond to production incidents, and conduct blameless retrospectives and root-cause analyses (RCAs) to drive a culture of continuous improvement

  • Proactively identify failures before they cause outages by using chaos engineering techniques, exploring edge cases and failure modes, and reviewing designs

  • Advise on capacity planning and provide continuous assessments of system behavior and resource consumption

  • Work with product managers to identify and prioritize work on reliability best practices (e.g., leveraging SLIs, SLOs and error budgets)

  • Additional tasks may be assigned

What Skills You Have

Required

  • Bachelor's Degree or equivalent in MIS, Computer Science or related field

  • 2+ years of experience in software development

  • Strong programming skills in one or more languages (Java, Python, Go or Node.js)

  • Working knowledge of systems architecture, operating system internals and network fundamentals 

  • Experience working with at least one cloud platform (e.g., GCP, AWS or Azure)

Preferred

  • Experience with monitoring techniques and tools (e.g., CloudWatch, Grafana, Prometheus, OpenTelemetry, Tracing) 

  • Working knowledge of containerization and container orchestration (e.g., Docker, Kubernetes, Rancher)

Essential Functions

The requirements listed below are representative of functions you will be required to perform; however, you may be required to perform additional functions. Kohl’s may revise this job description at any time. To perform this job successfully, you must be able to perform each essential function satisfactorily. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions, absent undue hardship.

  • Ability to perform the accountabilities listed in the “What You’ll Do” Section

  • Ability to comply with dress code requirements

  • Basic math and reading skills, legible handwriting, and basic computer operation

  • Ability to maintain prompt and regular attendance and meet scheduling requirements as set by the company

  • Ability to learn and comply with all company policies, procedures, standards and guidelines

  • Ability to give direction and to receive, understand and proactively respond to direction from leadership and other company personnel

  • Ability to work as part of a team and interact effectively and appropriately with others

  • Ability to maintain composure and work in a fast-paced environment while accomplishing multiple tasks within established timeframes

  • Ability to satisfactorily complete company training programs

  • Ability to use a personal computer for tasks such as communicating, preparing reports, etc.

  • Ability to plan, prioritize and monitor activities across business units

  • Ability to complete or oversee the completion of assigned projects in a timely manner

Other facts

Tech stack
Reliability Engineering, Incident Response, Root Cause Analysis, Automation, Monitoring, Cloud Platforms, Programming, Systems Architecture, Containerization, Kubernetes, Docker, Capacity Planning, Chaos Engineering, Observability, Process Improvements, Software Development

About Kohl's

Kohl’s is a leading omnichannel retailer with more than 1,100 stores in 49 states.

Kohl's business is built on a solid foundation of more than 60 million customers, an unmatched brand portfolio, industry-leading loyalty and Kohl's Card programs, a convenient and accessible nationwide store footprint, and a large digital business on Kohls.com and the Kohl's mobile app.

Team size: 10,001+ employees
Industry: Retail
Founding Year: 1962

Frequently Asked Questions

What does a Reliability Engineer (Remote) do at Kohl's?

As a Reliability Engineer (Remote) at Kohl's, you will ensure the resilience and availability of Kohl’s systems and applications while collaborating with development teams. You will drive incident response efforts, perform root cause analysis, and implement preventative measures to enhance system reliability.

Why join Kohl's as a Reliability Engineer (Remote)?

Kohl's is a leading retail company.

Is the Reliability Engineer (Remote) position at Kohl's remote?

The Reliability Engineer (Remote) position at Kohl's is based in Tallahassee, Florida, United States. Contact the company through Clera for specific work arrangement details.

How do I apply for the Reliability Engineer (Remote) position at Kohl's?

You can apply for the Reliability Engineer (Remote) position at Kohl's directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Kohl's on their website.