We are looking for a Senior Cloud & Platform Engineer to design, automate, and manage scalable cloud and hybrid infrastructure environments. This role is ideal for a technically strong individual contributor who takes full ownership of platform reliability, performance, and security. You will work closely with engineering, QA, and IT teams to build robust CI/CD pipelines, automate operational workflows, and ensure seamless deployment of applications across AWS and on-prem environments.
Key Responsibilities
Architect, deploy, and manage scalable AWS cloud infrastructure and hybrid setups combining on-prem and cloud resources.
Implement Infrastructure-as-Code (IaC) using tools such as Terraform, CloudFormation, or equivalent frameworks.
Automate provisioning, configuration, and operational processes using Python, Bash, and Ansible.
Build, maintain, and optimize CI/CD pipelines with tools like GitHub Actions, Jenkins, ArgoCD, or similar.
Manage containerized environments, including Docker-based workloads and Kubernetes orchestration.
Implement robust monitoring, logging, and alerting systems to ensure high platform availability and performance.
Own production systems, lead incident management, perform root cause analysis, and implement preventive measures.
Apply DevSecOps best practices including secure deployments, secrets management, and access controls.
Collaborate with cross-functional teams to improve system stability, reliability, and developer productivity.
Drive continuous improvement initiatives for automation, infrastructure performance, and operational efficiency.
Required Skills & Experience
8+ years of experience in DevOps, Platform Engineering, or Site Reliability Engineering (SRE).
Strong expertise with AWS services, Kubernetes, and Infrastructure-as-Code tools (Terraform, CloudFormation).
Hands-on Linux system administration experience (Ubuntu, CentOS, Rocky Linux).
Proficient in scripting and automation using Python and Bash.
Experience with monitoring, logging, and observability tools such as Prometheus, Grafana, ELK, Datadog, or CloudWatch.
Proven ability to manage hybrid infrastructure environments combining cloud and on-prem resources.
Preferred Skills & Competencies
Designing security-focused architectures (DevSecOps, Zero Trust, compliance-driven environments).
Experience operating large-scale, distributed, and high-availability systems.
Strong problem-solving mindset with a proactive and ownership-driven approach.
Excellent collaboration skills across engineering, QA, and IT teams.
Key Skills
DevOps & Platform Engineering
Cloud Infrastructure (AWS & Hybrid)
Linux Server Administration
CI/CD Pipeline Design & Automation
Kubernetes & Container Orchestration
Infrastructure as Code (Terraform, CloudFormation)
Monitoring & Observability