SRE & Platform Engineering Leader | 14+ Years in Cloud-Native Systems | Building Reliable, Scalable, and Observable Infrastructure | AWS, GCP, OCI, Kubernetes Certified
With 14+ years of experience in site reliability engineering, platform development, and DevOps leadership, I am a results-driven and impact-focused Senior Manager currently leading global SRE and Platform teams at JPMC/WePay. I specialize in building and scaling highly available, fault-tolerant, cloud-native infrastructure across Google Cloud, AWS, and Oracle Cloud, with deep expertise in Kubernetes, Python, Java, and observability tooling.
Throughout my career, I’ve led globally distributed teams across the US, UK, and India, driving cross-functional initiatives that enhanced service reliability, improved developer velocity, and reduced operational toil. I’ve successfully introduced automation at scale, streamlined incident response, and implemented service mesh, Terraform modules, and zero-downtime deployment strategies that significantly improved uptime, cost-efficiency, and deployment confidence. I take pride in fostering a culture of continuous improvement, mentorship, and collaboration, while aligning technical outcomes with business goals.
Certified as an AWS Solutions Architect and Kubernetes Administrator (CKA), I’ve contributed to large-scale infrastructure transformations in high-stakes environments, including FinTech, Open Banking, and Public Cloud. I’ve partnered with leadership to drive platform modernization, compliance automation, and product observability—developing tools such as a CVE scanner, business action monitoring, and compliance readiness automation that have directly boosted team productivity and customer satisfaction.
Whether it’s leading 24/7 operational support, automating critical pipelines, or driving post-incident analysis and metrics improvement, I bring a blend of technical depth, leadership, and a customer-first mindset to every initiative. I’m passionate about building reliable systems, empowering engineers, and delivering meaningful impact at scale.
This profile is based on publicly available information. Jagdeep is not affiliated with or endorsed by Clera.
Envestnet | Yodlee · Full-time
As an SRE, my role is to ensure 100% system reliability. To keep the system up and running effectively, I regularly perform the following activities:
· Mentoring, guiding, and leading a small team across 3 time zones and helping them ramp up quickly to expectations (MG).
· Reported to the Sr. Director and helped improve observability, reliability, scalability, and team scaling.
· Prioritizing and planning tasks and projects for the team to maximize the impact of the work done.
· Improving service observability around the four golden SRE signals.
· Built up team capability by drawing on the skill sets of network engineers, programmers, and systems administrators to improve troubleshooting, resulting in high site availability.
· Working with design engineering to propose architectural changes and foster communication between organizational units.
· Measuring, maintaining, and enforcing SLIs, SLOs, and SLAs.
· Analyzing system performance to fine-tune application performance and, in turn, API response times.
· Setting and adjusting production system configuration, and standardizing troubleshooting procedures.
· Standardizing and maintaining internal technical documentation (e.g. wiki, runbooks).
· Implementing and monitoring key metrics for analysis.
· Performing root cause analyses and corrective actions to improve site availability and integrity.
· Working directly with the Operations Control Center to define best practices for escalations and incident documentation.
· Writing automation and monitoring scripts and setting up alerts in various monitoring tools, so that preventive steps are taken early and major customer impact is avoided.
· Serving as a subject matter expert for reliability and troubleshooting.
· Providing 24x7 on-call rotational shift support.
Grade: 74
Activities and societies:
* Participated in various group discussions and debates
* Member of technical societies in college
* Participated in the Talent Hunt for Haryana
* Participated in neighborhood clean-up activities
* Organized a technical fest in college
* Coursework included the following subjects:
  - Data Structures
  - Database Management Systems
  - Java
  - Software Engineering
  - Operating System Concepts
  - Algorithm Analysis & Design
  - Discrete Mathematics
* Maintained a good academic record in college
* Appreciated by many lecturers for good scores and technical performances
* Participated in many technical events
* Participated in social activities such as 'Help Yourself' and 'Light The Dark', and in various other groups
* Organized a technical and cultural festival in college