We just announced our $3M Pre-Seed. Watch our — launch video.
Machine Learning Engineer | M.S in Machine Learning | Ex SE @CGI
With a passion for machine learning modeling, math and software engineering.
Spent most of time in graduate school dealing with research papers and understanding ML.
Personal website: https://pachipulusu.vercel.app/
Programming Languages: Python, R, C, C++, Scala
Cloud: AWS (S3, EC2, Lambda), Snowflake
Tools and OS: PowerBI, Tableau, Jupyter Notebook, VSCode, WEKA, RStudio, Databricks, Git, Github Actions, Mac OS, Windows, Linux
Chat with Clera and we'll introduce you to the right opportunities.
This profile is based on publicly available information. Sai Chaitanya is not affiliated with or endorsed by Clera. Privacy policy.
• Developed scalable data pipelines and reporting solutions with APIs, Tableau/PowerBI interactive dashboards, and Python (Pandas, NumPy) automation, cutting reporting timelines by 86% and enabling real-time analytics in high-volume environments • Performed statistical analyses with K-means clustering, seasonality modeling with time-series techniques (ARIMA), and A/B testing in Python (Scikit-learn, Statsmodels), increasing creator participation by 13% and optimizing engagement strategies • Created Tableau/PowerBI dashboards with ML insights (XGBoost) reducing reporting time 35%; maintained KPI dashboards via SQL/Python automation for metric tracking and decisions in Snowflake • Optimized ETL pipelines using SQL, Snowflake warehousing, Airflow orchestration, and ML anomaly detection (Autoencoders), improving data quality 30% for cloud initiatives in AWS Redshift • Partnered with 3 engineers to define requirements and build SQL/DBT pipelines with PySpark, boosting transformation speed 30%; developed SQL/PySpark/DBT workflows with ML preprocessing (Pandas/NumPy) for AWS Redshift analytics • Led A/B testing frameworks with SQL and Python (Statsmodels, SciPy) to evaluate feature impacts, resulting in 15% uplift in user retention; mentored junior analysts on experimental design in Agile environments
Community Dreams Foundation · Full-time
â—¦ Architected an HR tool using BERT, RoBERTa, and Sentence-BERT embeddings to match resumes with job descriptions, cutting manual screening by 89% from 10 hours to 1 hour per job opening and speeding up the hiring by 40% through context-aware ranking â—¦ Built a cloud-native pipeline with Python, FastAPI, and Kubeflow on Kubernetes for automated interview scheduling, achieving 92% candidate selection precision measured in pilot with 50 companies â—¦ Automated rejection emails with sentiment-aware templates (VADER score >0.6), handling 200+ weekly communications and reducing admin work by 90% from 15 hours/week to 1.5 hours/week while ensuring empathetic, bias-free communication â—¦ Automated the process of sending personalized rejection emails, reducing time spent by 90% â—¦ Designed a RAG chatbot with Mistral on AWS SageMaker, achieving 90% response relevance as measured by BLEU and ROUGE-L scores against human answers on a benchmark set of 500 historical support cases, reducing ticket escalations by 25%
◦ Developed a real-time ingest layer using Kafka Connect to capture 8 sensor data streams (400 events/second) from factory equipment, reducing data availability lag from overnight batch to <5 minutes for maintenance teams ◦ Wrote and maintained 15+ Python ETL scripts to process daily Shell refinery data files (CSV, JSON) into a centralized SQL data warehouse, enabling dashboard KPIs previously unavailable to operating teams ◦ Led migration of 52 legacy servers to AWS EC2 (t3.xlarge instances) using Terraform, reducing monthly costs by $18k and ensuring 99.9% uptime over 6-month period ◦ Redesigned Databricks medallion architecture by implementing automatic schema validation, data quality checks, and incremental processing patterns, reducing job failures from 12 weekly incidents to 3 (75% decrease) and cutting average recovery time from 4 hours to 45 minutes for Shell’s refinery sensor data lake ◦ Achievement: Awarded Best Employee for Q4 2021 for exceptional contributions to project efficiency and innovation, leading 30% of the overall data migration effort
ImbueDesk ENS Pvt Ltd · Full-time
â—¦ Developed facial expression recognition system using OpenCV, TensorFlow achieving 97% accuracy on FER2013 dataset â—¦ Designed an image processing pipeline with Tesseract OCR for vehicle ID recognition (50k plates/day), orchestrated with Kubernetes â—¦ Created and deployed predictive maintenance dashboards with Python visualization tools on AWS Beanstalk that reduced equipment downtime by 28% â—¦ Built a Kafka-based image processing pipeline processing 35MB/hour, reducing processing latency from 1.2s to 0.4s per image with a 3-node consumer topology. Implemented back-pressure handling for peak traffic periods (7AM-9AM) when processing volume increased by 300%
Grade: 3.9
Claim it to keep it up to date, or request removal. We're happy to help either way.