
Akshansh .
Senior Data Engineer | Scalable Data Infrastructure | Streaming Pipelines | ML-Driven Systems | AWS, Kafka, Data Lake,Python,SQL
Updated 7 months ago
11+
Years Experience
7
Roles
56
Skills
2
Education
About
Lead Data Engineer | Scalable ETL Pipelines | Real-Time Streaming | ML-Driven Systems | LLM Ops AWS, Kafka, Python, SQL Results-driven Lead Data and AI Platform with expertise in scalable data architectures, real-time ML pipelines, and processing structured and unstructured data (geospatial, image, video). Proven ability to collaborate cross-functionally, leveraging expertise in Kafka, Spark, and cloud technologies to enhance data-driven decision-making. Eager to engage in cutting-edge projects that foster innovation and excellence in technology. Key Expertise & Achievements Scalable ETL & Data Transformation Built high-throughput ETL pipelines using AWS Glue, PySpark, and Airflow, reducing batch processing times by 40%. Designed metadata-driven ingestion frameworks to handle structured, semi-structured, and unstructured data at scale. Automated data validation, deduplication, and schema evolution, ensuring data quality and accuracy. Real-Time Streaming & Analytics Developed low-latency streaming pipelines using Apache Flink, Kafka Streams, and Kinesis, reducing event processing lag by 50%. Implemented real-time aggregations and transformations using KSQL and Flink SQL, enabling sub-second insights. Optimized Kafka partitioning, compression, and retention strategies, reducing storage costs by 25%. Data Lake Architecture & Governance Built multi-layered data lakes using Delta Lake, Iceberg, and Hudi, ensuring efficient storage and ACID compliance. Implemented data lineage tracking with AWS Glue Catalog and Apache Atlas for better discoverability and governance. Developed fine-grained access control policies using AWS Lake Formation and IAM, ensuring data security and compliance. Machine Learning Data Pipelines Designed feature engineering pipelines for predictive analytics, improving forecasting model accuracy by 20%. Integrated ML workflows with Spark ML and SageMaker, optimizing model training and inference at scale. Developed recommendation systems using graph-based ML, increasing user engagement. Performance Optimization & Cost Reduction Reduced ETL processing time by 30% with distributed computing and optimized SQL transformations. Implemented data partitioning, clustering, and indexing, cutting query execution times by 50%. Reduced compute and storage costs by 20% through efficient resource allocation and query tuning. I am passionate about building future-ready data ecosystems, enabling real-time insights, machine learning, and high-scale analytics to drive business impact.
See Related Jobs
Based on skills & location
Get AI Resume
Generate a polished resume
Salary Benchmark
What does a Senior Data Engineer earn?
Experience (7 roles)
Senior Data Engineer
CurrentLed the design and implementation of the Unified PubSub Client (PSC) to optimize data pipeline efficiency. The PubSub systems (e.g., Kafka) improved scalability, reliability, and developer velocity, reducing dependencies between client applications and PubSub services. Built a robust and secure paym...
3 roles · Jun 2020 - May 2022
Data Platform Engineer
Built and scaled real-time data aggregation pipelines using Apache Flink and Apache Kafka, efficiently supporting a 10x growth in data volume while ensuring low-latency and high-throughput data processing. Designed and implemented machine learning forecasting models using Python, enabling accurate p...
Data Engineer
Designed and implemented a real-time aggregation framework for datasets, including user levels, organizational hierarchies, roles, channels, and memberships, improving data processing efficiency and scalability. Enhanced intermediate data processing by leveraging Kafka tools like KSQL and KStreams, ...
Data Engineering Intern
Developed a forecasting framework to predict key organizational metrics, such as message volume and active users, for future time periods (months or years). Built models to forecast executive metrics and enable strategic planning for organizational performance. Designed a classification system to ca...
Graduate Research Assistant
USC Viterbi School of Engineering
Specialized in spatial-visual indexing of multimedia data, developing efficient geo-spatial data structures for faster and scalable search functionality. Designed and implemented a geo-spatial indexing system using R-trees* to optimize multimedia data storage and retrieval. Extracted visual and spat...
Is this your profile, Akshansh?
Claim it to keep it updated or request removal.
Education (2)
University of Southern California
2019 - 2020
Skills: Databases · Apache Spark Streaming
Maharaja Agrasen Institute Of Technology, Delhi
2012 - 2016
Skills (56)
Backend
Data Engineering
DevOps
MLOps
Other
Certifications (8)▼
Generative AI Application Development
Databricks
R Programming
Coursera Course Certificates
Programming for Everybody (Python)
Coursera
Introduction to R
DataCamp
Kaggle R Tutorial on Machine Learning
DataCamp
Coursera Mentor Community and Training Course
Coursera Course Certificates
ML Operations
Databricks
Data Analysis and Statistical Inference
DataCamp
Publications (6)▼
Spatial Aggregation of Visual Features for Big Image Data Search
IEEE BigMM 2019 · Jan 1, 2019
Boomerang: Rebounding the Consequences of Reputation Feedback on Crowdsourcing Platforms
ACM UIST 2016
Classification and Fraud Detection in Finance Industry
IJCA(International Journal of Computer Applications) · Oct 18, 2017
Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms
CSCW · Feb 25, 2017
Investigating the "Wisdom of Crowds" at Scale
ACM UIST · Nov 1, 2015
The Daemo Crowdsourcing Marketplace
CSCW 2017
Languages (2)▼
Volunteer Experience (3)▼
Student Mentor
CurrentUSC Viterbi School of Engineering
Jun 2019 - Present · 6 yrs 3 mos
Mentor
Coursera
Jun 2016 - Jul 2018 · 2 yrs 2 mos
Volunteer
Child Rights and You
Jun 2015 - Jun 2018 · 3 yrs 1 mo
Frequently Asked Questions
What is Akshansh .'s current role?▼
Where did Akshansh . study?▼
What skills does Akshansh . have?▼
Where is Akshansh . based?▼
Related Jobs
View all jobs →Senior Account Executive | Mid-Market (m/w/d)
Superchat
Full Stack Software Engineer
Talk Dog Inc
Staff Software Engineer, Product
Lunar
Founding Engineer (Full-Stack)
Uplane
Founding AI Engineer
Uplane
(Senior) Associate, Legal AI (m/f/d)
bayshore
Other Profiles
Browse all →
Germán Pineda
Principal Software Engineer at Full-time · 3 yrs 1 mo
United States
Dessie DiMino
Software Engineer at Turing · Contract
United States
Thomas Fu
Software Engineer | TLM at Google Research · Full-time
United States

Nidhi P.
Product Manager | Bridging Data, Strategy & User-Centered Design
United States
Mikka Pineda
Freelance Software Engineer at Self-employed
United States
Looking for your next role?
Chat with Clera to discover job matches, salary insights, and get a polished AI-generated resume.
Chat with CleraThis profile is based on publicly available information. Akshansh is not affiliated with or endorsed by Clera. Privacy Policy