Gaurav Acharya
Building data systems that scale.
MS in Computer Science candidate focused on data engineering, AI systems, and backend infrastructure.
I design real-time pipelines, machine learning workflows, and production-oriented backend systems using Python, Kafka, Spark, Docker, SQL, and modern data tooling.

Work
Featured Projects
Data Engineering
Real-Time Ad Analytics Platform
A distributed analytics platform designed for ingesting, processing, and reporting user and advertisement event data with near real-time visibility.
View Case Study →
Backend Systems
Online Voting System
A secure web application for election workflows, voter participation, and controlled ballot handling across a structured backend system.
View Case Study →
Computer Vision
Dog Breed Prediction System
A computer vision application for dog breed classification using deep learning, image preprocessing, and prediction delivery through an application interface.
View Case Study →
Expertise
Data Engineering & System Design
Focused on building scalable data pipelines, real-time systems, and production-grade backend infrastructure.
Data Engineering
Backend Systems
Databases & Storage
Cloud & DevOps
AI / ML Systems
Background
Experience
Data Analyst
Jun 2022 – Jul 2024 · Merkle Inc.
- Built automated data pipelines and reporting workflows using Python and SQL, reducing manual reporting effort by 70%
- Analyzed 10,000+ survey responses using Pandas and advanced Excel to generate actionable insights, improving decision-making by 15%
- Developed interactive dashboards in Tableau and Power BI for real-time tracking of campaign performance
- Integrated APIs (Qualtrics, Decipher, Confirmit) to streamline data collection and processing workflows
- Implemented version-controlled workflows using Git and Docker, reducing production errors by 40%
Master's in Computer Science
2024 – 2026 · Illinois Institute of Technology
- Focused on Data Engineering, Distributed Systems, and Machine Learning
- Built real-time data pipelines using Kafka and PySpark for scalable event processing
- Developed machine learning systems for forecasting and classification using TensorFlow and Scikit-learn
- Worked on backend systems and APIs using Flask and modern cloud tooling
Credentials
Certifications
Certifications reinforcing my foundation in data analytics, engineering systems, and applied problem solving.

Google Data Analytics Professional Certificate
Google · 2024

Meta Data Analyst Professional Certificate
Meta · 2024

IBM Data Analyst Professional Certificate
IBM · 2024

IBM Data Engineer Professional Certificate
IBM · 2025
GitHub
Selected Projects
real-time-ad-analytics
Kafka + PySpark pipeline for real-time event ingestion, processing, and scalable analytics.
Online-Voting-System
Backend-driven voting system with authentication, vote handling, and result processing.
Lie-Detection-System
ML classification pipeline using feature engineering and behavioral data for prediction.
Dog-Breed-Prediction-System
CNN-based image classification system with Flask API for real-time predictions.
Contact
Let’s Connect
I’m actively interested in opportunities across data engineering, backend systems, AI infrastructure, and real-time analytics.