I'm an AI & ML Engineer specializing in production LLM backends, RAG pipelines, multi-agent systems, and reinforcement learning. B.Tech Mathematics & Computing student at PEC Chandigarh.
Mathematics & Computing @ PEC Chandigarh
I build production-grade ML systems — from async LLM backends and multi-agent pipelines to real-time RAG routers and CV pipelines. Passionate about the intersection of mathematical theory and robust software engineering.
Shortlisted from 60,000 applicants worldwide
Successfully cleared the competitive Week 1 cryptography and protocol development challenges. Contributing to open-source Bitcoin core developer tools and scaling infrastructure (LND/gRPC and Go).
Shortlisted from 60,000 applicants worldwide for the prestigious open-source program. Cleared Week 1 cryptographic and core protocol development challenges successfully. Engaging with Bitcoin protocol development, transaction building, and lightning network architectures.
Built and deployed production LLM backends using FastAPI, Celery, and Docker. Designed async task queuing with Celery + Redis for concurrent ML inference workloads. Specialised in lightweight, resource-efficient ML deployments via containerised model serving.
Built a production RAG pipeline with LangChain, ChromaDB, and Gemini featuring hybrid retrieval, multi-turn memory, and Fernet-encrypted user-isolated storage. Implemented JWT auth, rate limiting, Celery task queuing, STT/TTS voice support, async streaming, and a modular FastAPI backend. Designed and ran an LLM evaluation framework comparing Gemini models across empathy, cultural relevance, and safety metrics.
Built real-time CV + NLP pipelines for hazard classification and misinformation filtering. Implemented DBSCAN geospatial clustering with PostGIS for live disaster hotspot detection. Designed FastAPI + PostgreSQL + Cloudinary + Celery backend for async media processing and alert generation.
Built an end-to-end LLM pipeline converting raw financial datasets into structured visual reports. Resolved latency by optimising vector store queries and API call chains.
I got tired of copying and pasting preprocessing and model training loops for every new dataset, so I built SwiftPredict. It's a single-command AutoML library that handles imputation, encoding, and training end-to-end. I rewrote v2 to make the CLI deterministic and published it to PyPI.
pip install swiftpredict-v2My submission for the Google GenAI Hackathon 2025. I built a production-grade RAG pipeline from scratch. It features user-isolated database encryption using Fernet, multi-turn conversation memory, and streaming audio support. I also ran LLM evaluation scripts to benchmark Gemini's empathy metrics.
An adaptive RAG gateway proxy and query router service. Written in Golang (router proxy and semantic cache gateway) and Python (gRPC intelligence service classifying query complexity). Minimizes LLM usage costs and query latency.
I designed a multi-agent system where five specialized LangChain agents (Writer, Syntax Fixer, Optimizer, Doc Generator, and Reviewer) peer-review and optimize code. By orchestrating their communication using FastAPI, I observed a 40% improvement in complexity metrics.
To deeply understand GAN dynamics, I built a DCGAN from scratch using PyTorch. I ran into mode collapse early on, which I resolved by implementing one-sided label smoothing and mini-batch discrimination. I tracked FID scores over 200 epochs to measure visual quality.
I was the Team Lead and Backend/AI Engineer for this Smart India Hackathon 2025 project. We built a disaster response platform. I combined computer vision (OpenCV) and NLP pipelines for hazard classification, and mapped live disaster hotspots using DBSCAN spatial clustering with PostGIS.
SwiftPredict v2 is a pip-installable AutoML library that automates feature preprocessing, model search, training, and evaluation. Hover or click any node in the architectural flowchart below to inspect its script files and data flow.
A pipeline that coordinates automatic data preprocessing, scaling, encoding, and model cross-validation. Client interfaces communicate with the core library while FastAPI manages asynchronous metric logging to MongoDB.
Hover or click any node on the left to inspect file-level details.
Keiro is a self-hostable adaptive RAG gateway proxy. It coordinates rapid local cache matching via a Golang router and routes cache-misses to Python classification and semantic retrieval gRPC services.
A dual-language service orchestration. A high-concurrency Go proxy routing client requests through an LRU semantic cache, communicating via gRPC calls to a Python backend handling model-based routing and response synthesis.
Hover or click any node on the left to inspect file-level details.
Samudra Sachet is a real-time disaster monitoring and hazard alert system. Live media reports, Tweets, and citizen reports flow through a FastAPI gateway, store media on Cloudinary, run CNN and NLP validation pipelines, and perform DBSCAN hotspot clustering written to Firestore.
A real-time disaster reporting and alert dispatcher. Anonymous and authenticated citizens stream crisis reports to FastAPI, validating text urgency via GloVe regressors, scoring image panic metrics with PyTorch CNNs, and clustering incidents using DBSCAN to Firestore.
Hover or click any node on the left to inspect file-level details.
I'm actively looking for ML Engineering internships, AI backend roles, and research positions in RL and GenAI. If you're building something interesting, I'd love to chat.