Software Engineer specialising in AI-powered platforms, low-latency microservices, and large-scale data pipelines. Proven track record of deploying LLM-driven features, vector search, and real-time processing systems using Python, Spring Boot, Kafka, and GCP.
Large-scale trend detection system processing ~20M records weekly. Pipeline spans embedding generation, clustering, LLM summarisation, and Guardrails validation — streamed via Kafka into Spring Boot for real-time insight extraction.
High-performance image classification API optimised for concurrency. Leveraged asynchronous FastAPI and Triton batch inference to cut infrastructure costs by ~75% while substantially reducing response latency under load.
Vector-based deduplication engine built on Qdrant, delivering similarity search and scalable content filtering at 400 ms average latency across a corpus of 20M+ embedded points.
Embedded firmware for health wearables on nRF52832 using FreeRTOS. Implemented power-efficient scheduling, real-time accelerometer signal processing, and custom BLE GATT services for live health data synchronisation.
Pandit Deendayal Energy University — graduated with a strong academic record, building foundations in systems programming, algorithms, and embedded electronics that directly inform production engineering work today.