Software Engineer specializing in AI-powered platforms, low-latency microservices, and large-scale data pipelines. Proven track record deploying LLM-driven features, vector search, and real-time processing systems using Python, Spring Boot, Kafka, and GCP.
Large-scale trend detection system processing ~20M records weekly to extract trending stories from unstructured data. Pipeline spans embedding generation, clustering, LLM summarisation, and Guardrails validation — streamed via Kafka into Spring Boot for real-time insight extraction.
High-performance image classification API optimised for concurrency, built for cartoon image detection. Leveraged asynchronous FastAPI and Triton batch inference to cut infrastructure costs by ~75% and reduce latency by 75% under load.
Vector-based deduplication engine using Qdrant — delivering high-speed similarity search and scalable content filtering at 400ms average latency across a corpus of 20M+ embedded points. Reduced duplicate content by ~20% and powers natural language-driven universal search, replacing legacy Elasticsearch.
Embedded firmware for health-focused smartwatches on nRF52832 using FreeRTOS. Implemented power-efficient scheduling, real-time accelerometer signal processing, and custom BLE GATT services for live health data synchronisation.
Pandit Deendayal Energy University — graduated with a strong academic record, building foundations in systems programming, algorithms, and embedded electronics that directly inform production engineering work today.