✦ Software & AI Engineering ✦

Engineering scalable systems for
global data ecosystems.

Software Engineer focused on distributed systems, high-concurrency architectures, and AI-driven data pipelines. Experienced in building resilient, production-grade infrastructure at scale.

Based in India
Professional Track

Ontic Technologies

Software Engineer

Oct 2024 — Apr 2026

  • Designed and built high-throughput social intelligence platforms using Spring Boot & Python to orchestrate Kafka-based streaming pipelines.
  • Implemented a high-throughput serving architecture using gRPC, with TGI for in-house LLMs and Triton Inference Server for embeddings and inference.
  • Reduced inference latency through asynchronous FastAPI services and optimized batch-processing strategies.
  • Developed a scalable abstraction layer for vector databases enabling efficient semantic search and deduplication.
  • Benchmarked and set up the vector store (Qdrant), which stores multiple embeddings at a volume of about 20 million points.
  • Added monitoring and tracing using OpenTelemetry with Uptrace.

Infocusp Innovations

Software Engineer

Jul 2022 — Sep 2024

  • Architected event-driven data pipelines using Apache Beam and Pub/Sub for petabyte-scale operational metrics.
  • Designed cloud data models in Spanner, integrating BigQuery and Looker Studio for analytical dashboards.
  • Developed FreeRTOS firmware for smart wearables, including real-time signal processing and custom BLE GATT protocols.
  • Automated enterprise CI/CD workflows to enforce architectural consistency, linting, and automated testing benchmarks.
Engineering Builds
01 / HYBRID ARCHITECTURE

Trends

Designed a large-scale trend detection system processing ~20M records weekly. Pipeline includes embedding generation, clustering, summarization, and validation layers, enabling real-time insight extraction.

Spring Boot, Kafka, gRPC, Python
02 / CARTOON DETECTION

Cartoon Detection

Developed a high-performance image classification API optimized for concurrency. Leveraged asynchronous FastAPI and Triton batch inference, reducing infrastructure costs by ~75% while improving response latency.

FastAPI, AsyncIO, PyTorch, Batching
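
A minimal sketch of the request micro-batching idea behind the Cartoon Detection API, using only asyncio. It is an illustration under stated assumptions, not the production code: `MicroBatcher`, `fake_classifier`, and the batch size and wait values are hypothetical, and the stand-in classifier takes the place of a real batched Triton call.

```python
import asyncio
from typing import Any, Callable, List, Sequence


class MicroBatcher:
    """Groups concurrent requests so one batched model call serves many callers."""

    def __init__(
        self,
        batch_fn: Callable[[Sequence[Any]], Sequence[Any]],
        max_batch_size: int = 8,      # assumed value, tune per model
        max_wait_s: float = 0.01,     # assumed value, bounds added latency
    ) -> None:
        self._batch_fn = batch_fn
        self._max_batch_size = max_batch_size
        self._max_wait_s = max_wait_s
        self._queue: asyncio.Queue = asyncio.Queue()
        self.batch_sizes: List[int] = []  # recorded only for inspection

    async def submit(self, item: Any) -> Any:
        """Enqueue one request and wait for its individual result."""
        future = asyncio.get_running_loop().create_future()
        await self._queue.put((item, future))
        return await future

    async def run(self) -> None:
        """Worker loop: collect up to max_batch_size items, then call batch_fn once."""
        loop = asyncio.get_running_loop()
        while True:
            batch = [await self._queue.get()]
            deadline = loop.time() + self._max_wait_s
            while len(batch) < self._max_batch_size:
                remaining = deadline - loop.time()
                if remaining <= 0:
                    break
                try:
                    batch.append(await asyncio.wait_for(self._queue.get(), remaining))
                except asyncio.TimeoutError:
                    break
            items = [item for item, _ in batch]
            try:
                results = self._batch_fn(items)
            except Exception as exc:
                # Propagate the failure to every caller in this batch.
                for _, future in batch:
                    if not future.done():
                        future.set_exception(exc)
                continue
            self.batch_sizes.append(len(batch))
            for (_, future), result in zip(batch, results):
                if not future.done():
                    future.set_result(result)


def fake_classifier(batch: Sequence[int]) -> List[str]:
    """Hypothetical stand-in for a batched inference call."""
    return ["cartoon" if x % 2 == 0 else "photo" for x in batch]


async def demo():
    batcher = MicroBatcher(fake_classifier, max_batch_size=4, max_wait_s=0.05)
    worker = asyncio.create_task(batcher.run())
    results = await asyncio.gather(*(batcher.submit(i) for i in range(8)))
    worker.cancel()
    return results, batcher.batch_sizes


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

In a FastAPI service, each request handler would await `submit(...)` while a single background task runs `run()`. The wait window trades a small, bounded delay per request for fewer and larger model calls.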
03 / SEARCH & VECTOR

Semantic Deduplication

Engineered a vector-based deduplication engine using Qdrant, enabling high-speed similarity search and scalable content filtering with low-latency performance.

Qdrant, Vector Search, Similarity Search
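
A minimal sketch of threshold-based semantic deduplication, the general technique behind the project above. It makes several assumptions: the brute-force loop stands in for the nearest-neighbour search that a vector store such as Qdrant would perform, and `deduplicate`, `cosine_similarity`, and the 0.95 threshold are hypothetical names and values chosen for illustration.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def deduplicate(
    embeddings: Sequence[Sequence[float]],
    threshold: float = 0.95,  # assumed cut-off, not taken from the project
) -> List[int]:
    """Return indices of items to keep.

    An item is dropped when its similarity to any already-kept item
    reaches the threshold, so the first occurrence of each group wins.
    """
    kept: List[int] = []
    for i, vector in enumerate(embeddings):
        if all(cosine_similarity(vector, embeddings[j]) < threshold for j in kept):
            kept.append(i)
    return kept


if __name__ == "__main__":
    vectors = [[1.0, 0.0], [0.99, 0.05], [0.0, 1.0], [0.02, 1.0]]
    print(deduplicate(vectors))
```

The loop compares each item against every kept item, which is quadratic in the worst case. At millions of points, an approximate nearest-neighbour index replaces that inner comparison.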
04 / FIRMWARE

Health Firmware

Built embedded firmware for health devices on nRF52832 using FreeRTOS. Implemented power-efficient scheduling and custom BLE services for real-time health data synchronization.

C, FreeRTOS, nRF52832, BLE
Expertise

Backend

  • Python
  • Java / Spring Boot
  • FastAPI / gRPC
  • REST Architecture
  • Uptrace
  • MongoDB

Infrastructure

  • Apache Kafka / Pub/Sub
  • GCP (Spanner, BigQuery)
  • Apache Beam / Dataflow
  • Docker / CI-CD

Specialized

  • Vector Databases (Qdrant)
  • Columnar Databases
  • LLM & Embedding servers
  • FreeRTOS / C++
  • BLE Protocol Stack