9+ years engineering high-throughput backend platforms with Java, Kafka, and AWS. Now building LLM-powered AI agents and RAG applications that solve real business problems at production scale.
I'm a Lead Software Engineer and Principal Architect with deep expertise in distributed backend systems. Currently part of the founding team at Exly, a platform empowering creators in the Indian market.
My craft spans system design, cloud infrastructure, event-driven architecture, and increasingly, AI/LLM-powered applications. I've built Kafka-powered notification pipelines, serverless CDNs, and production RAG systems — always with reliability as the north star.
I believe the next wave of great architecture will have AI agents embedded at its core — and I'm building that future right now.
Combining deep backend expertise with LLM engineering to build AI agents and RAG applications that operate at production scale — not just demos.
I architect AI systems with the same rigor I bring to distributed backends: fault-tolerant pipelines, observability, cost optimization, and graceful degradation. Every AI feature I ship is backed by real infrastructure — Kafka for event ingestion, PostgreSQL for state, Redis for caching, and vector databases for semantic search.
Production RAG application ingesting Exly's entire knowledge base — help docs, FAQs, onboarding guides — into a vector store. Queries are embedded, matched via cosine similarity, and grounded answers are generated with citations. Deployed with guardrails against hallucination and fallback to human agents.
Multi-step AI agent framework chaining LLM calls with tool-use — database lookups, API calls, Slack notifications, email drafting. Built with retry logic, token budget management, and structured output parsing. Enables non-technical teams to automate complex support and ops workflows.
Backend powering all AI features — async document ingestion via Kafka, embedding pipeline with batched processing, vector index management, prompt versioning, and cost tracking per LLM call. Redis-backed caching of frequently-hit embeddings for horizontal scaling.
Event-driven notification infrastructure spanning 5 microservices with Kafka streaming.
1200x latency improvementGlobally distributed image delivery with CloudFront, Lambda@Edge, and S3 with edge caching.
50% faster page loadsHigh-throughput Email/WhatsApp campaign orchestration with millions of messages per batch.
75% faster executionStrategic migration to optimized AWS infrastructure with right-sized compute and cost governance.
60% cost savingsOpen to Principal Architect and Staff Engineer roles — especially at the intersection of distributed systems and AI engineering. Interested in hard problems, event-driven platforms, and teams building at scale. Based in India, open to relocation.
Fill out a quick form with your name, email, and message — I'll get back to you as soon as possible.
Contact Me→