M.Tech in Computer Science & Engineering | IIT Kanpur (2022-2024)
B.Tech in Computer Science & Business Systems | Sister Nivedita University (2018-2022)
Building real-time voice AI systems with TTS, ASR, and streaming pipelines
Developed LLM applications, RAG systems, multi-language translators, and speech processing pipelines
π DieTraAudio Processing Pipeline Denoising + Diarization + Transcription using DeepFilterNet, NVIDIA NeMo, and Mistral Voxtral. π― 10-12GB VRAM vs traditional 80GB+ |
7 Attention Mechanisms From vanilla to Flash & Multi-Latent Attention with complexity analysis. π Deep dive into modern LLM architectures |
|
AI-powered MoM Generator Whisper + GPT-3.5 + PyAnnote diarization with React frontend. β‘ Auto-generates professional meeting minutes |
π€ Intelligent ChatBotRAG-based Conversational AI Context-aware chatbot with document retrieval and LangChain. π¬ Multi-turn conversations with memory |
π¨ More Projects
- π¬ CrewAI YouTube Blogger - Multi-agent content creation
- π YOLOv8 Traffic Measurement - 97.5% mAP vehicle detection
- π Text Summarization MLOps - Pegasus with MLOps
- π₯ YouTube to Notes - Gemini-Pro summarization
- π· Wine Quality ML - Complete pipeline with MLflow & AWS
- π€ Face Recognition Attendance - Real-time tracking
- π£οΈ Voice Assistant - AI chatbot with audio capabilities
Last updated: January 06, 2026 at 01:27 UTC
πΌ Open to Voice AI, ML/DL roles and research collaborations
π§ [email protected]
"Building real-time AI systems, one model at a time" ποΈβ¨


