An AI enthusiast driven to create intelligent solutions that address real-world challenges.An AI enthusiast driven to create intelligent solutions that address real-world challenges. I create cutting-edge AI applications and streamline complex workflows to deliver impactful, user-centered experiences.
Driven by curiosity and a love for problem-solving, I'm passionate about creating intelligent solutions that make a real difference in people's lives.
Exploring the frontiers of AI and technology to create meaningful impact
Deep learning, neural networks, and AI model optimization
Image processing, object detection, and visual AI systems
Explainable AI, cross-modal attribution, and counterfactual reasoning
Full-stack applications with modern frameworks and technologies
Big data processing, analytics pipelines, and cloud infrastructure
Creating solutions that bridge technology and real-world impact
A comprehensive overview of my technical skills and areas of expertise across the full stack
Whether you're building web applications, mobile apps, or AI-powered solutions, I'm equipped with the tools and knowledge to bring your vision to life. Technology evolves rapidly, and I'm committed to staying current with the latest trends and best practices. Currently diving deep into Explainable AI (XAI) with cross-modal attribution and counterfactual reasoning.
Advanced XAI Techniques
Cross-modal Attribution
Counterfactual Reasoning
My professional journey and key contributions across different roles
A showcase of my recent work and personal projects
Foresight is an open-source accessibility accelerator for visually impaired users, integrating ShareGPT4V for visual analysis, Detectron2 for object grounding, and Gemma 7B for improved accuracy. With speech-to-text and text-to-speech interaction, it reduces hallucinations and identifies 12% more useful information than baseline models, delivering reliable, context-aware assistance
AuthEZ is a passwordless authentication IDP as an SDK that eliminates vulnerabilities in traditional logins. It integrates SSO, Facenet-based facial verification with liveness checks, Resemblyzer voice embeddings, and RSA cryptographic signatures via QR codes for legacy devices, delivering secure and seamless access.
Mental health surveillance and assessment platform for teens and students. Features personalized music recommendations with Solfeggio frequencies and AI-powered mental health assistance.
Introducing StudentBuzz, a cutting-edge collaborative platform that empowers students to showcase their skills, projects, and accomplishments while fostering a vibrant community of learning and collaboration. With StudentBuzz, students can take their educational journey to the next level by connecting with like-minded peers, engaging in club activities, and collaborating with students from other colleges.
Security of parked cars against theft is a long existing concern. We present an automated way of detecting vehicle theft as it happens using moving object detection and barcode scanning for each parking entry. The detected edges of the output should give a clear image of the moving object from the video. The security personnel or the parking lot operator gets notified about the movement.
Innovative recruitment platform eliminating bias through faceless hiring processes, developed during Volvo Group internship. The platform included multi-level access controls across the admin and candidate portals, enhancing security and usability.
A visual showcase of my professional journey, achievements and awards.
Finalist in Unisys Innovation Program Y14 out of 700 global applicants for the project AuthEZ, a passwordless authentication IDP as an SDK that eliminates vulnerabilities in traditional logins
Professional certifications and published research showcasing continuous learning and thought leadership
Acquired knowledge of applying Enterprise Design Thinking and its value
Concepts and applications of Convolutional Neural Networks
Advanced proficiency in applications of ML algorithms and best practices
Supervised learning techniques for regression and classification problems
Professional-level expertise in Google Cloud Platform to perform Data and ML tasks
Workflow embodied in a mobile app that helps visually impaired individuals by combining vision, speech, and generative AI. It reduces AI hallucinations, improves accuracy, and even identifies 12% more information than baseline models.
Co-authors:
Owais Iqbal, Prajwal B Mehendarkar, Ridhiman Singh
AuthEZ is an Identity Provider SDK that combines facial verification, voice verification, and in-app authentication techniques to provide a secure and user-friendly authentication experience. In-app verification, suitable for legacy devices, employs public key cryptography and RSA digital signatures through QR codes. Developers can seamlessly integrate these services into web applications, enhancing user experience and data protection.
Co-authors:
Owais Iqbal, Prajwal B Mehendarkar, Ridhiman Singh
Let's work together to bring ideas to life
I'm always interested in new opportunities and exciting projects. Whether you have a question or just want to say hi, I look forward to hearing from you!