Anugunj Naman

I am a graduate student at Purdue University with a focus on machine learning. My prior role was as an Software Engineer at Elevance Health, where I collaborated with the Responsible AI and XAI Team on enhancing fairness, explainability, and addressing data drift in machine learning applications.

I earned a Bachelor of Technology degree in Computer Science and Engineering from the Indian Institute of Information Technology, Guwahati in 2022.

Previously, I served as a Research Intern at NVIDIA, focusing on speech recognition, meta-learning, and few-shot learning. Additionally, I was a Research Intern at the AI-ML-NLP Lab at IIT Patna, working on multimodal learning.

I have contributed to the open-source HuggingFace Transformers library, including two vision transformers: Microsoft's CvT: Introducing Convolutions to Vision Transformers and Meta's LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference.

Email / CV / Google Scholar / Twitter / Github

Research

I have a keen interest in applied machine learning, particularly in the area of multimodal learning. I am also dedicated to developing efficient and robust machine learning models. Over the past year, I have focused on developing sampling methods for large language models (LLMs).

Indic Languages Automatic Speech Recognition using Meta-Learning Approach
Anugunj Naman, Kumari Deepshikha,
Proceedings of The Fourth International Conference on Natural Language and Speech Processing (ICNLSP), Association for Computational Linguistics (ACL), 2021

Speech to Text conversion of different Indic Languages using Meta Learning.

A Multimodal Author Profiling System for Tweets
Chanchal Suman, Anugunj Naman, Sriparna Saha, Pushpak Bhattacharyya
IEEE Transactions on Computational Social Systems, 2021

Author profiling for gender prediction of anonymous twitter users.

Fixed-MAML for Few Shot Classification in Multilingual Speech Emotion Recognition
Anugunj Naman, Chetan Sinha, Liliana Mancini
Proceedings of Machine Intelligence and Smart Systems (MISS), Springer, 2021

Speech Emotion Recognition of different languages using Meta Learning.

Projects

PATH: Pavement Assessment Tracking and Health
Anugunj Naman, Ashish Goswami Chetan Sinha, Harshit Dixit, Rutuja Patole, Siddharth Rajawat,
Smart India Hackathon, 2020

A Deep Learning Solution for Automation of Pavement Condition Index.

Glasses: A Computer Vision Library for Image Classification, Object Detection and Segmentation
Francesco Zuppichini, Anugunj Naman
IN PROGRESS

A compact, concise, and customizable deep learning library. This library currently supports deep learning models for computer vision. Glasses is a model toolbox to make it easier for everybody to use, learn and share deep learning models.

Template from Jon Barron's website.