Sagar Dutta

Post Doctoral Fellow, MadhavLab, Indian Institute of Technology Kanpur

prof_pic.jpg

Department of Electrical Engineering

Indian Institute of Technology Kanpur

Kanpur, Uttar Pradesh, India

I received my Ph.D. degree in Electronics and Communication Engineering from the National Institute of Technology Silchar, India in 2022. My Ph.D thesis is titled “Pattern Recognition Based on Microwave Signal Using Artificial Intelligence” where I worked on pattern recognition based on antenna’s reactive’s field for applications like Human Activity Recognition (HAR) and Machine Health Monitoring.

As a postdoctorate, I joined The Machine Analysis of Data for Human Audition and Visualization (MADHAV) Lab in the Department of Electrical Engineering at the Indian Institute of Technology Kanpur. The group’s research interests lie at the intersection of the theory and application of machine learning with a focus on Machine Learning for Audio Signal Processing. As a trained western classical guitarist, my research focus naturally shifted towards the application of machine learning in audio signal processing and music information retrieval.

During my postdoctorate at MADHAV LAB, I developed and researched on Audio-based Recommendation System for India’s National Broadcaster, Prasar Bharati, and I played a leading role in handling the deployment of the AI model for Prasar Bharati’s multimedia for efficient search and recommendation.

Currently I am pursuing my second postdoctorate at the RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Norway

Research Interest:

My research interest lies at the intersection of machine learning, audio signal processing, and music information retrieval, with a particular focus on multimodal representation learning. I dedicate my efforts to formulating learning-based methodologies predominantly for the processing of audio content, encompassing both musical elements and a variety of other sounds, but not limited to these modalities.

Currently, I am engaged in research on the synchronization of multimodal representations, specifically focusing on integrating motion capture data with music. My work aims to explore and develop applications for this synchronization.

news

Jun 1, 2023 Paper titled “AudioNet: Supervised Deep Hashing for Retrieval and Ranking of Audio Events” submitted in IEEE Transactions on Audio, Speech and Language Processing

selected publications

  1. Classification of lower limb activities based on discrete wavelet transform using on-body creeping wave propagation
    Sagar Dutta, Banani Basu, and Fazal Ahmed Talukdar
    IEEE Transactions on Instrumentation and Measurement, 2020
  2. Classification of induction motor fault and imbalance based on vibration signal using single antenna’s reactive near field
    Sagar Dutta, Banani Basu, and Fazal Ahmed Talukdar
    IEEE Transactions on Instrumentation and Measurement, 2021
  3. Classification of motor faults based on transmission coefficient and reflection coefficient of omni-directional antenna using DCNN
    Sagar Dutta, Banani Basu, and Fazal Ahmed Talukdar
    Expert Systems with Applications, 2022
  4. Cascaded neural network based small array synthesis with robustness to noise
    Sagar Dutta, Banani Basu, and Fazal Ahmed Talukdar
    International Journal of RF and Microwave Computer-Aided Engineering, 2021
  5. Classification of scattering parameters of body-embedded wideband textile antennas for early diagnosis and monitoring of breast cancer
    Nirmalya Das, Banani Basu, Sagar Dutta, and 1 more author
    International Journal of Microwave and Wireless Technologies, 2023