Non-linear frequency warping using constant-Q transformation for speech emotion recognition
In: 2021 International Conference on Computer Communication and Informatics (ICCCI-2021), Jan 2021, Coimbatore, India. ⟨10.1109/ICCCI50826.2021.9402569⟩; (2021-01-27)
In this work, we explore the constant-Q transform (CQT) for speech emotion recognition (SER). The CQT-based time-frequency analysis provides variable spectro-temporal resolution, with higher frequency resolution at lower frequencies. Since the lower-frequency regions of the speech signal contain more emotion-related information than the higher-frequency regions, the increased low-frequency resolution of the CQT makes it more promising for SER than the standard short-time Fourier transform (STFT). We present a comparative analysis of short-term acoustic features based on the STFT and the CQT for SER, with a deep neural network (DNN) as the back-end classifier. We optimize different parameters for both features. The CQT-based features outperform the STFT-based spectral features in SER experiments. Further experiments with cross-corpora evaluation demonstrate that the CQT-based systems generalize better when trained on out-of-domain data.
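The resolution trade-off described in the abstract can be illustrated with a minimal sketch based on the textbook CQT definition, in which bin centers are geometrically spaced and every bin has the same ratio Q of center frequency to bandwidth. The function names and all parameter values below (`f_min`, `bins_per_octave`, `n_bins`, `fs`, `n_fft`) are illustrative assumptions; this record does not state the settings the authors optimized.

```python
import numpy as np


def cqt_bin_layout(f_min=32.7, bins_per_octave=12, n_bins=84, fs=16000):
    """Return center frequencies, bandwidths and window lengths of CQT bins.

    All default values are assumptions chosen for illustration only.
    """
    k = np.arange(n_bins)
    # Geometric spacing: each octave holds `bins_per_octave` bins.
    centers = f_min * 2.0 ** (k / bins_per_octave)
    # Constant quality factor shared by all bins.
    q = 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)
    # Bandwidth grows in proportion to the center frequency.
    bandwidths = centers / q
    # Analysis window (in samples) shrinks as the center frequency grows.
    window_lengths = np.ceil(q * fs / centers).astype(int)
    return centers, bandwidths, window_lengths, q


def stft_bin_spacing(n_fft=512, fs=16000):
    """Uniform frequency spacing (Hz) between adjacent STFT bins."""
    return fs / n_fft


centers, bandwidths, window_lengths, q = cqt_bin_layout()
stft_spacing = stft_bin_spacing()

print("CQT bandwidth at lowest bin (Hz):", bandwidths[0])
print("CQT bandwidth at highest bin (Hz):", bandwidths[-1])
print("STFT bin spacing (Hz):", stft_spacing)
```

Under these assumed settings, the lowest CQT bins are narrower than the uniform STFT bins and the highest ones are wider, at the cost of longer analysis windows at low frequencies.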
Comment: Accepted for publication in 2021 IEEE International Conference on Computer Communication and Informatics (IEEE ICCCI 2021)
Title: | Non-linear frequency warping using constant-Q transformation for speech emotion recognition |
---|---|
Author / Contributor: | Saha, Goutam ; Sahidullah ; Singh, Premjeet ; Indian Institute of Technology Kharagpur (IIT Kharagpur) ; Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH) ; Inria Nancy - Grand Est ; Institut National de Recherche en Informatique et en Automatique (Inria) ; Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD) ; Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA) ; Université de Lorraine (UL) ; Centre National de la Recherche Scientifique (CNRS) |
Source: | 2021 International Conference on Computer Communication and Informatics (ICCCI-2021), Jan 2021, Coimbatore, India. ⟨10.1109/ICCCI50826.2021.9402569⟩; (2021-01-27) |
Published: | HAL CCSD, 2021 |
Media type: | unknown |