SIR Beam Selector for Amazon Echo Devices Audio Front-End
In: 2019 IEEE International Workshop on Signal Processing Systems (SiPS), 2019-10-01
The Audio Front-End (AFE) is a key component in mitigating acoustic environmental challenges for far-field automatic speech recognition (ASR) on the Amazon Echo family of products. A critical component of the AFE is the Beam Selector, which identifies the beam that points to the target user. In this paper, we propose a new Signal-to-Interference Ratio (SIR) beam selector that uses subband-based signal-to-interference ratios to learn the locations of the audio sources and thereby improve beam selection accuracy for multi-microphone AFE systems. We analyze the performance of the SIR beam selector and compare it with a classic beam selector on datasets collected under various conditions. The method is shown to simultaneously decrease word error rate (WER) for speech recognition by up to 46.20% and improve barge-in performance, measured by false rejection rate (FRR), by up to 39.18%.
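The abstract does not give the selection rule itself, so the following is only a minimal sketch of what a subband-SIR-based beam choice could look like, not the authors' method. It assumes that per-beam, per-subband signal and interference power estimates are already available, and that the beam with the highest mean SIR (in dB) across subbands is chosen. The function names `subband_sir_db` and `select_beam`, the averaging rule, and the toy power values are all hypothetical.

```python
import math


def subband_sir_db(signal_power, interference_power, eps=1e-12):
    """Per-subband SIR in dB for one beam (eps guards against division by zero)."""
    return [
        10.0 * math.log10((s + eps) / (i + eps))
        for s, i in zip(signal_power, interference_power)
    ]


def select_beam(signal_powers, interference_powers):
    """Return (index of the beam with the highest mean subband SIR, per-beam scores).

    Assumption: averaging the dB values over subbands is an illustrative
    scoring rule, not the rule described in the paper.
    """
    scores = []
    for s_beam, i_beam in zip(signal_powers, interference_powers):
        sir = subband_sir_db(s_beam, i_beam)
        scores.append(sum(sir) / len(sir))
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Toy data (made up): 3 beams x 3 subbands of power estimates.
signal_powers = [[1.0, 1.0, 1.0], [4.0, 4.0, 4.0], [2.0, 2.0, 2.0]]
interference_powers = [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0], [2.0, 2.0, 2.0]]

best_beam, beam_scores = select_beam(signal_powers, interference_powers)
print(best_beam, [round(s, 2) for s in beam_scores])
```

In this toy input only the second beam has more signal than interference power, so it is the one selected; a real system would also need to estimate the powers and smooth decisions over time, which this sketch leaves out.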
Title: | SIR Beam Selector for Amazon Echo Devices Audio Front-End |
---|---|
Author / Contributor: | Kristjansson, Trausti ; Hilmes, Philip Ryan ; Zhang, Xianxian |
Published in: | 2019 IEEE International Workshop on Signal Processing Systems (SiPS), 2019-10-01 |
Publication: | IEEE, 2019 |
DOI: | 10.1109/sips47522.2019.9020406 |