Speech Recognition Code Python

Metadata-Enhanced Speech Emotion Recognition: Augmented Residual Integration and Co-Attention in Two-Stage Fine-Tuning

Abstract: Speech Emotion Recognition (SER) involves analyzing vocal expressions to determine the emotional state of speakers, where the comprehensive and thorough utilization of audio information is ...

IEEE

From Characters to Subwords: Modeling Unit Conversion for Low-resource Speech Recognition

Abstract: Multilingual automatic speech recognition (ASR) models greatly facilitate recognizing low-resource languages by sharing representations across similar languages. However, the commonly ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Metadata-Enhanced Speech Emotion Recognition: Augmented Residual Integration and Co-Attention in Two-Stage Fine-Tuning

From Characters to Subwords: Modeling Unit Conversion for Low-resource Speech Recognition

Trending now