Abstract: Environmental noise poses a significant challenge to speech emotion recognition (SER) systems, as it distorts acoustic features and masks critical emotional cues. While traditional speech ...