Audio-Visual Automatic Speech Recognition: Benefits and Challenges
Human speech production and perception are bimodal: a listener evaluates both the speaker's visual cues and the speech signal. Audio-visual automatic speech recognition (AV-ASR) is an active research area. It uses both auditory and visual data to improve speech recognition.
Why Audio-Visual Automatic Speech Recognition?
Speech is one of the…