Question

The robustness of this task can be improved by applying transforms like SPLICE and CMVN. An architecture of composed weighted finite-state transducers is used for this task by the open-source package Kaldi, which does not support the frequently-used CTC loss function in alignment during this task. Classical algorithms for this task extract 39 MFCC features for each time frame of a sliding window, then feed them into an (*) GMM-HMM. Context-dependent models are used in this task to account for allophones. (-5[1])Probabilistic methods for this task compute the product of an acoustic model and a language model to select the most likely word sequence given a sound signal. (10[1])For 10 points, speech synthesis (10[1])is the reverse of what natural language processing task used by digital assistants like Siri to process user input? (10[1])■END■

ANSWER: speech recognition [or automatic speech recognition or ASR; accept transcription or speech-to-text or STT; accept speech alignment; prompt on speech processing or natural language processing until read; prompt on vocal recognition or voice recognition; prompt on answers like understanding speech; reject “text-to-speech” or "TTS" or "speech synthesis"; reject "speaker identification" or "speaker verification" or "vocal identification”]
<VD, Other Science>
= Average correct buzz position

Back to tossups

Buzzes

PlayerTeamOpponentBuzz PositionValue
Akshay SeetharamClaremontALCU79-5
Nicholas DaiU[C]SDJason et al.10610
Anderson WangAnderson et al.SGV Ventures11110
Thomas DellaertALCUClaremont13010

Summary

2024 ARCADIA at Claremont2024-11-02Y3100%0%33%115.67
2024 ARCADIA at Illinois2024-11-09Y475%0%50%118.67
2024 ARCADIA at Waterloo2024-11-09Y6100%0%50%113.67
2024 ARCADIA at Duke2024-12-06Y367%0%67%105.00
2024 ARCADIA at Warwick2024-12-06Y450%0%75%110.50
2024 ARCADIA at GT2024-12-06Y560%0%40%125.33
2024 ARCADIA at Florida Tech2024-12-06Y475%0%50%114.67
2024 ARCADIA at UC Berkeley2024-12-06Y2100%0%50%104.50