Question

A “slow” system controls these quantities of a “fast” system in a model proposed by Jürgen (“yur-gen”) Schmidhuber in 1991. These quantities are uniformly sampled from the range negative to positive inverse square root of input number in a technique unusually named for its developer's first name, Xavier initialization. These quantities are updated during runtime in the (*) "attention" mechanism that is central to transformer models. These quantities are the coefficients in a sum that is fed into a function like softmax or ReLU (“rel-you”). The biases or, more commonly, these quantities are updated by performing gradient descent on the loss (10[1])function through backpropagation. For 10 points, name these quantities in (10[1])a neural network that represent the connection strength between neurons. ■END■

ANSWER: neural network connection weights [or weights of a neural network; accept fast weight programmer; accept weight vector; accept weight matrix; prompt on coefficients; prompt on w or W]
<Chen, Other Science>
= Average correct buzz position

Back to tossups

Buzzes

PlayerTeamOpponentBuzz PositionValue
Ali HamzehWilliams et al.Cleo: 5/7 movie7210
Mike BentleyWorld's Fair Wiggle WalkRiley et al.8210
Urbas EkkaTunks et al.Houston Junior College11910

Summary

2024 ESPN @ Brown04/06/2024Y367%0%0%64.50
2024 ESPN @ Cambridge04/06/2024Y2100%0%0%90.50
2024 ESPN @ Online06/01/2024Y3100%0%0%91.00