Question

A “slow” system controls these quantities of a “fast” system in a model proposed by Jürgen (“yur-gen”) Schmidhuber in 1991. These quantities are uniformly sampled from the range negative to positive inverse square root of input number in a technique unusually named for its developer's first name, Xavier initialization. These quantities are updated during runtime in the (*) "attention" mechanism that is central to transformer models. These quantities are the coefficients in a sum that is fed into a function like softmax or ReLU (“rel-you”). The biases or, more commonly, these quantities (10[1])are updated by (10[1])performing gradient descent on the loss function through backpropagation. For 10 points, name these quantities in a neural network that represent the connection strength between neurons. ■END■

ANSWER: neural network connection weights [or weights of a neural network; accept fast weight programmer; accept weight vector; accept weight matrix; prompt on coefficients; prompt on w or W]
<Chen, Other Science>
= Average correct buzz position

Back to tossups

Buzzes

PlayerTeamOpponentBuzz PositionValue
Kevin WangTriple Round Robin LoversJason Lovers6310
Jaimie CarlsonLabour's Lost LoversJeffrey and Dahmers6610
Theo KatzmanClark AClark B1190

Summary

2024 ESPN @ Brown04/06/2024Y367%0%0%64.50
2024 ESPN @ Cambridge04/06/2024Y2100%0%0%90.50
2024 ESPN @ Online06/01/2024Y3100%0%0%91.00