Question

A “slow” system controls these quantities of a “fast” system in a model proposed by Jürgen (“yur-gen”) Schmidhuber in 1991. These quantities are uniformly sampled from the range negative to positive inverse square root of input number in a technique unusually named for its developer's first name, Xavier initialization. These quantities are updated during runtime in the (*) "attention" mechanism that is central to transformer models. These quantities are the coefficients in a sum that is fed into a function like softmax or ReLU (“rel-you”). The biases or, more commonly, these quantities (10[1])are updated by (10[1])performing gradient descent on the loss (10[1])function through backpropagation. For 10 points, name these quantities (10[1])in (10[1])a neural network that represent the connection strength between neurons. ■END■

ANSWER: neural network connection weights [or weights of a neural network; accept fast weight programmer; accept weight vector; accept weight matrix; prompt on coefficients; prompt on w or W]
<Chen, Other Science>
= Average correct buzz position

Buzzes

PlayerTeamOpponentBuzz PositionValue
Kevin WangTriple Round Robin LoversJason Lovers6310
Jaimie CarlsonLabour's Lost LoversJeffrey and Dahmers6610
Ali HamzehWilliams et al.Cleo: 5/7 movie7210
Agnijo BanerjeeEugene o'NeggingA VK a Day Keeps the Doctor Away8110
Mike BentleyWorld's Fair Wiggle WalkRiley et al.8210
Omer KeskinYou, Me and the Big GHeat-Oppressed Brains10010
Theo KatzmanClark AClark B1190
Urbas EkkaTunks et al.Houston Junior College11910

Summary

2024 ESPN @ Columbia03/23/2024N1100%0%100%133.00
2024 ESPN @ Brown04/06/2024Y367%0%0%64.50
2024 ESPN @ Cambridge04/06/2024Y2100%0%0%90.50
2024 ESPN @ Online06/01/2024Y3100%0%0%91.00