Andrey Markov - 2024 ACF Nationals

Question

Answer the following about DeepMind researchers training humanoid robots to play one-on-one soccer, for 10 points each.

[10e] The team modeled the soccer environment using extensions of stochastic processes named for this mathematician, in which event probabilities depend only on the current state.

ANSWER: Andrey Markov [or Andrey Andreyevich Markov; accept Markov processes or (discounted partially observable) Markov decision processes]

[10h] The researchers trained the robot agents to learn this function using an actor–critic algorithm. Agents learn this function, which maps a state to an action, in reinforcement learning.

ANSWER: policy [or policies]

[10m] The agents controlled their movement on the field using 20 servomotors, which generally belong to either the rotary or linear type of this class of devices. These devices convert an input signal into mechanical motion or force.

ANSWER: actuators [accept rotary actuators or linear actuators] (The paper is “Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning.”)

Back to bonuses

Summary


2024 ACF Nationals	2024-04-21	Y	20	13.50	75%	50%	10%

Data


McGill	Brown	10	0	0	10
Chicago C	Minnesota A	0	0	0	0
Columbia A	Claremont Colleges	10	0	10	20
Columbia B	Michigan	10	0	10	20
Cornell A	Toronto B	10	0	0	10
Rutgers	Harvard	10	0	0	10
Ottawa	Illinois	10	0	0	10
Indiana	Florida	10	0	10	20
Kentucky	Toronto A	0	0	10	10
Duke	Minnesota B	10	0	10	20
NYU	Johns Hopkins	0	0	0	0
North Carolina B	Berkeley A	10	0	10	20
Stanford	Northwestern	10	10	10	30
Maryland	Penn	10	0	0	10
Purdue	Truman State	10	10	10	30
Chicago A	South Carolina	10	0	10	20
Texas	Cornell B	0	0	0	0
Vanderbilt	Yale B	0	0	0	0
North Carolina A	Virginia	10	0	10	20
Yale A	WUSTL B	10	0	0	10