Question

When this process is performed post-training, the AdaRound algorithm may be used to minimize loss. For 10 points each:
[10h] Name this process that is classified as either symmetric or asymmetric, depending on whether a zero-point is specified. Applying this process to LLMs has dramatically reduced their VRAM (“vee-RAM”) requirements.
ANSWER: quantization [accept binarization; accept more specific answers such as post-training quantization or quantization-aware training; prompt on PTQ or QAT]
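
A sketch of the symmetric/asymmetric distinction this part alludes to (the NumPy formulation and function names below are illustrative assumptions, not drawn from the question or any specific library): asymmetric quantization maps a float range onto integers using both a scale and a zero-point, while symmetric quantization fixes the zero-point at zero.

    import numpy as np

    def asymmetric_quantize(x, num_bits=8):
        # Map [x.min(), x.max()] onto the unsigned range [0, 2**num_bits - 1]
        # using a scale and a zero-point (the "asymmetric" scheme in the clue).
        qmin, qmax = 0, 2 ** num_bits - 1
        scale = (x.max() - x.min()) / (qmax - qmin)
        zero_point = int(round(qmin - x.min() / scale))
        q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
        return q, scale, zero_point

    def symmetric_quantize(x, num_bits=8):
        # Symmetric scheme: the range is centred on zero, so no zero-point is needed.
        qmax = 2 ** (num_bits - 1) - 1
        scale = np.abs(x).max() / qmax
        q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
        return q, scale

    weights = np.random.randn(4, 4).astype(np.float32)
    q, scale, zp = asymmetric_quantize(weights)
    reconstructed = (q.astype(np.float32) - zp) * scale   # dequantize: close to the original weights
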
[10m] Quantized neural networks often use “straight-through estimators” to compute “surrogate” versions of this function. In a process named for it, the product of this function with a constant is repeatedly subtracted from a model's parameters to minimize an objective function.
ANSWER: gradient [accept surrogate gradient or gradient descent; prompt on SGD; prompt on del]
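
A toy illustration of both clues in this part, assuming a NumPy least-squares setup of this editor's choosing: gradient descent repeatedly subtracts a constant (the learning rate) times the gradient from the weights, and the straight-through estimator supplies a surrogate gradient by treating the derivative of rounding as 1.

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=(64, 3))
    true_w = np.array([1.5, -2.0, 0.5])
    y = x @ true_w

    w = np.zeros(3)   # full-precision "shadow" weights
    lr = 0.1          # the constant in the clue: the learning rate

    for _ in range(200):
        w_q = np.round(w * 4) / 4               # forward pass uses weights quantized to steps of 0.25
        grad = x.T @ (x @ w_q - y) / len(x)     # gradient of 0.5 * mean squared error w.r.t. w_q
        # Straight-through estimator: treat d(round)/dw as 1, so the surrogate
        # gradient with respect to w is just the gradient with respect to w_q.
        w -= lr * grad                          # gradient descent: subtract lr * gradient from the weights

    print(np.round(w * 4) / 4)   # approximately [ 1.5 -2.   0.5]
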
[10e] Quantization works by reducing the precision of this data type, which is defined in IEEE 754. Doubles are a more precise variant of this type, which is the most common type used to represent decimal numbers.
ANSWER: floats [accept floating-point numbers or FP]
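
A short demonstration of the precision gap between single- and double-precision IEEE 754 values; the NumPy calls below are simply one convenient way to show it and are not part of the question.

    import numpy as np

    # IEEE 754 single precision: 1 sign + 8 exponent + 23 fraction bits (~7 decimal digits).
    # IEEE 754 double precision: 1 sign + 11 exponent + 52 fraction bits (~15-16 decimal digits).
    print(f"{np.float32(0.1):.20f}")   # 0.10000000149011611938
    print(f"{np.float64(0.1):.20f}")   # 0.10000000000000000555
    print(np.finfo(np.float32).eps)    # machine epsilon, about 1.19e-07
    print(np.finfo(np.float64).eps)    # machine epsilon, about 2.22e-16
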
<Other Science>


Summary

2025 ACF Nationals | 04/19/2025 | Y | 21 heard | 15.71 PPB | 100% easy | 57% medium | 0% hard

Data

Team | Opponent | [10h] | [10m] | [10e] | Total
Virginia | Chicago B | 0 | 0 | 10 | 10
Cornell B | Illinois B | 0 | 10 | 10 | 20
Rutgers | Illinois A | 0 | 10 | 10 | 20
Florida | Indiana | 0 | 0 | 10 | 10
Ottawa | LSE | 0 | 0 | 10 | 10
WUSTL B | Michigan | 0 | 0 | 10 | 10
British Columbia | Minnesota | 0 | 10 | 10 | 20
NYU | RIT | 0 | 10 | 10 | 20
Harvard | North Carolina A | 0 | 0 | 10 | 10
Northwestern | UC Berkeley B | 0 | 0 | 10 | 10
Johns Hopkins | Ohio State | 0 | 10 | 10 | 20
Virginia Tech | Stanford | 0 | 0 | 10 | 10
Texas | Arizona State | 0 | 10 | 10 | 20
Toronto A | Columbia B | 0 | 10 | 10 | 20
Penn State | Toronto C | 0 | 10 | 10 | 20
Georgia Tech | UC Berkeley A | 0 | 10 | 10 | 20
UCF | Georgia State | 0 | 10 | 10 | 20
Cornell A | Vanderbilt | 0 | 10 | 10 | 20
Waterloo B | North Carolina B | 0 | 10 | 10 | 20
Winona State | Chicago A | 0 | 0 | 10 | 10
Iowa State | Yale | 0 | 0 | 10 | 10