Question

This quantity, which ranges from zero to one-half, represents the probability that a point in the dataset will be misclassified based on the rest of the data. For 10 points each:
[10h] Name this quantity that equals the sum of pk (“p-sub-k”) times “one minus pk” over all classes k. A standard criterion used by the CART algorithm chooses splits to minimize this quantity.
ANSWER: Gini impurity [or Gini index; reject “Gini coefficient”]
[10m] The CART algorithm uses Gini impurity to form these constructs. These constructs are formed via bootstrap samples and then aggregated in a “random” method named for containing multiple of them.
ANSWER: decision trees [prompt on trees] (Random forests average over multiple decision trees.)
[10e] Random forests can be used for both classification and this other task, its continuous analogue. A common approach to this task minimizes the sum of squared residuals.
ANSWER: regression [accept least squares regression; accept ordinary least squares regression; prompt on ordinary least squares or OLS]
<Nageswaran, Other Science>

Back to bonuses

Summary

2024 ESPN @ Brown04/06/2024Y215.00100%50%0%
2024 ESPN @ Cambridge04/06/2024Y210.00100%0%0%
2024 ESPN @ Chicago03/23/2024Y510.0080%20%0%
2024 ESPN @ Columbia03/23/2024Y711.43100%14%0%
2024 ESPN @ Duke03/23/2024Y210.0050%50%0%
2024 ESPN @ Online06/01/2024Y412.50100%25%0%

Data

UNC ADuke001010
UNC HunnyUNC B010010