Difference between revisions of "D-Value"

Revision as of 15:54, 9 September 2014

The D-value is a statistic that estimates the number of points that a team would expect to score against an "average" quizbowl team. It is useful in comparing the performance of all teams that play a given packet set, in situations where multiple tournaments use the same packet set, thus making it impossible to directly compare the performance of two different teams.

Starting in 2010, the D-value replaced the S-value as the statistic determining wild card bids to NAQT CCCT and ICT.

Official Formula

D-value = 20 x (Adjusted TPPTH + Adjusted BHPTH x PPB)

Meaning of Component Statistics

The tossup points per tossup heard, or TPPTH, is computed by dividing the total number of tossup points a team scored by the number of tossups it heard. TPPTH is adjusted by multiplying by a strength-of-schedule factor that measures how difficult it was to score tossup points against a given team's slate of opponents, relative to all teams that played the set.

The bonuses heard per tossup heard, or BHPTH, is computed by counting the number of tossups a team answered correctly, and dividing by the number of tossups it heard. Most properly, all overtime tossups would be removed before computing BHPTH, but for ease of data collection this is not usually done. Like TPPTH, BHPTH is adjusted by multiplying the same strength-of-schedule factor.

The points per bonus, or PPB, is just the team's bonus conversion.

The resulting computation, TPPTH + (BHPTH x PPB), gives the expected number of points a team would score on a random tossup against a statistically average tossup-converting team. This number is converted to a more intuitive statistic, the number of points scored in an average game, by multiplying by 20 (since there are typically 20 tossups in an untimed match, and statistics for timed matches are typically normalized to "per 20 tossups heard").

NAQT Additional Modifications

There are two additional modifications done by NAQT to use the D-value as a measure of team strength on NAQT questions.

The first modification uses "difficulty correction factors" (DCs) that account for teams playing in combined (Division 1 and Division 2) fields on the wrong packet set. This ensures that, for instance, a Division 2 team that is forced to play the Division 1 SCT set (because not enough Division 2 teams signed up for the tournament) is not penalized for playing a tougher set. While the DCs were derived arbitrarily, there is a small amount of evidence that they are at least in the right ballpark.

The second modification uses an "order of finish correction" that accounts for the contingency that a statistically better team finishes behind a statistically worse team at a given SCT. In practice, it can be kind of tricky to determine exactly which teams to include. A good general rule of thumb is that if the average D-value of a set of teams is increased by including the next-lowest-finishing team, then the next-lowest-finishing team is included in the set and the new average D-value is computed.

These modifications allow NAQT to reward teams for finishing higher at their tournament, and to not penalize (too much) teams that play an inappropriate packet set for their division.

Criticism of D-Values

The D-value has been criticized as an example of mathturbation, given the insufficiency and inaccuracy of the data sets used to compute it. However, since this criticism would apply to any reasonably simple, transparent, and unbiased ranking of teams given current data collection limitations, and since NAQT needs a reasonably simple, transparent, and unbiased ranking of teams to determine ICT wild card bids, the D-value is seen as a relatively benign example.

The D-value has also been criticized for its ability to overestimate the ability of teams in exceptionally strong fields and underestimate the ability of teams in exceptionally weak fields. More complex strength-of-schedule adjustments could be used to diminish the influence of fields that strongly deviate from average strength.

External Links

NAQT's Explanation of D-values

@@ Line 1: / Line 1: @@
-<b><u>Official Method</u></b><br>
+The D-value is a statistic that estimates the number of points that a team would expect to score against an "average" quizbowl team. It is useful in comparing the performance of all teams that play a given [[packet]] set, in situations where multiple tournaments use the same packet set, thus making it impossible to directly compare the performance of two different teams.
-x (Adjusted TPPTH + Adjusted BHPTH x Adjusted PPB)
-<b><u>Brief Explanation</u></b><br>
+Starting in 2010, the D-value replaced the [[S-value]] as the statistic determining wild card bids to [[NAQT]] [[CCCT]] and [[ICT]].
-The D-values main function is to roughly find the average points per game a team would have against the "average" quizbowl team. This is done by finding the tossup points per tossup heard (TPPTH), bonus points per tossup heard (BHPTH), and points per bonus (PPB). By taking the TPPTH plus the product of BHPTH and PPB, this tells us roughly how many points a team would get per tossup heard, including bonuses. Since there are 20 tossups in a NAQT tournament, this figure would then be multiplied by 20 to give an estimate for the average number of points per game we would expect a team to get. Other factors such as strength of schedule (SOS) and division question correction (DC) are also taken into consideration, but skipped here to to brevity. These can be found under Additional Sources.
-<b><u>Additional Sources</u></b><br>
+== Official Formula ==
-NAQT - D-values[http://www.naqt.com/college/d-values.html]
+D-value = 20 x (Adjusted TPPTH + Adjusted BHPTH x PPB)
+== Meaning of Component Statistics ==
+The [[tossup]] points per tossup heard, or TPPTH, is computed by dividing the total number of tossup points a team scored by the number of tossups it heard. TPPTH is adjusted by multiplying by a strength-of-schedule factor that measures how difficult it was to score tossup points against a given team's slate of opponents, relative to all teams that played the set.
+The [[bonus|bonuses]] heard per tossup heard, or BHPTH, is computed by counting the number of tossups a team answered correctly, and dividing by the number of tossups it heard. Most properly, all overtime tossups would be removed before computing BHPTH, but for ease of data collection this is not usually done. Like TPPTH, BHPTH is adjusted by multiplying the same strength-of-schedule factor.
+The points per bonus, or PPB, is just the team's [[bonus conversion]].
+The resulting computation, TPPTH + (BHPTH x PPB), gives the expected number of points a team would score on a random tossup against a statistically average tossup-converting team. This number is converted to a more intuitive statistic, the number of points scored in an average game, by multiplying by 20 (since there are typically 20 tossups in an untimed match, and statistics for timed matches are typically normalized to "per 20 tossups heard").
+== NAQT Additional Modifications ==
+There are two additional modifications done by NAQT to use the D-value as a measure of team strength on NAQT questions.
+The first modification uses "difficulty correction factors" (DCs) that account for teams playing in combined (Division 1 and Division 2) fields on the wrong packet set. This ensures that, for instance, a Division 2 team that is forced to play the Division 1 SCT set (because not enough Division 2 teams signed up for the tournament) is not penalized for playing a tougher set. While the DCs were derived arbitrarily, there is a small amount of evidence that they are at least in the right ballpark.
+The second modification uses an "order of finish correction" that accounts for the contingency that a statistically better team finishes behind a statistically worse team at a given SCT. In practice, it can be kind of tricky to determine exactly which teams to include. A good general rule of thumb is that if the average D-value of a set of teams is increased by including the next-lowest-finishing team, then the next-lowest-finishing team is included in the set and the new average D-value is computed.
+These modifications allow NAQT to reward teams for finishing higher at their tournament, and to not penalize (too much) teams that play an inappropriate packet set for their division.
+== Criticism of D-Values ==
+The D-value has been criticized as an example of [[mathturbation]], given the insufficiency and inaccuracy of the data sets used to compute it. However, since this criticism would apply to any reasonably simple, transparent, and unbiased ranking of teams given current data collection limitations, and since NAQT needs a reasonably simple, transparent, and unbiased ranking of teams to determine ICT wild card bids, the D-value is seen as a relatively benign example.
+The D-value has also been criticized for its ability to overestimate the ability of teams in exceptionally strong fields and underestimate the ability of teams in exceptionally weak fields. More complex strength-of-schedule adjustments could be used to diminish the influence of fields that strongly deviate from average strength.
+== External Links ==
+[http://www.naqt.com/college/d-values.html NAQT's Explanation of D-values]
 [[Category: Statistics]]
 [[Category: ICT]]