Here I am compiling a list of test cases that can be used to help evaluate the quality of any set of computer (or human/voter) rankings. There is nothing special about these teams, other than they present useful cases I've noticed over time studying scores, upsets, and strange-looking rankings. In each case several of the computers out there violate what is sensible. I think these are all clear-cut cases, and if a system violates most or all of them, we should be skeptical of it. But don't take my word for it - do the comparisons yourself.

2005

2004

2003

2002

1996 1917


Note: The final 2003 BCS ratings are guilty on both counts! They had:

The BCS ratings almost fail the 2004 test, having Wisconsin 19th and Ohio State 20th. I haven't seen any composite listings of the 2002 BCS rankings, but I do know that a majority of their systems ranked Colorado State above TCU, and one (NY Times) ranked South Florida impossibly high at 14th, and 7 spots above the Arkansas team that beat them 42-3!

{back}