Follow me on twitter

On Wisconsin: The FAQ

01.09.12

As you may have noticed, Wisconsin inhabits the #2 spot in my ratings this morning despite having lost five of its first 17 games, including a home game to Iowa and a lopsided loss to Michigan in the past week alone.

This is an issue that is not going to go away this season. Even in a worst-case scenario for the Badgers, they are going to be highly ranked on this web site the rest of the season. This bothers many people and in order to manage the increasing number of inquiries into the matter, I have established what the tech geeks call a “FAQ document” for handy reference. What follows is that document. Please share with all parties you deem relevant to the matter.

Q: Why do you think Wisconsin is #2?

A: I actually don’t think this. Please stop assuming that I, Ken Pomeroy, personally believe every team is properly ranked. No system can do this. Even if you somehow had the time to rank all 345 teams, by the time you were done with this exercise, you would find some things that you didn’t agree with in the rankings you had just made. For fun, spend some time just try ranking the top 50. Through some miracle, if you’re happy with the position of each team when you’re done, just let the results of games come in for the next 2-3 days and then see if you’re still happy.

The bottom line is that is impossible to rank 345 teams that will make any particular individual happy. There are always going to be some outliers in a system. Wisconsin is perhaps the biggest outlier ever in my system. And it’s only going to get worse, because they are going to lose more games and not drop very far because most of the teams they play the rest of the season are very good. But to get back to the question, I don’t believe they’re the second-best team in the country. Something like #20 sounds about right.

Q: Who has Wisconsin beaten to deserve to be #2?

A: As a reminder, the ratings are not meant to have any relation to the polls. (The first paragraph on the ratings explanation page is worth reading if you haven’t seen it before.) I don’t care who “deserves” to be #2. The #2 team in my system should be the second-best in the country. It is theoretically possible to be the second-best team in the nation without having beaten anyone of prominence.

As an example, in 2010, Duke went 1-3 against teams ranked better than 20th in my system during the regular season. Then in the tournament they went 5-0 against such teams! It was crazy, but not completely surprising. The point is, for predictive purposes, simply looking at who a team has beaten or lost to is short-sighted. Obviously this methodology has not worked well in predicting the Badgers’ recent performance. Duly noted. In bulk, though, simply rating teams by the quality of their wins and losses is not going to make for a good predictive system.

Q: The fact that Wisconsin is #2 invalidates all of your work. (More of a comment, really.)

I disagree. Let’s say you are very good at performing some task. For the next 345 times you perform this task, I will judge you based on your worst effort. I don’t think you would feel like this is a fair way to measure your ability to perform the task. Because it isn’t.

If you are going to trash the entire system based on the biggest outlier over the last six years, I suspect you had no desire to use the system in the first place. Either that, or your team is ranked lower than you think it should be.

Q. When are you going to fix this?

First off, I’m not sure it can be fixed. I mean, sure I can fix it so Wisconsin is ranked more reasonably. I could add some code to my algorithm like this…

if Wisconsin:
	do something to fix Wisconsin
else:
	do normal calculations

But that really isn’t a good way to solve the problem. Needless to say, I’d like to fix it.  However, fixing Wisconsin can result in messing things up for others. Plus, there are a lot of other ratings systems already doing cool things, and I’m not in the game of copying others’ work, so that doesn’t leave a lot on the table for me to pursue. That doesn’t mean improvements aren’t possible, but regardless of what I come up with, I can promise you there will still be teams that you feel are mis-rated.

For instance, one of the most respected ratings systems in the nerd world is the LRMC ratings. They had Wisconsin fifth before yesterday’s games. Last year, heading into the tournament, they had Belmont fourth, and their system outperforms mine! The Sagarin predictor, also deservedly respected, had Wisconsin second heading into yesterday’s games.

If you would simply remove Wisconsin from the planet, the ratings on my page would look fine to me. I guess you could ignore the entire system based on the worst outlier, but I don’t think that makes a lot of sense. There’s still some insight gained from the teams that differ from the polls. (See: Saint Louis, New Mexico, UConn, Mississippi State, among others.)

Q: Your work is flawed. (Not really a Q, either, I guess, but I get this all the time.)

A: Ugh, I hate it when people say this. Of course it’s flawed. The thing is, your knowledge is flawed, too. If you are ignoring potentially useful tools because of a single issue, then your judgment is flawed as well. And I’m guessing you’ve never tracked the quality of your knowledge so you don’t even know how flawed it is. If you’re like most people, you think you’re knowledge is great because you remember the predictions you made that worked out and you forget about the ones that didn’t. It’s human nature.

The difference between your flawed knowledge and my flawed system is that I am tracking the results of my system so you can identify all of the flaws for yourself. Ten years from now, you’ll be able to look at my site and see that Wisconsin was like 21-13 at the end of the 2012 season and ranked #5. Whereas the time you made some crazy prediction that didn’t pan out is quickly forgotten.

I would say there’s still enough value in the work here to provide a useful reality check on your own knowledge. Used together, your flawed knowledge and my flawed tools can be more powerful than used separately.

Q: Anything else we need to know?

A: Well, since you asked, you might as well know that Ohio State will almost surely be the #1 team the rest of the season. Also, they will probably finish with more losses than Kentucky or UNC or Syracuse, because they will play a tougher collection of teams the rest of the season. Just to reiterate, if you are looking for a ranking of teams by wins and losses, a la the AP poll or the RPI, then you’ll be more fulfilled consulting the AP poll or RPI for your ranking needs as the season continues. (Sagarin and Massey actually have better systems for this purpose, though.) However, I believe you will be less enlightened by ignoring the work of predictive systems.

Note also, that this does not mean I think Ohio State is the best team in the country, nor do I expect you to think that! (Although right now, I actually do believe the Buckeyes are the best, but I may change my mind as upcoming events warrant.) I didn’t think Duke was the best team two years ago, even after the tournament. But by the time the season ended, it was hard to argue they weren’t one of the best teams in the country, and were significantly underrated by the folks who take a resume approach to evaluating teams at the expense of all other information. And basically, I don’t think the Wisconsin situation invalidates the predictive methods used on this site, except in cases involving Wisconsin.

The untrained eye: Washington State vs. Utah

01.06.12

Utah beat Washington State 62-60 in a 60 possession, overtime game on Thursday night. I was there. This is was I saw.

Imperfection denied

Heading into last night, no team in the nation had a better shot at going winless in conference than Utah. Their inaugural Pac-12 contest was a 40-point loss to Colorado, who is (despite now leading the conference at 2-0) probably somewhere between the 7th and 11th best team in the conference. By the end of the night, the Utes were off the hook. As was the third-most likely team, Grambling who beat Alabama A&M 60-55. The bottom line is that it’s very difficult to go through a conference slate winless, no matter how bleak the situation looks after non-conference play. The onus is now on Towson, who has a 29% chance of losing their last 15 CAA games. Despite being on a 34-game losing streak, I have the Tigers’ back. They will win a game in conference.

Upsets require breaks

As one might have expected in a Utah win, the Utes played better than normal. The ran a crisp offense in the first half and only committed three turnovers while racking up 29 points in 27 possessions. It also helped that the normally sloppy Utes encountered a zone during the entire half. This allowed Utah to take better care of the ball and also take the air out of it. The game ended with just 60 possessions in 45 minutes which undoubtedly boosted Utah’s chances as well. Furthermore, Washington State made just 10 of their 22 free throw attempts. The Utes played better than usual, and the Cougars (except for Brock Motum) played worse than usual, and that’s how these things happen. It should happen to Towson before the end of the season, too.

Utah probably can’t be considered the worst power-conference team of all time

I mean, for history’s sake, with the win they have become just a historical footnote. But for scoring-margin aficionados like myself, the remaining 16 conference games for Utah will determine whether this game was completely improbable or merely slightly unusual. Keep in mind, in two conference games the Utes have been outscored by 38 points. If Utah finished the season with an average scoring margin of -19, then yeah, this game is going to look out of place.

Fun with play-by-play data

At the under-12 media time out in the second half, Utah had committed eight fouls to Washington State’s one. Being obsessed with probabilities, I contemplated the chances that the next foul would be called against Washington State. You might be aware of this study, which showed that officials prefer to avoid lopsided foul counts. Not only was the foul count unbalanced at this point, but there were three other things going for a high chance of a Wazzu foul – 1) Utah would have the ball coming out of the break; 2) the game was at Utah, so the crowd was rather ornery to this point; and 3) Utah was trailing. It turned out Washington State would pick up two fouls on the ensuing possession.

The study referenced above looked at a 365-game sample. I went back and looked at every play-by-play for the last three seasons and came up with the following data demonstrating the tendency for officials to even up the foul count.

Probability of (x+1)th foul being called on Team A when first x fouls of game have been called on Team A

 x     n    n(a)   Chc(a)
 1  13084   7030    53.7
 2   7030   3410    48.5
 3   3410   1604    47.0
 4   1604    662    41.3
 5    662    253    38.2
 6    253     89    35.2
 7     89     28    31.5
 8     28      7    25.0
 9      7      1    14.3
10      1      0     0.0

x: number of consecutive fouls called against Team A since beginning of game
n: total number of cases
n(a): number of cases where Team A was called for the next foul
Chc(a): percentage of cases where Team A was called for the next foul

With over 13000 play-by-plays, we have enough of a sample to produce a smooth curve without any massaging of the data. The conclusion is obvious: After each consecutive foul on one team it becomes more likely that the other team will get whistled.

There are other factors contributing to this trend besides refs just giving into a coach’s complaints. Just how much of this is the result of a ref’s fragile psyche is up for debate. Nonetheless, the next time you see a lopsided foul count, you should expect the next call to go against the team that has been benefitting from the calls to that point.

Get to know Mike Scott

01.05.12

It’s getting to that time of year where people start talking about All-American teams and player of the year honors. There always seems to be that one guy that is having a large impact but fails to get the proper recognition because his team plays at a very slow pace and thus said player is unable to accumulate the shiny counting stats that other big-time players use to get the media’s attention. The first player that appealed to me like this was Georgetown’s Roy Hibbert in 2007. He was somewhat minutes-challenged, but his advanced stats were extremely impressive when he was on the floor.

Last season, you’ll recall Wisconsin’s Jordan Taylor filled this role. It wasn’t until he hit people over the head with his performance against Ohio State on January 9 that folks began to take his play seriously. Taylor hasn’t been able to maintain the blistering statistical pace that he did last season, but fortunately for tempo-free lovers, someone has taken his place.

According to bbstate.com, through Tuesday’s games, Virginia forward Mike Scott ranked 131st in the nation with 16.0 points per game and 47th with 9.0 rebounds per game. On Monday night, he went for 17 points and nine rebounds against LSU in a 57-52, 60-possession contest. The pace-contaminated numbers aren’t inspiring, and even though a few folks covering the ACC are giving Scott his due acclaim, I expect he’s less known nationally than he should be. Even if he maintains his current production through conference play, it will be a struggle for him to get consideration for ACC player of the year.

However, accounting for the brutally slow tempo UVa plays at, Scott provides one of the best scoring and rebounding combinations in the country and anchors one of the nation’s best defenses on top of that. Pick your favorite big-man player of the year candidate for comparison and drink the tempo-free nectar. (All stats through Tuesday’s games.)

           ORtg  %Poss  OR%  DR%  A%
Scott       125    27   14   26   16
T-Rob       110    28   12   32   13
Sully       126    26   11   34    8
A Davis     129    18   12   26    6
C Zeller    129    22   12   17    7
D Green     103    28    9   24   20
Henson      113    23   10   24   10

This is a good time to mention that from an individual level, offensive rebounds are more valuable than defensive rebounds because when a player grabs a defensive board he is more likely to be taking away a rebound from a teammate. This is especially true for Scott’s offensive rebounds since UVa typically only sends two players to the offensive glass. I put assist rate in the comparison also, because Scott is no slouch in that department, either. Draymond Green gets praise for his passing, and rightfully so, but statistically at least, Scott isn’t far behind, setting up one in every six of his teammates’ buckets.

Obviously, the difference between Scott and the others is that (a) his team is not as good and (b) Virginia has played mainly a collection of atrocious defenses thus far. But hey, if reputable people are going to rave about Doug McDermott, then Scott should be getting some national love, too.

Keep in mind, that without Scott on the floor most of last season, the Cavaliers were significantly worse on both sides of the ball. The improvement this season isn’t all due to Scott – the players around him have improved with another year of experience and Malcolm Brogdon has made a small impact as a freshman – but Scott’s numbers indicate he can take credit for much of the difference. It’ll take some good play in the three games UVa has against Duke and UNC for him to get the mainstream publicity he deserves, but tempo-free fanatics have already noticed.

Page 6 of 189 pages « First  <  4 5 6 7 8 >  Last »