{"id":240,"date":"2014-01-07T13:54:21","date_gmt":"2014-01-07T19:54:21","guid":{"rendered":"http:\/\/67.227.157.91\/~kenpom\/wp_blog\/most-frequent-lineup-data\/"},"modified":"2014-01-07T13:54:21","modified_gmt":"2014-01-07T19:54:21","slug":"most-frequent-lineup-data","status":"publish","type":"post","link":"https:\/\/kenpom.com\/blog\/most-frequent-lineup-data\/","title":{"rendered":"Most frequent lineup data"},"content":{"rendered":"<p>Sometimes when I\u2019m looking over a team\u2019s data, it\u2019s not exactly clear what a team\u2019s lineup looks like in reality. One could watch some video of a team\u2019s past games to figure this out, but in our modern fast-paced world, not everyone has the time to do that. So I\u2019ve been working for a while on something that was fun for me to develop and I hope useful for you to use. It\u2019s an algorithm that, given a team\u2019s lineup, will figure out what position each player plays. I&#8217;ve applied this to the ten-most frequently used lineups for each team and slapped the information on the bottom of each team&#8217;s page.<\/p>\n<p>A word of warning up front: It is not perfect, and being an algorithm and all, it\u2019s completely automated. If your job depends on this information, you should consult other sources to confirm my computer\u2019s guess. Like video, probably. If you are a coach, you probably should be looking at video before you play an upcoming opponent. It\u2019s the least you can do for getting paid to coach a basketball team. <\/p>\n<p>For the most part, though, I like the results. The system\u2019s probably 80% accurate. Maybe higher, but there\u2019s no way to really put a number on it so I\u2019m not sure why I did. You\u2019ll have to judge for yourself. I\u2019m using the old-school concept of basketball positions here which may be a disappointment to some. 
I\u2019m totally down with redefining positions, but in experimenting with that concept, it added another layer of complexity to the project that didn\u2019t work as well in practice as it did in theory. Also, I&#8217;m mainly focusing on offensive positions here. It\u2019s actually not that much of a leap to do something to estimate defensive positions (at that point we can start living in <a href=\"http:\/\/www.basketballprospectus.com\/article.php?articleid=1190\" target=\"_blank\">Drew Cannon\u2019s world<\/a>), but that will require a bit more work. <\/p>\n<p>In order to create the algorithm, I watched a bunch of teams (roughly 100) and assigned a position, one through five, to each player that got decent enough minutes. Then I ran a regression on various stats to best predict the position assignments. I&#8217;m using an initial model to identify the player on the floor most likely playing point guard, and then a second model to identify the remaining four spots on the floor.<\/p>\n<p>To identify the point guard, height and assist rate are useful predictors of course. But the system I\u2019m using tends to not like guys who take a lot of threes and have a low turnover percentage because those players are normally playing shooting guard (or \u201cshooting wing\u201d), regardless of their size. Not surprisingly, a low offensive rebound percentage is normally a giveaway for a point guard, but the system relaxes this requirement for taller players. <\/p>\n<p>Still, the system misses on some guys. Most notably, it can handle point guards up to about 6-6 and beyond that it has trouble. So as of this writing, Norman Powell or Bryce Alford shows up as UCLA\u2019s point guard when Kyle Anderson is on the floor. (For kicks, compare Anderson&#8217;s numbers to Dwight Powell of Stanford. There&#8217;s not a huge difference there and yet it would be laughable to slot the 6-10 Powell at point. 
This is an example of the challenge in developing such a model.)<\/p>\n<p>And then there\u2019s Vermont\u2019s Brian Voelkel. Voelkel is the reason this project didn\u2019t get off the ground months earlier. He&#8217;s 6-6, has easily the highest assist rate on his team, commits enough turnovers to suggest he handles the ball some, and isn\u2019t a good offensive rebounder for his size. So Voelkel shows up as UVM&#8217;s point guard, even though he\u2019s not. Eventually, I\u2019ve come to grips with the idea that I cannot create a system that will put Brian Voelkel in his proper place. I\u2019d like to politely request that Catamounts\u2019 starting point guard Sandro Carissimo drop a few more dimes and commit a few more turnovers to resolve this. Until that happens, Carissimo will be pegged as a two-guard in Vermont\u2019s lineup. Of course, in some cases the point guard distinction isn\u2019t all that important. Some teams have more than one person on the floor capable of handling the ball and initiating the offense. Vermont is not that case, unfortunately, so I can&#8217;t use that excuse here.<\/p>\n<p>The formula for determining the other positions is more straightforward. Height, assist rate (lower indicates a taller position), offensive and defensive rebounding (higher), weight (higher), block rate (higher), two-point percentage (higher), and three-point attempt percentage (lower) are all useful predictors. As with point guards, this part of the model isn\u2019t foolproof, but to me it\u2019s not a big deal if the three and the four are erroneously switched. On a lot of teams, it may not be possible or useful to distinguish those differences even when watching the team play. <\/p>\n<p>Finally, play-by-play data is still not in a state where substitutions are accurately recorded. In some cases, it is impossible to determine the lineup a team has on the floor. 
So on the team page, after the ten-most used lineups, I\u2019ve included the percentage of unknown lineups. (The percentages listed next to the other lineups are relative to the <em>total number of known lineups<\/em>.) The more unknown time there is, the less you should trust the real-life frequencies of the other lineups. But even with a lot of unknown time, you can still get an idea of what a team\u2019s lineup looks like when a particular starter is off the floor, or what it looks like when a team wants to play big or small.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Sometimes when I\u2019m looking over a team\u2019s data, it\u2019s not exactly clear what a team\u2019s lineup looks like in reality. One could watch some video of a team\u2019s past games to figure this out, but in our modern fast-paced world, not everyone has the time to do that. So I\u2019ve been working for a while [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/posts\/240"}],"collection":[{"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/comments?post=240"}],"version-history":[{"count":0,"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/posts\/240\/revisions"}],"wp:attachment":[{"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/media?parent=240"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/categories?post=240"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kenpom.com\/blog\/wp-json\/wp\/v2\/tags?post=240
"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}