ZiPS Percentiles 2010 - A Quick Review
With Colin Wyers taking a look at PECOTA percentiles, I decided to take a quick up-to-date look at how ZiPS percentiles fared in 2010. This doesn’t replace a player-by-player evaluation of the projection system, simply a test to see if upside and downside are being overestimated, underestimated, or right on as a group.
While ZiPS doesn’t use percentiles in the same way, ZiPS can still be evaluated for how well it matched up with probabilities. To do this, I used the most generic, easy-to-use stats, OPS+, and compared how many regulars (300 PA or more) played to a certain level compared to how they did. For pitchers, I used ERA+ and 300 BF.
The first figure, “PRED” is, based on the probabilities given by ZiPS, how many regulars should have beaten the threshold given the “ACT” is how many actually did.
There are obviously more robust ways to do this (OMG TEH HETEROSKEDASTICITY~!), but I’m looking at a simple ballpark here. As I’m not an impartial observer, if you would like to check the data this is derived from, please let me know.
HITTERS
PRED ACT
OPS+ >140 7% 8%
OPS+ >130 14% 14%
OPS+ >120 21% 25%
OPS+ >110 33% 38%
OPS+ >100 49% 55%
OPS+ >90 67% 71%
OPS+ >80 83% 87%
PITCHERS
PRED ACT
ERA+ >140 8% 11%
ERA+ >130 14% 19%
ERA+ >120 22% 24%
ERA+ >110 35% 35%
ERA+ >100 52% 49%
ERA+ >90 73% 69%
ERA+ >80 88% 81%
Dan Szymborski
Posted: October 05, 2010 at 08:06 PM |
11 comment(s)
Login to Bookmark
Related News:
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. Dan SzymborskiHave you found in past year for ZiPS if what you show above is commonly the case? The projected is worse than the actual, or was that more the case for this year?
The pitcher PRED vs. ACT makes a nice little X-graphic as you move, which is also pretty interesting.
In other words, your predictions seem to be doing a very good job of following both actual results and theoretical percents. I don't know about other systems' percentages, nor whether they are tainted by using them in the system development process, and it is, after all, a very quick and dirty, as you disclaimed. But for your system to come out clean by a quick and dirty that you do NOT use as a corrective when you're developing your methods is a very good sign. Nice work.
- Brock Hanke
This is super dirty, but it might be interesting to see if it helps the selection bias. If you email me a spreadsheet with three columns for each group (OPS+, PA, Predicted OPS+ for hitters, ERA+, BF, ERA+ for pitchers), I'll even do it for you. :-) Send it to my work email that you have for the FF league, I don't know what email I registered here with.
You must be Registered and Logged In to post comments.
<< Back to main