|
|
|
|
Baseball Primer Newsblog— The Best News Links from the Baseball Newsstand
Wednesday, February 08, 2012
Much like John Wathan…juggling PECOTA around.
PECOTA has arrived.
BP’s projection system, at its core, follows the same basic principles as it has before. We begin with our baseline projections, which start with a weighted average of past performance, with decreasing emphasis placed on seasons further removed from the season being projected. Then that performance is regressed to the mean. After that, we use the baseline forecast to find comparable players (while also taking into account things like position and body type) and use those to account for the effects of aging on performance.
Every season we put PECOTA under the knife, looking for things we can improve to make sure we’re coming up with the best forecasts possible. Sometimes what we come up with is a minor tweak. At other times, though, what we unearth is not only more significant, but an interesting baseball insight in its own right, even aside from its inclusion in PECOTA.
This season, we’ve made some rather radical changes to how we handle the weighted averages for the PECOTA baselines—we still deemphasize past seasons, but nowhere near as much as we used to. With such a dramatic and counterintuitive change, we thought it best to give our users an explanation of what was changed and why so that they could correctly use and interpret the PECOTA forecasts.
|
Support BBTF
Thanks to Adam M for his generous support.
Bookmarks
You must be logged in to view your Bookmarks.
Hot Topics
Newsblog: Wilmoth: Nate McLouth Designated For Assignment (12 - 12:25am, May 26)Last: TriponNewsblog: Boston.com: Curt Schilling’s 38 Studios lays off all staff (118 - 12:15am, May 26)Last:  Paul D(uda)Hall of Merit: Most Meritorious Player: 1973 Discussion (15 - 12:13am, May 26)Last: DanGNewsblog: Bud Selig -- No need for more MLB replay for now - ESPN (86 - 11:59pm, May 25)Last: cardsfanboyNewsblog: The Hall of Very Good: Former Cards Slugger Critical of "LaRussa's Regime" (4 - 11:26pm, May 25)Last: cardsfanboyNewsblog: CSN to host ‘Phillies at the Beach’ on Memorial Day (18 - 11:25pm, May 25)Last: Fielder's the first baseman, Felder is the fielderHall of Merit: Most Meritorious Player: 1972 Ballot (28 - 11:25pm, May 25)Last: lieiamSox Therapy: A Winning Ballclub? (20 - 11:24pm, May 25)Last: DanNewsblog: Matschulat: Did I Miss The "Paul Konerko Is So Overrated OMG" Bandwagon? (27 - 11:16pm, May 25)Last: baudibNewsblog: TBO: Nerdy Rays head north (17 - 10:07pm, May 25)Last: PreservedFishNewsblog: Himrich’s Top Ten Target Field Foods (6 - 9:57pm, May 25)Last: Long John McCaine Mutiny on the Bounty (scott)Newsblog: T.R. Sullivan: Of Frank Robinson, Milt Pappas and Jim Palmer (6 - 9:42pm, May 25)Last: TR_SullivanNewsblog: Dodgers want to host NHL's Winter Classic (22 - 9:38pm, May 25)Last: Cris ENewsblog: Greenberg: Cubs' Ricketts decries proposal (817 - 9:08pm, May 25)Last:  The Yankee ClapperNewsblog: HP: Baseball is leaving the human factor behind (55 - 8:48pm, May 25)Last: Squash
|
|
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. Don MalcolmWhat weightings do most systems use?
Not sure. I thought its accuracy -- and certainly, its internal consistency and face validity -- fell off a cliff immediately after Nate left. My impression is that it hasn't gotten much worse or better since the big drop, and has now established a lower level.
Uniquely accurate is the term used. With nothing to back it up.
I think it's been shown all over the place that there's not a whole hell of a lot of difference in projection systems and that dumb-as-dirt Marcels does about as well as all the others.
Wouldn't any prediction that wasn't just a blatant plagiarism of somebody else's predictions be "uniquely accurate"? Now, whether is was "uniquely good", "uniquely bad", or "uniquely mediocre" would require some evidence, I suppose.
Some quick examples beforehand: the recent poster boy for “New PECOTA” would probably be Francisco Liriano, whose 3.60 PECOTA forecast for 2010 was almost identical to his real-life 3.62 ERA, while Marcels weighted his recent past (2009 was horrific, and many observers wondered if he'd ever return from his injury woes) and forecast a 4.88 ERA.
That's awesome ... until you see that Liriano's 2011 ERA was 5.09.
Somewhat off-topic -- Wyers is probably right that Dunn will bounce back but that was an historic collapse. Dunn has maybe 300 PA to prove last year was a fluke. If he's hitting 175 with no power in mid-June, he'll be released and his career is likely done.
Since they're talking about 2010, I don't get your point.
What did New PECOTA peg for Liriano in 2011? After that career-salvaging 2010 that defied many observers' expectations (but not the system's), did it also forecast a return to replacement-level pitching for Nelly?
LOL PECOTA
I think we have to wait until next year for that.
I see Walt's point now.
Basically, if you are saying new PECOTA nails Liriano in 2010, it would clearly miss on him in 2011. So is it really any better than it was?
Or, basically, Marcel was way off in 2010 but probably pegged him much better in 2011; PECOTA pegged him in 2010 but probably was way off in 2011.
Or, Liriano 2010 is the new PECOTA poster boy while Liriano 2011 is the new PECOTA Elephant Man.
Or just the general point that "hey, we got this guy dead on in this one year" is just a pointless thing to brag about for any model. That's not how models work and it's not what they're useful for. Even a simple "he'll do the same as last year" model is gonna be dead on for a good number of players.
The new PECOTA could of course be right about Liriano's talent level and he was just unlucky in 2011. Alternatively, Marcel had him pegged right in 2010 but he got lucky before returning to form in 2011. Or PECOTA had him right in 2010 but he got hurt again in 2011. Or Liriano's real name is Alfred E Newman and he's 67 years old. Or the truth is somewhere in the middle of all those things.
Now if they could show that for a certain class of players, e.g. guys coming off injury, they are consistently better than other projection systems, then they've got something to brag about.
* I find this much more plausible for hitters. For pitchers, I'm much more skeptical that data from 5-6 years ago tells me much about the pitcher today.
Vlad was by far the best I've ever encountered for predicting breakout systems, but this came at a cost. The overall standard error in projections improved quite a bit when they went to a simpler model (Vlad was neural net based) after Gary left.
IOW Vlad wasn't better overall, but if Vlad wasn't optimistic about a player. ... Well it had a few misses that way too of course but its errors were far more likely to be on the optimistic side.
You must be Registered and Logged In to post comments.
<< Back to main