Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Baseball Newsstand > Baseball Primer Newsblog > Discussion
Baseball Primer Newsblog
— The Best News Links from the Baseball Newsstand

Wednesday, February 08, 2012

BPro: Wyers: Reintroducing PECOTA

Much like John Wathan…juggling PECOTA around.

PECOTA has arrived.

BP’s projection system, at its core, follows the same basic principles as it has before. We begin with our baseline projections, which start with a weighted average of past performance, with decreasing emphasis placed on seasons further removed from the season being projected. Then that performance is regressed to the mean. After that, we use the baseline forecast to find comparable players (while also taking into account things like position and body type) and use those to account for the effects of aging on performance.

Every season we put PECOTA under the knife, looking for things we can improve to make sure we’re coming up with the best forecasts possible. Sometimes what we come up with is a minor tweak. At other times, though, what we unearth is not only more significant, but an interesting baseball insight in its own right, even aside from its inclusion in PECOTA.

This season, we’ve made some rather radical changes to how we handle the weighted averages for the PECOTA baselines—we still deemphasize past seasons, but nowhere near as much as we used to. With such a dramatic and counterintuitive change, we thought it best to give our users an explanation of what was changed and why so that they could correctly use and interpret the PECOTA forecasts.

Repoz Posted: February 08, 2012 at 07:21 AM | 23 comment(s) Login to Bookmark
  Tags: announcements, history, projections, sabermetrics, site news

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

   1. Don Malcolm Posted: February 08, 2012 at 11:41 AM (#4056321)
Year in and year out, there is simply nothing like the drama of counterintuitivity....
   2. JJ1986 Posted: February 08, 2012 at 11:52 AM (#4056335)
When I was messing around with this kind of thing a few years ago, I found that a 1/1/1 weighting was better than a 4/3/2 weighting, but didn't go much further.

What weightings do most systems use?
   3. Randy Jones Posted: February 08, 2012 at 11:59 AM (#4056347)
Haven't paid much attention to PECOTA recently(not giving BPro any money). Hasn't its accuracy gone continuously downhill since Silver left BPro?
   4. DCA Posted: February 08, 2012 at 12:04 PM (#4056353)
Hasn't its accuracy gone continuously downhill since Silver left BPro?

Not sure. I thought its accuracy -- and certainly, its internal consistency and face validity -- fell off a cliff immediately after Nate left. My impression is that it hasn't gotten much worse or better since the big drop, and has now established a lower level.
   5. andrewberg Posted: February 08, 2012 at 12:13 PM (#4056366)
I have always liked to aggregate projections to get a composite idea (fewer Matt Wieters outliers). The problem is that there are fewer and fewer available so the weighting isn't as strong. On the other hand, fangraphs makes it easier to get the few that are left.
   6. bads85 Posted: February 08, 2012 at 12:47 PM (#4056408)
Is it still deadly accurate? I play fantasy baseball to kill, not wound.
   7. Austin Posted: February 08, 2012 at 01:17 PM (#4056442)
   8. AROM Posted: February 08, 2012 at 01:22 PM (#4056446)
"Is it still deadly accurate? I play fantasy baseball to kill, not wound."

Uniquely accurate is the term used. With nothing to back it up.
   9. The Piehole of David Wells, Depends Salesman Posted: February 08, 2012 at 01:24 PM (#4056448)
Not sure. I thought its accuracy -- and certainly, its internal consistency and face validity -- fell off a cliff immediately after Nate left. My impression is that it hasn't gotten much worse or better since the big drop, and has now established a lower level.


I think it's been shown all over the place that there's not a whole hell of a lot of difference in projection systems and that dumb-as-dirt Marcels does about as well as all the others.
   10. Kiko Sakata Posted: February 08, 2012 at 01:32 PM (#4056458)
Uniquely accurate is the term used. With nothing to back it up.


Wouldn't any prediction that wasn't just a blatant plagiarism of somebody else's predictions be "uniquely accurate"? Now, whether is was "uniquely good", "uniquely bad", or "uniquely mediocre" would require some evidence, I suppose.
   11. Walt Davis Posted: February 08, 2012 at 03:02 PM (#4056570)
I hope BPro isn't still up to its old tricks. From the "UPDATE" (not added by Wyers it seems):

Some quick examples beforehand: the recent poster boy for “New PECOTA” would probably be Francisco Liriano, whose 3.60 PECOTA forecast for 2010 was almost identical to his real-life 3.62 ERA, while Marcels weighted his recent past (2009 was horrific, and many observers wondered if he'd ever return from his injury woes) and forecast a 4.88 ERA.

That's awesome ... until you see that Liriano's 2011 ERA was 5.09.

Somewhat off-topic -- Wyers is probably right that Dunn will bounce back but that was an historic collapse. Dunn has maybe 300 PA to prove last year was a fluke. If he's hitting 175 with no power in mid-June, he'll be released and his career is likely done.
   12. SG Posted: February 08, 2012 at 03:12 PM (#4056580)
That's awesome ... until you see that Liriano's 2011 ERA was 5.09.


Since they're talking about 2010, I don't get your point.
   13. SoSH U at work Posted: February 08, 2012 at 03:22 PM (#4056589)
Since they're talking about 2010, I don't get your point.


What did New PECOTA peg for Liriano in 2011? After that career-salvaging 2010 that defied many observers' expectations (but not the system's), did it also forecast a return to replacement-level pitching for Nelly?

   14. Guapo Posted: February 08, 2012 at 04:41 PM (#4056665)
From the comments:

Bryce Harper's number one comp is Wayne Causey


LOL PECOTA
   15. SG Posted: February 08, 2012 at 04:44 PM (#4056668)
What did New PECOTA peg for Liriano in 2011?


I think we have to wait until next year for that.

I see Walt's point now.
   16. Melo's Love Handles (NJ) Posted: February 08, 2012 at 05:02 PM (#4056689)
[15] I don't. Anyone care to explain?
   17. SG Posted: February 08, 2012 at 05:04 PM (#4056692)
Anyone care to explain?


Basically, if you are saying new PECOTA nails Liriano in 2010, it would clearly miss on him in 2011. So is it really any better than it was?
   18. Walt Davis Posted: February 08, 2012 at 06:47 PM (#4056793)
Right. If your system was smart enough to know that Liriano's "true talent" in 2010 was a 3.60 ERA, he then goes out and puts up a 3.60 ERA then, presumably, its projection for 2011 will be a 3.60 ERA. If anything, given Colin is claiming that older data is nearly as good as recent data*, his 2010 performance should give you even more confidence that his true talent is around 3.60.

Or, basically, Marcel was way off in 2010 but probably pegged him much better in 2011; PECOTA pegged him in 2010 but probably was way off in 2011.

Or, Liriano 2010 is the new PECOTA poster boy while Liriano 2011 is the new PECOTA Elephant Man.

Or just the general point that "hey, we got this guy dead on in this one year" is just a pointless thing to brag about for any model. That's not how models work and it's not what they're useful for. Even a simple "he'll do the same as last year" model is gonna be dead on for a good number of players.

The new PECOTA could of course be right about Liriano's talent level and he was just unlucky in 2011. Alternatively, Marcel had him pegged right in 2010 but he got lucky before returning to form in 2011. Or PECOTA had him right in 2010 but he got hurt again in 2011. Or Liriano's real name is Alfred E Newman and he's 67 years old. Or the truth is somewhere in the middle of all those things.

Now if they could show that for a certain class of players, e.g. guys coming off injury, they are consistently better than other projection systems, then they've got something to brag about.

* I find this much more plausible for hitters. For pitchers, I'm much more skeptical that data from 5-6 years ago tells me much about the pitcher today.
   19. 100 Years is Nothing Posted: February 08, 2012 at 11:31 PM (#4056923)
I have my own proprietary projection system, and to be sure I am not missing any data, and don't run into small sample size issues, I do not project anyone until their age 50 season. I get the projections right 99.9% of the time, and is quickly done for each season.
   20. McCoy Posted: February 09, 2012 at 12:06 AM (#4056929)
I bet you missed on Minoso and Paige.
   21. AROM Posted: February 09, 2012 at 12:20 AM (#4056931)
Do you have a 2013 projection for Moyer yet?
   22. Melo's Love Handles (NJ) Posted: February 09, 2012 at 12:53 AM (#4056947)
Ahhh, thanks for the explanation.
   23. Ron J Posted: February 09, 2012 at 07:01 AM (#4056990)
#9 Best I can tell only Gary Huckabay's Vlad was tremendously different in design from any of the other systems.

Vlad was by far the best I've ever encountered for predicting breakout systems, but this came at a cost. The overall standard error in projections improved quite a bit when they went to a simpler model (Vlad was neural net based) after Gary left.

IOW Vlad wasn't better overall, but if Vlad wasn't optimistic about a player. ... Well it had a few misses that way too of course but its errors were far more likely to be on the optimistic side.

You must be Registered and Logged In to post comments.

 

 

<< Back to main

Support BBTF

donate

Thanks to
Adam M
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Hot Topics

NewsblogWilmoth: Nate McLouth Designated For Assignment
(12 - 12:25am, May 26)
Last: Tripon

NewsblogBoston.com: Curt Schilling’s 38 Studios lays off all staff
(118 - 12:15am, May 26)
Last: Paul D(uda)

Hall of MeritMost Meritorious Player: 1973 Discussion
(15 - 12:13am, May 26)
Last: DanG

NewsblogBud Selig -- No need for more MLB replay for now - ESPN
(86 - 11:59pm, May 25)
Last: cardsfanboy

NewsblogThe Hall of Very Good: Former Cards Slugger Critical of "LaRussa's Regime"
(4 - 11:26pm, May 25)
Last: cardsfanboy

NewsblogCSN to host ‘Phillies at the Beach’ on Memorial Day
(18 - 11:25pm, May 25)
Last: Fielder's the first baseman, Felder is the fielder

Hall of MeritMost Meritorious Player: 1972 Ballot
(28 - 11:25pm, May 25)
Last: lieiam

Sox TherapyA Winning Ballclub?
(20 - 11:24pm, May 25)
Last: Dan

NewsblogMatschulat: Did I Miss The "Paul Konerko Is So Overrated OMG" Bandwagon?
(27 - 11:16pm, May 25)
Last: baudib

NewsblogTBO: Nerdy Rays head north
(17 - 10:07pm, May 25)
Last: PreservedFish

NewsblogHimrich’s Top Ten Target Field Foods
(6 - 9:57pm, May 25)
Last: Long John McCaine Mutiny on the Bounty (scott)

NewsblogT.R. Sullivan: Of Frank Robinson, Milt Pappas and Jim Palmer
(6 - 9:42pm, May 25)
Last: TR_Sullivan

NewsblogDodgers want to host NHL's Winter Classic
(22 - 9:38pm, May 25)
Last: Cris E

NewsblogGreenberg: Cubs' Ricketts decries proposal
(817 - 9:08pm, May 25)
Last: The Yankee Clapper

NewsblogHP: Baseball is leaving the human factor behind
(55 - 8:48pm, May 25)
Last: Squash

Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets.

Demarini, Easton and TPX Baseball Bats

 

 

 

AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets.

Page rendered in 0.1913 seconds
54 querie(s) executed