Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Baseball Newsstand > Baseball Primer Newsblog > Discussion
Baseball Primer Newsblog
— The Best News Links from the Baseball Newsstand

Thursday, February 14, 2008

Nate Silver (BP): The Smoking Gun? (Clemens’s late career record):

Comparing reality to projection, Clemens pitched about 50 more innings over this period than might have been expected, and accumulated nine more wins. On the other hand, his ERA was incrementally higher than our forecast. His actual strikeout rate (8.87 K/9) was very close to PECOTA’s expectations (9.24 K/9), while he walked a few more hitters than anticipated. Overall, there is nothing particularly unusual about Clemens’s performance over this four-year window—pitchers of Clemens’s caliber quite often do remain successful late into their thirties.

Fargo Posted: February 14, 2008 at 06:35 PM | 40 comment(s) Login to Bookmark
  Tags: general

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

   1. marko Posted: February 14, 2008 at 09:38 PM (#2691321)
Of-course, nate is assuming that clemens was 1.) Clean in 97, 2.) was on steroids in 1999.

Since the years mcnamee alleged clemens juiced are the only years everyone takes as gospel, why is nate ignoring the fact that in his 1999 season roger clemens was supposedly clean? That perhaps his drastic drop in performance in 99 was a result of not having mcnamee and winstrol around? I wonder what PECOTA would have predicted after 1999, a season where presumbly clemens wasn't juicing?

This analysis was skewed to favor clemens, what a shocker!
   2. As foretold by the prophesy (JFSE) Posted: February 14, 2008 at 10:12 PM (#2691368)
The only assumption is that he was clean in 1997. The article is a what if scenario, assuming he was clean the rest of his career, what would be the most probable scenario.

The hypothesis is that he is used steroids or something sometime after that, not the assumption. The hypothesis could be confirmed if his stats were significantly higher than what his projections would show over a non-trivial sample.

Now, his conclusion wasn't to prove or disprove something was fishy, he said so himself. The point is to show that the logical conclusion to a Hall of Fame career is pretty much a Hall of Fame decline.
   3. marko Posted: February 14, 2008 at 10:49 PM (#2691413)
"Now, his conclusion wasn't to prove or disprove something was fishy, he said so himself. The point is to show that the logical conclusion to a Hall of Fame career is pretty much a Hall of Fame decline."

A Hall of fame decline is something Hank Aaron had, which was aided by his homepark.

What happened with clemens starting in Toronto? Who the hell knows, but it doesn't seem "natural".

And, FTR, Assuming Clemens was taking steroids in 1999 is an assumption. We have no evidence that this is true, as we don't have evidence that he juiced in 97 (though I suspect he juiced both years).
   4. Mirabelli Dictu (Chris McClinch) Posted: February 14, 2008 at 10:59 PM (#2691429)
Well, that's a false assumption.


I take it you have tangible evidence that he was juicing in 1997? I mean, it's certainly a questionable assumption, but who the hell are you to say categorically that it's a false one?
   5. Harveys Wallbangers Posted: February 14, 2008 at 11:17 PM (#2691440)
Kevin:

There is nothing objectionable to presume a "clean state" as a default without info otherwise.

Even though the person in question is objectionable.

I am less bothered by Clemens current discomforts than Bonds. BB never really pretended to be anything other than a guy who played baseball.

Anyway, that's my ha'penny on the issue.
   6. David Nieporent (now, with children) Posted: February 14, 2008 at 11:17 PM (#2691441)
Because there's no basis to assume he was clean in 1997.
Non-jackasses don't need a "basis" to assume someone is innocent. Everyone is, unless there's evidence of guilt.
   7. As foretold by the prophesy (JFSE) Posted: February 14, 2008 at 11:22 PM (#2691446)
#7, Nor is there basis to say he was already tainted. They could have tried projecting him from his age 25 year, but then, they wouldn't have a way to expect a HoF career from him.

However, in these exercises, everything after the cutoff point is split into a separate timeline. The point is to see if the actual events follow the logical conclusion if nothing changed. The PECOTAs are quite good at figuring out what kind of player someone is, much better than the Clemens Report and the "Anti-Clemens Report" did, to try to give as scientific an answer as projecting stats can give.
   8. The Polish Sausage Racer Posted: February 14, 2008 at 11:41 PM (#2691472)
I'd be surprised if Clemens was clean in 1988, frankly.
   9. Srul Itza Posted: February 14, 2008 at 11:43 PM (#2691474)
Come on, people.

Clemens is a Texas school boy.

That means he has been roiding since junior high.

Well, since he did not get to Texas until he was 15, then high school.

What more do you need to know?
   10. As foretold by the prophesy (JFSE) Posted: February 14, 2008 at 11:52 PM (#2691485)
Oh right, people automatically believe someone's guilty when the media says so. No one believes to wait for the answer, they just take the first plausible answer they hear, regardless of accuracy.

I keep forgetting.
   11. Srul Itza Posted: February 14, 2008 at 11:59 PM (#2691492)
Oh right, people automatically believe someone's guilty when the media says so.

Where Clemens is concerned, who needs to wait for the media. I still don't understand why he isn't doing hard time for attempting to murder Piazza by shoving that bat through his torso.
   12. baudib Posted: February 15, 2008 at 12:40 AM (#2691523)
This isn't really surprising at all.

I'm pretty sure that Clemens is lying. However, I don't think he got nearly the benefit from PEDs as Bonds or McGwire did.
   13. Zach Posted: February 15, 2008 at 01:46 AM (#2691553)
As per BBref, there's a ridiculous jump in quality between Clemens' comparables by age after age 33 (career through 1996) and his age-comparables after age 34.

Comps through age 33 (6 retired)

W-L % Innings ERA+
185-113 .620 2693.1 121

Comps through age 34 (7 retired)

W-L % Innings ERA+
208-133 .610 3108 120

Comps through age 35 (7 retired)

W-L % Innings ERA+
227-143 .613 3405.4 122

After 1997, the average career IP of his comparables increased by 15%, with no decrease in career ERA+. Thus, I don't see 1997 as being a good year to run the projections for. Given the inherent time uncertainty of when (or if!) Clemens started using, the same procedure would give different answers after 1996, 1997, and 1998.
   14. marko Posted: February 15, 2008 at 01:57 AM (#2691561)
"I'm pretty sure that Clemens is lying. However, I don't think he got nearly the benefit from PEDs as Bonds or McGwire did."

Except that he did.
   15. Fargo Posted: February 15, 2008 at 01:58 AM (#2691562)
All that #16 proves is that Clemens outlasted most in his cohort. As they "die out" and he "continues on" he's going to be in an increasingly rare set of survivors in his initial cohort -- whether he juiced or not. As the PECOTA analysis points out, persons of his early caliber who "survive" to mid-late career in good shape are rare but Clemens is not unique among that elite set.
   16. Zach Posted: February 15, 2008 at 02:01 AM (#2691563)
D'oh! Here's the career comparison I meant to make. (Statistics are now career totals for retired comparables)

After age 33:
W-L % IP ERA+
235-157 .599 3527.7 115

After age 34:
W-L % IP ERA+
259-180 .590 3978.6 116

After age 35:
W-L % IP ERA+
267-181 .597 4100.9 118

So after age 35, the quick & dirty career projection for career length increases by roughly 15% with no decrease in quality. I'd expect his Pecota comparables improved comparably.
   17. Zach Posted: February 15, 2008 at 02:05 AM (#2691565)
But when you're comparing to a baseline established by a comparable cohort, it matters dramatically what that cohort is. If Silver had compared to PECOTA projections before Clemens went to Toronto, he would get a very different answer than he gives in this article. Since the time of alleged juicing is unknown, you should be very leery when the same method gives different results for different starting points.
   18. Darren Posted: February 15, 2008 at 02:14 AM (#2691570)
I don't get the objections to this study. The accusation is by McNamee is that he began giving Clemens PEDs in 98. Nate's study shows that there is no statistical evidence that supports this claim.

If you think Clemens was juicing his whole career, then this is irrelevant, of course. But that's not what Nate is talking about.
   19. Fargo Posted: February 15, 2008 at 02:17 AM (#2691571)
One thing you (#20) should keep in mind is that Silver's "comparables" are not identified in the same way as B-Ref (and Bill James) chooses them. I haven't had time to think the implications through for your analysis, but keep in mind while B-Bef finds comparables by looking at the total career up to a given age, PECOTA finds comparables based on their performance only in the last three years, and further adjusts for a host of other factors (height, weight, speed, handedness, etc.). What this may mean is that your method is more susceptible to the attrition of a cohort over the career than Silver's method -- which also adjusts for body type.

Added edit: I also concur with Darren's point (#21).
   20. marko Posted: February 15, 2008 at 02:25 AM (#2691577)
"I don't get the objections to this study. The accusation is by McNamee is that he began giving Clemens PEDs in 98. Nate's study shows that there is no statistical evidence that supports this claim."

The accusations don't mention 1999, when clemens performance drastically dropped. Why doesnt nate take that into account?

FYI, Bonds tested positive for steroids. There is a baseball God.
   21. As foretold by the prophesy (JFSE) Posted: February 15, 2008 at 02:35 AM (#2691590)
The problem is, it's pretty much established that numerical analysis won't tell any useful information that will be statistically significant. We've seen two 'official' reports that contradict each other, and two unofficial reports here as well. Nate Silver's are probably as accurate as they're going to get.

The issue is that people are using PEDs as a witchhunt to chastise players who are 'too good', and this is just a means to it. People overperform and underperform all the time, and humans are quite good at looking for patterns where there are none, looking for a reason for each streak and slump.

So, whenever someone plays too good, is generally disliked for something, and tends to be paid a lot of money to act his own way, he attracts a lot of negative attention. So, people have all the reason in the world to accuse someone like Alex Rodriguez. Oh wait, they already have.

Regardless, performance enhancing drugs are such a vague topic that it's laughable. There's no issues for people having performance enhancing surgeries, such as Tommy John surgery - some are even having it done preemptively. People take cortizone shots, which slowly paralyzes the body in large quantities eventually, doesn't anyone think of the children about that?

And, if they suddenly decided to ban and test coffee, and other caffeinated products at workplaces, then, management suddenly decides to retroactively apply this policy and penalize people for having abused such a drug in the past, a lot of people would suddenly be up the creak without a paddle.
   22. David Nieporent (now, with children) Posted: February 15, 2008 at 02:39 AM (#2691596)
FYI, Bonds tested positive for steroids. There is a baseball God.
You do realize that this is the same claim made in the indictment, not some actual steroid test Bonds was given, right?
   23. Kiko Sakata Posted: February 15, 2008 at 03:18 AM (#2691618)
I don't get the objections to this study. The accusation is by McNamee is that he began giving Clemens PEDs in 98. Nate's study shows that there is no statistical evidence that supports this claim.


The study itself is fine as far as what it is - given Roger Clemens's career through 1997, his career through 2007 is not particularly surprising. I think the objections are to what this means. In order to be able to make any claims about "statistical evidence that supports [McNamee's] claim" you would need to know (1) the specific years when Roger Clemens used PEDs, (2) the specific years when Roger Clemens did not use PEDs, and (3) the statistical improvement one would expect for a pitcher who used PEDs. We don't actually know any of these things, so this study ends up not telling us anything at all.
   24. Zach Posted: February 15, 2008 at 03:31 AM (#2691630)
I'm not at all sold on PECOTA as a projection system, to begin with. But any projection system that works by identifying comparable players is implicitly assuming that your target player will age in similar ways. Think of driftwood in a river. You want to project the point at which a particular stick will hit the shore by keeping careful track of where previous sticks that passed by the same place have come to shore in the past. But in order for that forecasting method to work, the population of comparable sticks must be constant as time goes by -- if the population of comparable previous sticks is changing with time, it means that your stick is caught in an eddy or the river's current has changed since you started keeping records. Since BPro has kindly decided to keep its entire hideous algorithm proprietary and hidden, I can only speculate about how it works in practice.
   25. Kiko Sakata Posted: February 15, 2008 at 03:49 AM (#2691648)
But any projection system that works by identifying comparable players is implicitly assuming that your target player will age in similar ways.


This actually raises another potential issue with using PECOTA to draw conclusions about whether a player used PEDs. If the comparable players in Clemens's projection used PEDs, then the fact that Clemens late career was similar to these guys could actually be evidence that he <u>did</u> use PEDs.
   26. As foretold by the prophesy (JFSE) Posted: February 15, 2008 at 04:38 AM (#2691701)
The point is, like the stock market, a player's results are essentially random and cannot be predicted in any way, shape or form. Statistically, if you accumulate enough of them, they will start to show useful data. Of course, you can never see the full population, and the population changes constantly.

This is why there is a technical maximum on the effectiveness of projection engines, because the number we're trying to project, and the information we can base it on is a sample, not the population. The effectiveness of projection engines fall under this maximum, but try to tend to it over time, as it's improved.

Granted, PECOTA isn't the perfect way to project. ZiPS isn't either, and it's just as proprietary. The only projection system that is truly open source is Marcels. There is no perfect projection engine, since we work with imperfect data. I doubt this is the issue. It's just the most sophisticated statistical study that has been performed on this data to date.

Some things just can't be statted, because we don't know them. We can't tell how exactly how good a fielder Ruth or Gehrig was because the only data is G, PO, A, E and DP. We can't tell exactly for some players when all data is available to us. We don't know who was affected, when they were affected, and how they were affected. We can at best use the information in front of us, and it's not sufficient, so we use what we have, and hope it fits.
   27. Zach Posted: February 15, 2008 at 05:08 AM (#2691737)
If a system includes as much information as PECOTA claims but sees such a small improvement over Marcel, to my eyes the next step would be hard core triage to get all the information that wasn't improving the prediction out of the system. Every extraneous piece of information is a new opportunity for the system to screw up. It could be that Prospectus has done this triage and I haven't read about it, but the abundance of factors in PECOTA's system makes me like it less, not more.
   28. Dan Szymborski Posted: February 15, 2008 at 05:13 AM (#2691742)
But in order for that forecasting method to work, the population of comparable sticks must be constant as time goes by -- if the population of comparable previous sticks is changing with time, it means that your stick is caught in an eddy or the river's current has changed since you started keeping records.

The problem is that this line of reasoning invalidates neural networks, both the artificial kind and the kind that are in our brains and everything from weather forecasting to econometrics. Of course the model is going to change, the only question is how the model is going to change.

When you go to the grocery store, you're projecting what you want to eat over the course of the next week or two. The first time you go to a grocery store and shop for yourself, you might not do a good job of it, but after years, your mind has adjusted its model of proper grocery store shopping with knowledge of money needed, amount of food you can eat, practical limitations due to time and dietary needs.

It's no different here. The ability to infer a model from observations is and will be increasingly important in computers over the coming decades.
   29. Dan Szymborski Posted: February 15, 2008 at 05:19 AM (#2691749)
to my eyes the next step would be hard core triage to get all the information that wasn't improving the prediction out of the system. Every extraneous piece of information is a new opportunity for the system to screw up. It could be that Prospectus has done this triage and I haven't read about it,

That's the whole point of the Bayesian inference Nate's using.
   30. Zach Posted: February 15, 2008 at 05:33 AM (#2691756)
Dan, are you referring to the reasoning I'm talking about or the reasoning I'm exhibiting?

Mental models of how things work are of course famous for failing badly and unpredictably, and being overly constrained by limited prior experience. If you suddenly had to shop for a diabetic or a platoon of hungry Marines, your mental model of shopping would fail miserably.

Regarding comparable players, the belief that similar players age similarly is why that information is included in the model, is it not? Hence the weighting of more comparable players higher? If you have a projection system that assumes Player A will age similarly to the ensemble of comparable players, then if things are working properly that ensemble should remain comparable to Player A as time goes by.
   31. Dan Szymborski Posted: February 15, 2008 at 05:39 AM (#2691758)
Mental models of how things work are of course famous for failing badly and unpredictably, and being overly constrained by limited prior experience. If you suddenly had to shop for a diabetic or a platoon of hungry Marines, your mental model of shopping would fail miserably.

True, but computers are far better at weighing long-term observations than people are. Baseball games are quite constrained in possible outcomes, which is a huge bonus here.


Regarding comparable players, the belief that similar players age similarly is why that information is included in the model, is it not? Hence the weighting of more comparable players higher? If you have a projection system that assumes Player A will age similarly to the ensemble of comparable players, then if things are working properly that ensemble should remain comparable to Player A as time goes by.


But again, it's a learning model - it's set to learn from past mistakes in a systematic manner.
   32. Zach Posted: February 15, 2008 at 05:41 AM (#2691759)
That's the whole point of the Bayesian inference Nate's using.

Maybe this is as much a philosophical issue as a mathematical one, but why would you start with a universe of largely irrelevant data and then use Bayesian inference to carve down some of the weights rather than starting with a minimal model and adding data after it's known to be valuable?
   33. Zach Posted: February 15, 2008 at 05:48 AM (#2691761)
Still going by intuitive reasoning, rather than actual math here.

If you have a model which is set up to map a given (incomplete) set of variables to a given (incomplete) set of outcomes while minimizing total error, then at any given time won't you believe your model is working better than if you could consult an Oracle which could tell you the true error on the universe of outcomes?
   34. Fargo Posted: February 15, 2008 at 06:14 AM (#2691767)
Huh? All the projection/prediction models can be tested by comparing the predictions to empirical observations from real life. You don't need to consult an oracle.
   35. Zach Posted: February 15, 2008 at 06:55 AM (#2691771)
Not all possible player-seasons have been played yet.
   36. Fargo Posted: February 15, 2008 at 07:07 AM (#2691774)
Now that was truly oracular.

It should be added to the great declarations of the Oracle at Delphi.

"Nothing in excess."

"Know thyself."

"Not all player-seasons have been played yet."

"It ain't over 'til the fat lady sings."
   37. Zach Posted: February 15, 2008 at 07:25 AM (#2691780)
They will play a lot of games this season, you know. Those games aren't included in the models yet. Suppose some phenom in deepest darkest Africa is busy developing the super-gyroball that you can keep throwing until you're 55 -- that's a possible career path which wouldn't be included in the model. Heck, there haven't been any left-handed catchers in a hundred years. The data which has already been collected is a tiny fraction of all the data which could ever be collected.

You must be Registered and Logged In to post comments.

 

 

<< Back to main

Support BBTF

donate

Thanks to
Backlasher
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Hot Topics

NewsblogHP: Baseball is leaving the human factor behind
(7 - 8:09am, May 25)
Last: boteman

NewsblogBoston.com: Curt Schilling’s 38 Studios lays off all staff
(45 - 8:04am, May 25)
Last: Golfing Great Mitch Cumstein

NewsblogNeyer: New Yankee Stadium: A Review
(75 - 8:01am, May 25)
Last: Harveys Wallbangers

NewsblogFS Midwest: Streaker halts Cardinals-Phillies game
(2 - 7:57am, May 25)
Last: Justin 'The Cespedobear' T

NewsblogGreenberg: Cubs' Ricketts decries proposal
(750 - 7:54am, May 25)
Last: Jolly Old St. Neck Wound, Moral Idiot

NewsblogMatinale: WADJ: Wins Above Derek Jeter
(1 - 7:01am, May 25)
Last: bjhanke

NewsblogRoy Halladay bobblehead with glove on wrong hand selling on MLB.com
(12 - 6:46am, May 25)
Last: Doris from Rego Park

NewsblogSullivan: Dan Haren Makes Mariners Look Like Mariners
(1 - 6:40am, May 25)
Last: The cushions are crowded for Edmundo

Newsblog12 Baseball Feats That Only Happened Once
(25 - 6:25am, May 25)
Last: Greg (U)K

NewsblogShawn Green to play for Israel in World Baseball Classic
(12 - 5:50am, May 25)
Last: shoewizard

NewsblogOT: NBA Monthly Thread, May 2012
(1772 - 5:44am, May 25)
Last: baudib

NewsblogPrimer Dugout (and link of the day) 5-25-2012
(1 - 5:33am, May 25)
Last: Tim Stauffer, Trot Nixon's Coming (Dan Lee)

NewsblogWins Above Replacement: Distribution and Rarity of Talent 2011 - Beyond the Box Score
(9 - 4:18am, May 25)
Last: bobm

NewsblogDodgers want to host NHL's Winter Classic
(15 - 3:07am, May 25)
Last: Greg (U)K

NewsblogOT: NHL Playoff Thread
(1731 - 1:45am, May 25)
Last: baudib

Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets.

Demarini, Easton and TPX Baseball Bats

 

 

 

AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets.

Page rendered in 0.4007 seconds
55 querie(s) executed