User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets. |
Demarini, Easton and TPX Baseball Bats
|
AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets. |
Page rendered in 0.4007 seconds
55 querie(s) executed

Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. marko Posted: February 14, 2008 at 09:38 PM (#2691321)Since the years mcnamee alleged clemens juiced are the only years everyone takes as gospel, why is nate ignoring the fact that in his 1999 season roger clemens was supposedly clean? That perhaps his drastic drop in performance in 99 was a result of not having mcnamee and winstrol around? I wonder what PECOTA would have predicted after 1999, a season where presumbly clemens wasn't juicing?
This analysis was skewed to favor clemens, what a shocker!
The hypothesis is that he is used steroids or something sometime after that, not the assumption. The hypothesis could be confirmed if his stats were significantly higher than what his projections would show over a non-trivial sample.
Now, his conclusion wasn't to prove or disprove something was fishy, he said so himself. The point is to show that the logical conclusion to a Hall of Fame career is pretty much a Hall of Fame decline.
A Hall of fame decline is something Hank Aaron had, which was aided by his homepark.
What happened with clemens starting in Toronto? Who the hell knows, but it doesn't seem "natural".
And, FTR, Assuming Clemens was taking steroids in 1999 is an assumption. We have no evidence that this is true, as we don't have evidence that he juiced in 97 (though I suspect he juiced both years).
I take it you have tangible evidence that he was juicing in 1997? I mean, it's certainly a questionable assumption, but who the hell are you to say categorically that it's a false one?
There is nothing objectionable to presume a "clean state" as a default without info otherwise.
Even though the person in question is objectionable.
I am less bothered by Clemens current discomforts than Bonds. BB never really pretended to be anything other than a guy who played baseball.
Anyway, that's my ha'penny on the issue.
However, in these exercises, everything after the cutoff point is split into a separate timeline. The point is to see if the actual events follow the logical conclusion if nothing changed. The PECOTAs are quite good at figuring out what kind of player someone is, much better than the Clemens Report and the "Anti-Clemens Report" did, to try to give as scientific an answer as projecting stats can give.
Clemens is a Texas school boy.
That means he has been roiding since junior high.
Well, since he did not get to Texas until he was 15, then high school.
What more do you need to know?
I keep forgetting.
Where Clemens is concerned, who needs to wait for the media. I still don't understand why he isn't doing hard time for attempting to murder Piazza by shoving that bat through his torso.
I'm pretty sure that Clemens is lying. However, I don't think he got nearly the benefit from PEDs as Bonds or McGwire did.
Comps through age 33 (6 retired)
W-L % Innings ERA+
185-113 .620 2693.1 121
Comps through age 34 (7 retired)
W-L % Innings ERA+
208-133 .610 3108 120
Comps through age 35 (7 retired)
W-L % Innings ERA+
227-143 .613 3405.4 122
After 1997, the average career IP of his comparables increased by 15%, with no decrease in career ERA+. Thus, I don't see 1997 as being a good year to run the projections for. Given the inherent time uncertainty of when (or if!) Clemens started using, the same procedure would give different answers after 1996, 1997, and 1998.
Except that he did.
After age 33:
W-L % IP ERA+
235-157 .599 3527.7 115
After age 34:
W-L % IP ERA+
259-180 .590 3978.6 116
After age 35:
W-L % IP ERA+
267-181 .597 4100.9 118
So after age 35, the quick & dirty career projection for career length increases by roughly 15% with no decrease in quality. I'd expect his Pecota comparables improved comparably.
If you think Clemens was juicing his whole career, then this is irrelevant, of course. But that's not what Nate is talking about.
Added edit: I also concur with Darren's point (#21).
The accusations don't mention 1999, when clemens performance drastically dropped. Why doesnt nate take that into account?
FYI, Bonds tested positive for steroids. There is a baseball God.
The issue is that people are using PEDs as a witchhunt to chastise players who are 'too good', and this is just a means to it. People overperform and underperform all the time, and humans are quite good at looking for patterns where there are none, looking for a reason for each streak and slump.
So, whenever someone plays too good, is generally disliked for something, and tends to be paid a lot of money to act his own way, he attracts a lot of negative attention. So, people have all the reason in the world to accuse someone like Alex Rodriguez. Oh wait, they already have.
Regardless, performance enhancing drugs are such a vague topic that it's laughable. There's no issues for people having performance enhancing surgeries, such as Tommy John surgery - some are even having it done preemptively. People take cortizone shots, which slowly paralyzes the body in large quantities eventually, doesn't anyone think of the children about that?
And, if they suddenly decided to ban and test coffee, and other caffeinated products at workplaces, then, management suddenly decides to retroactively apply this policy and penalize people for having abused such a drug in the past, a lot of people would suddenly be up the creak without a paddle.
The study itself is fine as far as what it is - given Roger Clemens's career through 1997, his career through 2007 is not particularly surprising. I think the objections are to what this means. In order to be able to make any claims about "statistical evidence that supports [McNamee's] claim" you would need to know (1) the specific years when Roger Clemens used PEDs, (2) the specific years when Roger Clemens did not use PEDs, and (3) the statistical improvement one would expect for a pitcher who used PEDs. We don't actually know any of these things, so this study ends up not telling us anything at all.
This actually raises another potential issue with using PECOTA to draw conclusions about whether a player used PEDs. If the comparable players in Clemens's projection used PEDs, then the fact that Clemens late career was similar to these guys could actually be evidence that he <u>did</u> use PEDs.
This is why there is a technical maximum on the effectiveness of projection engines, because the number we're trying to project, and the information we can base it on is a sample, not the population. The effectiveness of projection engines fall under this maximum, but try to tend to it over time, as it's improved.
Granted, PECOTA isn't the perfect way to project. ZiPS isn't either, and it's just as proprietary. The only projection system that is truly open source is Marcels. There is no perfect projection engine, since we work with imperfect data. I doubt this is the issue. It's just the most sophisticated statistical study that has been performed on this data to date.
Some things just can't be statted, because we don't know them. We can't tell how exactly how good a fielder Ruth or Gehrig was because the only data is G, PO, A, E and DP. We can't tell exactly for some players when all data is available to us. We don't know who was affected, when they were affected, and how they were affected. We can at best use the information in front of us, and it's not sufficient, so we use what we have, and hope it fits.
The problem is that this line of reasoning invalidates neural networks, both the artificial kind and the kind that are in our brains and everything from weather forecasting to econometrics. Of course the model is going to change, the only question is how the model is going to change.
When you go to the grocery store, you're projecting what you want to eat over the course of the next week or two. The first time you go to a grocery store and shop for yourself, you might not do a good job of it, but after years, your mind has adjusted its model of proper grocery store shopping with knowledge of money needed, amount of food you can eat, practical limitations due to time and dietary needs.
It's no different here. The ability to infer a model from observations is and will be increasingly important in computers over the coming decades.
That's the whole point of the Bayesian inference Nate's using.
Mental models of how things work are of course famous for failing badly and unpredictably, and being overly constrained by limited prior experience. If you suddenly had to shop for a diabetic or a platoon of hungry Marines, your mental model of shopping would fail miserably.
Regarding comparable players, the belief that similar players age similarly is why that information is included in the model, is it not? Hence the weighting of more comparable players higher? If you have a projection system that assumes Player A will age similarly to the ensemble of comparable players, then if things are working properly that ensemble should remain comparable to Player A as time goes by.
True, but computers are far better at weighing long-term observations than people are. Baseball games are quite constrained in possible outcomes, which is a huge bonus here.
Regarding comparable players, the belief that similar players age similarly is why that information is included in the model, is it not? Hence the weighting of more comparable players higher? If you have a projection system that assumes Player A will age similarly to the ensemble of comparable players, then if things are working properly that ensemble should remain comparable to Player A as time goes by.
But again, it's a learning model - it's set to learn from past mistakes in a systematic manner.
Maybe this is as much a philosophical issue as a mathematical one, but why would you start with a universe of largely irrelevant data and then use Bayesian inference to carve down some of the weights rather than starting with a minimal model and adding data after it's known to be valuable?
If you have a model which is set up to map a given (incomplete) set of variables to a given (incomplete) set of outcomes while minimizing total error, then at any given time won't you believe your model is working better than if you could consult an Oracle which could tell you the true error on the universe of outcomes?
It should be added to the great declarations of the Oracle at Delphi.
"Nothing in excess."
"Know thyself."
"Not all player-seasons have been played yet."
"It ain't over 'til the fat lady sings."
You must be Registered and Logged In to post comments.
<< Back to main