Baseball for the Thinking Fan

Baseball Primer Newsblog
— The Best News Links from the Baseball Newsstand

Friday, July 22, 2011

Wyers (BP): Lost in the SIERA Madre

whenever we use a skill-based metric like xFIP or SIERA….We are using a proxy for regression to the mean that doesn’t explicitly account for the amount of playing time a pitcher has had. We are, in essence, trusting in the formula to do the right amount of regression for us. And like using fly balls to predict home runs, the regression to the mean we see is a side effect, not anything intentional.

Paywall, of course, but interesting nonetheless.  Any stat-savvy people want to comment?

Drew (Primakov, Gungho Iguanas) Posted: July 22, 2011 at 10:57 PM | 31 comment(s)
  Tags: general

Reader Comments and Retorts

   1. Miserable, Non-Binary Candy is all we deserve CoB Posted: July 23, 2011 at 03:10 AM (#3883706)
Paywall?

Looks publicly viewable to me.
   2. Drew (Primakov, Gungho Iguanas) Posted: July 23, 2011 at 03:13 AM (#3883708)
I always assume everything BP is behind the paywall. Glad this one isn't.
   3. Forsch 10 From Navarone (Dayn) Posted: July 23, 2011 at 04:25 AM (#3883726)
And yet they were using SIERA on their site just, what, a couple of months ago?
   4. Walt Davis Posted: July 23, 2011 at 04:34 AM (#3883729)
Not a big deal but his description of the effects of multicollinearity is wrong. Multicollinearity violates no assumptions of OLS -- in fact, controlling for multicollinearity is the main purpose of OLS -- no matter how high it gets. [EDIT: OK, if the collinearity is perfect, you've got a problem.]

What multicollinearity affects is the power of your regression to detect effects. The interpretation of a regression coefficient (for, say, X1) is the impact of a one-unit change in X1 controlling for all the other Xs. If X1 and X2 are highly collinear, this independent effect of X1 (and likewise of X2) is harder to detect. All that really means is that you need more sample size or, if you don't have that, that you can't tell which of those two variables is important.

However, in the case of a regression with quadratic and interaction terms, you simply don't care about collinearity because it is impossible to interpret the effect of X1 controlling for X1-squared.

Yhat = b0 + b1*X1 + b2*X1^2
dYhat/dX1 = b1 + 2b2*X1

So the effect varies with the value of X1 -- i.e. the relationship is curvilinear. That means you are estimating a parabolic relationship between Y and X1 and, at certain points on that curve, the impact of X1 will be positive, will go to zero, then will go negative (or vice versa). Wyers refers to such sign-switching as a "mistake" but it's not -- the quadratic model is assuming a different functional form from the linear.

From a strict testing perspective, all you care about is whether the coefficient b2 is statistically significant. If it is, then the quadratic provides a better fit than the linear. In short, the linear model is the "mistake" if b2 is significant.

Now Colin appears to be right _in the case of SIERA_ that the extra complexity provides such a minimal improvement to the overall fit that you might as well stick with the simpler linear model.

In essence, he at one point refers to the "fundamental" relationship between two variables but assumes (or strongly implies) that the fundamental relationship between variables is always linear. That of course isn't true. In the extreme, for the function y = k + x^2 with x distributed symmetrically around zero, there is no linear relationship (no correlation) between y and x whatsoever, but obviously you can perfectly predict y if you know x.
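
To make that last point concrete, here's a toy sketch (Python with numpy; the numbers are made up, nothing from the article):

import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-3, 3, 10000)        # symmetric around zero
y = 5.0 + x**2                       # y = k + x^2, no noise at all

print(np.corrcoef(x, y)[0, 1])       # roughly 0: no linear relationship
X = np.column_stack([np.ones_like(x), x, x**2])
b = np.linalg.lstsq(X, y, rcond=None)[0]
print(b)                             # roughly [5, 0, 1]: add the quadratic term and y is predicted exactly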

Back to the equation, something similar occurs with interactions:

yhat = b0 + b1*X1 + b2*X2 + b3*X1*X2
dyhat/dX1 = b1 + b3*X2
dyhat/dX2 = b2 + b3*X1

So it's impossible and pointless to try to interpret the impact of X1 independent of X2 (and vice versa) -- you explicitly modelled the effect of each as depending on the other, so there is no single "independent" effect to report.
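
Same exercise for the interaction case (coefficients invented purely for illustration):

# invented coefficients for yhat = b0 + b1*X1 + b2*X2 + b3*X1*X2
b0, b1, b2, b3 = 1.0, 2.0, -1.0, 0.5

def d_yhat_dx1(x2):
    # dyhat/dX1 = b1 + b3*X2 (b0 and b2 drop out of the derivative)
    return b1 + b3 * x2

for x2 in (-4.0, 0.0, 4.0):
    print(x2, d_yhat_dx1(x2))   # 0.0, 2.0, 4.0 -- the "effect of X1" depends on where X2 sits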

My main point here being: Don't throw #### into a regression equation unless you know how to interpret the results.
   5. Christopher Linden Posted: July 23, 2011 at 04:37 AM (#3883730)
SIERA has been a big part of their 2011 annual and season coverage since the offseason. It was apparent a huge amount of work went into it, and it was presented with no small degree of fanfare as the #1 ERA estimator/predictor. Now Colin publishes a document where he says SIERA works pretty well, but not so much better that it's worth the added complexity compared to FIP (which BP will now be using) and xFIP.

Some commenters on the BP thread are seeing a connection in the timing: Matt, SIERA's point man and apparent progenitor, leaves for Fangraphs, and Colin takes a swipe at SIERA, though his strongest statement boils down to "SIERA's as good as 'em all, but it's not worth the upkeep."

BP is going back to FIP (not xFIP) as its primary ERA estimator.

Happy Base Ball
   6. Chris Needham Posted: July 23, 2011 at 04:41 AM (#3883731)
Maybe it's late. Maybe I'm cranky. But this kind of article is exactly what I hate about stathead writing. It's the kind of article that makes me wonder if the Bill Conlins of the world might be on to something. Needlessly long, random charts, incomprehensible stats and references... if you haven't been following the 'debate' about that particular stat, that article is completely incomprehensible. Hell, even if you have been, it is.
   7. Tricky Dick Posted: July 23, 2011 at 04:52 AM (#3883735)
Some commenters on the BP thread are seeing a connection in the timing: Matt, SIERA's point man and apparent progenitor, leaves for Fangraphs, and Colin takes a swipe at SIERA, though his strongest statement boils down to "SIERA's as good as 'em all, but it's not worth the upkeep."


I also noticed that he criticized the "refinements" that Fangraphs is making to the original SIERA, saying that Fangraphs is "welcome to this demonstrably redundant measure." I don't have an interest in either side of this debate, but it is a rather tart response.
   8. Walt Davis Posted: July 23, 2011 at 05:14 AM (#3883741)
I think you're being cranky Chris. In his third paragraph he tells you everything those not in the debate need to know:

Some fights are worth fighting. The fight to replace batting average with better measures of offense was worth fighting. The fight to replace FIP with more complicated formulas that add little in the way of quality simply isn’t. FIP is easy to understand and it does the job it’s supposed to as well as anything else proposed. It isn’t perfect, but it does everything a measure like SIERA does without the extra baggage, so FIP is what you will see around here going forward.

If you haven't been following the debate (and I haven't) and aren't interested in the nuts and bolts (I sometimes am) there's no need to read further.

if you haven't been following the 'debate' about that particular stat, that article is completely incomprehensible.

But at some point, "specialist" literature has to assume you've been following the "debate" because it's aimed at a specialist audience. Not that sabermetrics is at a similar level but nobody expects articles in physics journals to rehash Einstein -- if you don't know Einstein, what are you doing reading the journal?

If a person doesn't know what FIP, xFIP, SIERA and whatever else are; isn't interested in how they differ; isn't interested in whether they produce different results or lead to different conclusions ... why is that person reading an article on the comparison of FIP, xFIP and SIERA?

I will agree it's needlessly long and has other issues. In its way, it's overkill in the same way that SIERA seems to be overkill. I suppose that's partly the result of the monster we've created -- if we ##### at Conlin for not using OPS, some subset of us also ##### at BPro for not using "stat of the day". B-R, Bpro, fangraphs, etc. have to be able to defend the stats they choose to present.
   9. Christopher Linden Posted: July 23, 2011 at 05:17 AM (#3883742)
To me the true takeaway of the article is simply that even after adding batted-ball data and sifting through ground balls, fly balls, data assembled by MLBAM stringers, and who knows how many dead chickens, our ERA estimators don't seem to be a whole lot better than what they've been. That, more than anything, was Colin's point here, I think. In one test there was no discernible difference between any of the four "estimators" (including, AIR, raw ERA) when the error rate was rounded to the tenth. Either we are just at the beginning of learning how our proxies for a pitcher's talent and tendencies can be alchemized into projection gold, or we are at the end, and to paraphrase another former BPer, the numbers have told us all they have to tell us.

I feel more comfortable using FIP (or in certain cases with pitchers who might have big outlier HR/FBs, xFIP) for my Vorosian evaluations.

Regarding the timing, yeah, it's hard not to see Colin's article as a slam on the SIERA metric Matt will be bringing to Fangraphs.

Happy Base Ball
   10. Christopher Linden Posted: July 23, 2011 at 05:18 AM (#3883743)
EDIT: Double post.
   11. Chris Needham Posted: July 23, 2011 at 05:27 AM (#3883746)
I think you're being cranky Chris. In his third paragraph he tells you everything those not in the debate need to know:

Wouldn't be the first time I've been called cranky... I'll just point out that the third paragraph appears to be about 300 words into the article.

But at some point, "specialist" literature has to assume you've been following the "debate" because it's aimed at a specialist audience. Not that sabermetrics is at a similar level but nobody expects articles in physics journals to rehash Einstein -- if you don't know Einstein, what are you doing reading the journal?

Completely fair point. And one I thought about putting in in the prior post.

But at the same time, I don't see BP doing as much of the explanatory articles for the non-math, non-acronym folks. Fangraphs does it more frequently -- their stuff certainly can be much more approachable at times -- but I think that 75% (add your own error bar) of the writing out there is preaching to the converted -- or preaching to a sliver of the already converted.
   12. Rally Posted: July 23, 2011 at 05:36 AM (#3883747)
I pretty much agree with Colin's take here. Same thing I've been reading from Tango since SIERA came out. FIP is a simple formula that I can keep in my head and plug into a calculator even if I'm away from my main computer, and get a result that is 99% as good. If I want something more complex and accurate I'd use my projection system anyway.

Colin mentions my name in the article, and links to several minor league pitchers who share my name. I find it funny this case of mistaken identity happens all the time, yet nobody bothers to link to my football stats for the Dolphins.
   13. SoSHially Unacceptable Posted: July 23, 2011 at 07:17 AM (#3883760)
EDIT: Double post.


Double post or not, I still expect a Happy Base Ball.

FWIW, the only word I am certain I could identify and define in Walt's first post was "Colin." That's probably another way of saying I haven't been following the debate.
   14. Voros McCracken of Pinkus Posted: July 23, 2011 at 07:27 AM (#3883762)
A while back I did a BaseRuns version of a DIPS ERA estimator that I thought had some use because it expanded the range of ERAs achieved and also because traditional FIP underrates guys who give up lots of homers but not many baserunners (and overrates the opposite). This would only be appropriate for starters though. Relief pitchers probably aren't worth the trouble.



http://vorosmccracken.com/?page_id=14
   15. Voros McCracken of Pinkus Posted: July 23, 2011 at 07:54 AM (#3883764)
As for multicollinearity, the best way for someone not versed in stats to understand it is as a sample size problem. When you have two independent variables that track very closely with one another in a regression, the coefficient for one of those variables can end up being determined by a very small number of samples even if the overall sample size is large. This is because only the samples in which the closely tracking variables actually differ count toward that coefficient.

So if you have height and weight as variables in a study of 100 and almost everybody in the sample is of a "normal" height/weight relationship and only one or two are much heavier or lighter, those one or two samples will determine almost all of the coefficient for either height or weight.
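
If it helps, here's a toy simulation of exactly that height/weight scenario (Python with numpy; every number is invented just to show the mechanism):

import numpy as np

rng = np.random.default_rng(1)
n = 100
height = rng.normal(70, 3, n)                     # inches
weight = 5 * height - 170 + rng.normal(0, 1, n)   # tracks height almost perfectly
weight[:2] += [60, -60]                           # two people well off the usual height/weight line
y = 0.5 * height + 0.2 * weight + rng.normal(0, 2, n)

def ols_se(h, w, y):
    X = np.column_stack([np.ones_like(h), h, w])
    b = np.linalg.lstsq(X, y, rcond=None)[0]
    resid = y - X @ b
    s2 = resid @ resid / (len(y) - X.shape[1])
    return b, np.sqrt(np.diag(s2 * np.linalg.inv(X.T @ X)))   # coefficients, standard errors

print(ols_se(height, weight, y))              # weight coefficient pinned down fairly well...
print(ols_se(height[2:], weight[2:], y[2:]))  # ...drop just those 2 cases and its standard error blows up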
   16. bobm Posted: July 23, 2011 at 12:53 PM (#3883785)
FTFA:
Among those is SIERA, which has lately migrated from here to Fangraphs.com in a new form, one more complex but not necessarily more accurate. We have offered SIERA for roughly 18 months, but have had a difficult time convincing anyone, be they our readers, other practitioners of sabermetrics, or our own authors, that SIERA was a significant improvement on other ERA estimators.


I do not have a dog in this fight, but I found this piece interesting. It's very rare IMO in sabermetrics that someone asks, "What incremental value does this new formula have over the currently used formulas?" It's unfortunate that this bit of introspection--after much initial touting--was only prompted after a year-and-a-half by a defection to a competitor.
   17. SG Posted: July 23, 2011 at 02:44 PM (#3883810)
Some commenters on the BP thread are seeing a connection in the timing: Matt, SIERA's point man and apparent progenitor, leaves for Fangraphs, and Colin takes a swipe at SIERA, though his strongest statement boils down to "SIERA's as good as 'em all, but it's not worth the upkeep."


It may be that a behind-the-scenes determination that SIERA wasn't worth the upkeep was the reason for Matt's departure in the first place. So it's not necessarily an after-the-fact swipe. As far as I can tell, Colin's not the intellectually dishonest type.
   18. Wins Above Paul Westerberg Posted: July 23, 2011 at 03:42 PM (#3883820)
Interestingly, this is what Dave Cameron said this week in regards to SIERA on his Fangraphs chat:

"I think SIERA is interesting. I also think the differences between SIERA and xFIP are really quite small. I'm glad we have both on the site, but I'll probably keep referring to xFIP most of the time. In cases where the difference is actually meaningful, it will be nice to be able to refer to SIERA and say "here's why xFIP might be missing something on this specific guy", but overall, they're in agreement on 99% of all pitchers."

So I guess BPro isn't alone in seeing a bit of redundancy here, even if they do come off a bit like a scorned lover.
   19. puck Posted: July 23, 2011 at 04:26 PM (#3883830)
It may be that a behind-the-scenes determination that SIERA wasn't worth the upkeep was the reason for Matt's departure in the first place. So it's not necessarily an after-the-fact swipe. As far as I can tell, Colin's not the intellectually dishonest type.


Not just that, but Colin's written about this before, and has posted plenty of comments about the topic on various sites. He very well might not have agreed with using SIERA on the site, or figured, while they have a guy there who wants to maintain it, fine, why pick that particular battle.
   20. Walt Davis Posted: July 23, 2011 at 07:58 PM (#3883917)
FWIW, the only word I am certain I could identify and define in Walt's first post was "Colin."

Not to worry, it was pure stat-geekery with no baseball content.

As far as I'm concerned, the prime example of the effects of collinearity in sabermetrics is the "BA doesn't matter" mistake. We could start by regressing (correlating) runs with BA and we'd find a strong, positive relationship. However, in the equation:

runs = b0 + b1*BA + b2*OBP + b3*SLG + e

the coefficient for BA is low and/or insignificant, which would lead one to the conclusion that, once controlling for OBP and SLG, BA doesn't matter. The question is why. Part of the answer is that BA is the largest component of both OBP and SLG -- they are fairly highly collinear. So whatever impact BA has is being captured in OBP and SLG (but not vice versa). The rest of the answer is that the coefficients in that equation are not interpreted the way people think they are.

Second point first: What does it mean to vary OBP while holding BA and SLG constant? Basically it means you drew a walk, a clearly positive event. What does it mean to vary SLG while holding BA and OBP constant? Basically it means you added an extra base, a clearly positive event. What does it mean to vary BA while holding OBP and SLG constant? Try it sometime, it's not easy to do. But basically it means something like trading a HR and 2 walks for 2 doubles and a single (and even that probably doesn't work out right) which isn't clearly a positive thing -- but who the hell wants to know the impact of that?

So first point: We can rewrite the above equation as:

b1*BA + b2*OBP + b3*SLG = b1*BA + b2*(BA + ISO_OBP) + b3*(BA + ISO_SLG)
= (b1 + b2 + b3)*BA + b2*ISO_OBP + b3*ISO_SLG

So BA has a much larger coefficient than the other two. Also, we now have more sensibly interpreted coefficients. A change in BA while holding ISO_OBP and ISO_SLG constant is a single; a change in ISO_OBP is still a walk; a change in ISO_SLG is still an extra base. The main problem, of course, is that those have different denominators, which is just dumb, so simplify as:

c1*(H/PA) + c2*((BB + HBP)/PA) + c3*(XB/PA)

and you have a nice, clean, easy to interpret regression ... which will show you that H/PA has the largest coefficient. The various linear run estimators lead you to the same conclusion -- the value of a base is roughly constant regardless of how that base was obtained but every hit adds a substantial run-scoring bonus.
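
For anyone who wants to check that algebra numerically, here's a sketch (Python with numpy; the "data" are random placeholders standing in for team-seasons, and the run relationship is invented, so only the reparameterization itself is the point):

import numpy as np

rng = np.random.default_rng(2)
n = 300
ba = rng.normal(0.260, 0.015, n)
iso_obp = rng.normal(0.070, 0.012, n)       # OBP minus BA (roughly, the walk part)
iso_slg = rng.normal(0.150, 0.025, n)       # SLG minus BA (the extra-base part)
obp, slg = ba + iso_obp, ba + iso_slg
runs = 2000 * obp + 1000 * slg - 400 + rng.normal(0, 20, n)   # made-up run relationship

def coefs(*cols):
    X = np.column_stack([np.ones(n), *cols])
    return np.linalg.lstsq(X, runs, rcond=None)[0][1:]

b = coefs(ba, obp, slg)            # b1, b2, b3: BA's coefficient looks small next to OBP and SLG
c = coefs(ba, iso_obp, iso_slg)    # same fit, rewritten in BA / ISO_OBP / ISO_SLG terms
print(b, b.sum())
print(c)                           # c[0] equals b1 + b2 + b3, and it's the largest coefficient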
   21. Der-K's tired of these fruits from poisoned trees Posted: July 23, 2011 at 08:03 PM (#3883919)
Nicely put.
   22. base ball chick Posted: July 23, 2011 at 09:47 PM (#3883944)
b1*BA + b2*OBP + b3*SLG = b1*BA + b2*(BA + ISO_OBP) + b3*(BA + ISO_SLG)
= (b1 + b2 + b3)*BA + b2*ISO_OBP + b3*ISO_SLG



o kayyyyyyyyyyy

can someone explain to me exactly what SIERA is looking for - leaving out math words i don't understand what they mean like coefficience?

i understand that FIP imagines that fielders don't matter because there is so little statistically significant difference between the best fielders and the worst that if you looked at any pitcher and pretended that he had the worst 8 fielders in the ML or the 8 best fielders in the ML he would have the exact same FIP even if he didn't have the same ERA because he wouldn't change a thing about how he pitches.

ok

i know you use FIP in fantasy baseball to decide what pitchers to pick, but is the change from FIP to SIERA enough to statistically significantly increase your winning percentage?
   23. Nivra Posted: July 23, 2011 at 11:18 PM (#3883971)
Posting here since BP doesn't allow posting by non-subscribers.

I can't believe neither this site nor BP comments mentioned what I think was a crucial takeaway from Colin's article:

"In a real sense, that’s what we do whenever we use a skill-based metric like xFIP or SIERA. We are using a proxy for regression to the mean that doesn’t explicitly account for the amount of playing time a pitcher has had. We are, in essence, trusting in the formula to do the right amount of regression for us. And like using fly balls to predict home runs, the regression to the mean we see is a side effect, not anything intentional."


That's an incredibly incisive indictment of skill-based metrics like FIP, xFIP and SIERA. It calls into question some of the basic tenets of why we are constructing skill-based metrics to begin with. It also opens up a possible avenue into a completely different set of statistics that properly regress their components.

"When you create an expected home run rate based on batted ball data, what you get is something that's well correlated with HR/CON but has a smaller standard deviation, so in tests where the standard deviation affects the results, like root mean square error, it produces a better-looking result, without adding any predictive value."


This also opens up possibilities for better methods for analyzing model fit and predictive accuracy.

I think that Colin made an error by introducing these two main points in an article framed around SIERA vs. FIP, which then became a discussion regarding formula complexity. These two points are independent and distinct, and I'd love to see more discussion about the points made above, rather than the complexity issue.
   24. Jittery McFrog Posted: July 23, 2011 at 11:29 PM (#3883978)
#22

can someone explain to me exactly what SIERA is looking for - leaving out math words i don't understand what they mean like coefficience?


There are many people around here who are waaaaay more qualified than me to answer, but since no one has chimed in yet:

FIP looks at HR, BB, and K rates and treats these as the basic skills. It doesn't look at how the effects of these skills interact.

So if a pitcher starts walking an extra guy per inning, his FIP goes up by 3, regardless of what his HR or K rates are.

SIERA tries to take into account how the skills interact. For example, if a HR-prone pitcher starts walking extra batters, his SIERA goes up more than if he were a low-HR guy.

I think that's the basic idea. The article introducing SIERA is here.
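
For reference, the arithmetic behind the "extra walk per inning = +3" point, using the standard FIP weights (the additive constant varies by league-season and HBP is usually folded into the walk term; 3.10 here is just a placeholder):

def fip(hr, bb, k, ip, const=3.10):
    # standard FIP weights; const is set each season so league FIP matches league ERA
    return (13 * hr + 3 * bb - 2 * k) / ip + const

base = fip(hr=20, bb=50, k=150, ip=200)
extra_walks = fip(hr=20, bb=50 + 200, k=150, ip=200)   # one more walk per inning
print(round(extra_walks - base, 2))                    # 3.0, regardless of the HR and K rates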
   25. Jittery McFrog Posted: July 24, 2011 at 12:05 AM (#3884014)
That's an incredibly incisive indictment of skill-based metrics like FIP, xFIP and SIERA. [...] I think that Colin made an error by introducing these two main points in an article framed around SIERA vs. FIP, which then became a discussion regarding formula complexity.


I agree on both points, although I think there are a couple of (quite fixable) problems with his critique.

One was here:

So what happens if we give all the ERA estimators (and ERA) the same standard deviation? We can do this simply, using z-scores. I gave each metric the same SD as xFIP [...]

        ERA    FIP    xFIP   SIERA
RMSE    1.94   1.87   1.86   1.84

In essence, what we’ve done here is very crudely regress the other stats (except SIERA, which we actually expanded slightly) and re-center them on the league average ERA of that season. Now SIERA slightly beats xFIP, but in a practical sense both of them are in a dead heat with FIP, and ERA isn’t that far behind. As I said, this is an extremely crude way of regressing to the mean. If we wanted to do this for a real analysis, we’d want to look at how much playing time a player has had. In this example we regress a guy with 1 IP as much as a guy with 200 IP, which is sheer lunacy.


It seems to me that the most important claim of the article hasn't been tested properly. If you think the claims for SIERA "are as imaginary as they can get", then why not see if they still disappear when a non-crude regression is applied to each of ERA, xFIP, FIP and SIERA? For example, why not, for each predictor, regress to the SD that provides the lowest RMSE? If I want to know whether SIERA is measuring something that other predictors aren't, I want to see which is closest when each is at its best, not which is closest after each has been run through a crude filter. Now maybe he has done something like this; if so, I haven't found it yet, which may very well be my fault.
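
Mechanically, both the article's test and the variant I'm suggesting are easy to sketch. This is just the operation, with hypothetical array names (era, fip, xfip, siera, next_era) standing in for matched pitcher seasons, not the article's actual data:

import numpy as np

def rescale(x, target_sd, target_mean):
    # the article's move: z-score a metric, then give it a chosen SD and center
    return (x - x.mean()) / x.std() * target_sd + target_mean

def rmse(pred, actual):
    return np.sqrt(np.mean((pred - actual) ** 2))

def best_regressed_rmse(pred, actual):
    # the variant above: for each estimator, pick the shrinkage toward the mean
    # that minimizes RMSE, then compare estimators at their individual best
    return min(rmse(rescale(pred, s * pred.std(), actual.mean()), actual)
               for s in np.linspace(0.0, 1.5, 151))

# usage (hypothetical arrays):
# print(best_regressed_rmse(fip, next_era), best_regressed_rmse(siera, next_era))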

Also, one of the claims that the creators of SIERA made is that

Looking through all of these different tests, it is apparent not only that SIERA is the best ERA estimator currently available, but specifically that it is exceptionally strong at measuring the skill level of specialized kinds of pitchers.


If SIERA claims to measure the skill of certain pitchers particularly well, then one should probably test those separately before "moving on".
   26. tshipman Posted: July 24, 2011 at 12:11 AM (#3884023)
That's an incredibly incisive indictment of skill-based metrics like FIP, xFIP and SIERA. It calls into question some of the basic tenets of why we are constructing skill-based metrics to begin with. It also opens up a possible avenue into a completely different set of statistics that properly regress their components.


Isn't the purpose of regressing home runs to fly balls to find a proper regression? I don't understand what you mean here.
   27. Nivra Posted: July 24, 2011 at 04:41 AM (#3884167)
Jittery: I agree with that as well, with a small caveat. IMO, the crude regression already shows that the differences between xFIP, FIP, and SIERA are negligible. 1.84 vs. 1.86 and 1.87? That's the tiniest of incremental gains, if it even is significant, which I doubt.

It definitely does call for more research regarding proper regression of analytic metrics like ERA, FIP, xFIP and SIERA. I think a viable finish to the article would be a call for further research in this area inasmuch as this is the first time I've seen this particular critique, and it's an extremely important matter for these metrics.

#26/tshipman: The argument xFIP makes is that HR/FB should be 100% regressed to league avg, essentially assuming that pitchers have very little ability to influence HR/FB. The fact that xFIP appears to predict ERA better than FIP supports this assumption. Colin is making the argument that the assumption is incorrect. xFIP does better than FIP simply because FB-rate has lower variance than HR/PA. Thus, by using lgAvg HR/FB * pitcher FB/PA, you reduce the variance in your metric. This makes the predictor better because you are "regressing to the mean."

The question with regression is always, "how much do you regress?" xFIP effectively chooses an arbitrary regression amount based on the variance difference from HR/PA to FB/PA. This is most likely not the optimal amount to regress, but it's better than doing no regression (FIP). That's why xFIP does better than FIP. It's not because HR/FB is outside of pitcher ability; in fact, Colin gives data that clearly shows pitchers do exert some control, and that HR/CON has additional value over and above FB/CON.

He basically calls into question how effective all the newer estimators are outside of their ability to regress to the mean. Is FIP better because pitchers don't influence BABIP or because FIP regresses more than ERA? Is xFIP better than FIP because pitchers don't influence HR/FB or because xFIP regresses more than FIP? He gives evidence that the bulk of the improvement we see for xFIP and SIERA is due to regression, and also that the difference between FIP/xFIP/SIERA and ERA is much smaller than previously thought.

What he doesn't do is a more rigorous analysis (like Jittery is calling for above) of how much of the improvement is due to regression vs. actually reducing noise due to abilities outside of the pitchers' control. This is definitely the next step in this line of analysis. The step after that would be to rethink the way we calculate these metrics and how the different components regress independently, and come up with a new metric that does this properly.
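
The "how much do you regress?" question has a clean toy version, by the way: simulate talent plus noise and see how far toward the league average you should pull the observed rate. The spreads here are invented and have nothing to do with the real HR/FB numbers; it's only meant to show that partial regression beats both extremes.

import numpy as np

rng = np.random.default_rng(3)
n = 5000
talent = rng.normal(0.105, 0.015, n)         # true HR/FB "skill" (spread made up)
year1 = talent + rng.normal(0, 0.020, n)     # what we observe, with sampling noise
year2 = talent + rng.normal(0, 0.020, n)     # what we want to predict

lg = year1.mean()
def rmse(pred):
    return np.sqrt(np.mean((pred - year2) ** 2))

print(rmse(year1))                     # no regression at all (use the raw observed rate)
print(rmse(np.full(n, lg)))            # 100% regression (xFIP-style: everyone gets league average)
k = 0.015**2 / (0.015**2 + 0.020**2)   # optimal shrink = var(talent) / (var(talent) + var(noise))
print(rmse(lg + k * (year1 - lg)))     # partial regression beats both extremes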
   28. Austin Posted: July 24, 2011 at 04:57 AM (#3884171)
Colin always seems particularly incendiary when he writes an article, and this one certainly comes off that way.

For the record, even having a statistic like FRA to measure the value already contributed by the pitcher, I don't think it's a problem to have "mini projection systems" (as Colin insinuates that they are) like SIERA to fill the role of predicting future performance. Projection systems can't generally be run on a daily basis, so component-based statistics that purport to measure underlying skill can still be useful even if Colin's criticisms are taken to be accurate.
   29. tshipman Posted: July 24, 2011 at 05:32 AM (#3884177)
#26/tshipman: The argument xFIP makes is that HR/FB should be 100% regressed to league avg, essentially assuming that pitchers have very little ability to influence HR/FB. The fact that xFIP appears to predict ERA better than FIP supports this assumption. Colin is making the argument that the assumption is incorrect. xFIP does better than FIP simply because FB-rate has lower variance than HR/PA. Thus, by using lgAvg HR/FB * pitcher FB/PA, you reduce the variance in your metric. This makes the predictor better because you are "regressing to the mean."

The question with regression is always, "how much do you regress?" xFIP effectively chooses an arbitrary regression amount based on the variance difference from HR/PA to FB/PA. This is most likely not the optimal amount to regress, but it's better than doing no regression (FIP). That's why xFIP does better than FIP. It's not because HR/FB is outside of pitcher ability; in fact, Colin gives data that clearly shows pitchers do exert some control, and that HR/CON has additional value over and above FB/CON.


Okay. I see what you mean here now. You're right that it's interesting, but I don't know how much more information even a perfectly regressed statistic provides.

Thank you for explaining it.
   30. Nivra Posted: July 24, 2011 at 07:26 AM (#3884182)
Discussion over on the Book about this: http://www.insidethebook.com/ee/index.php/site/comments/siera_updated/
   31. studes Posted: July 24, 2011 at 11:37 AM (#3884195)
I don't see anything new in this regression discussion. Things like FIP and xFIP have always been acknowledged to be "lazy" regression. But that's because they were both intended to be more descriptive than predictive--they were intended to isolate certain parts of a pitcher's performance and come closer to some sort of "truth."

Certainly, that glimpse of the "truth" can be improved upon by using better regression techniques. In my mind, that's what projection systems should do. However, I don't feel qualified to state whether SIERA, or any other stat, is an improvement or not. (and reading Colin's article, along with Matt's reply, won't help. Not enough brain cells in this old head of mine.)
