Baseball Primer Newsblog— The Best News Links from the Baseball Newsstand
Tuesday, May 16, 2006
Brian O’Neill discovers the joy of using WPA from FanGraphs...unfortunately, he has to use them on the Pirates.
The bottom four Pirates have been the out machines. They are Chris Duffy (-88), Jose Castillo (-98), Joe Randa (-100) and Burnitz (-181). Burnitz’s WPA is the lowest in baseball. Ditto for the Pirates batters’ cumulative number of -603. Neither ranking should surprise anyone.
The only WPA that shocked me was a negative 60 for Freddy Sanchez. He’s hitting .319 with a .356 on-base average and .511 slugging average and he has done a good job cashing in limited RBI opportunities. But a scan of his games shows he has done most of his best work in early innings or blowouts such as Sunday’s loss, when hits hardly move the odds needle. If Sanchez keeps hitting, his WPA should improve.
Repoz
Posted: May 16, 2006 at 05:13 AM | 101 comment(s)
Login to Bookmark
Tags:
pirates
|
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
That says a lot about the value of WPA.
I don't understand the surpise here. If a team takes a lead by one run after one inning, there are eight innings left for the opposing team to score. If a team takes a lead by one run after seven innings, only two innings remain to tie (or overtake) the deficit. Somewhat intuitively, the probability of the leading team winning when they're up by one with two innings to go is greater than if they lead by one with eight left to play. This same correlation exists with player feats during the game. Thus, via WPA, a solo shot in the first usually wouldn't have nearly the same amount of value as a solo shot in the seventh (special cases like blowouts notwithstanding).
I think WPA is more of a fun little gadget than a hardcore sabermetric tool, but what's wrong with that?
Nothing. It's when people use it to justify other things. Like in the Rowand crashing into the fence thread.
It's bad sabermetrics.
But don't let that sway your opinion of the stat.
WPA chooses the regress to the mean everything that happens over the rest of the game. It measures in-the-moment change on the chance of winning. A homer in a tied 1st has a lesser impact on winning than a homer in a tied 7th becuase the other team has fewer opportunities to respond. Given that we've regressed all future events in the game, the score, inning, and base-out state of the game makes a major difference in calculating value.
It seems like you could regress score, inning, and base-out state as well, and get something like VORP or SLWTs. Or you could regress some mix of the score, inning and base-out state to consider certain specific aspects of clutch value. Or you could account for what a player did in the context of how the game progressed after that situation as well.
Of all the above, I find the WPA assumptions the most useful, though I'd love to see what difference it made to make different assumptions. WPA accounts for the entire situation at the moment, and assesses that value in ignorance of future events, but with (assumed) full knowledge of past events. That seems to me to best correspond to how a player experiences a baseball game.
I'm not a big WPA fan myself, but this is precisely backwards. WPA is exactly the right tool for analyzing the cost/benefit of something like risking injury on a defensive play. Clearly, a fielding attempt that makes sense in a tie game might not make sense if you're down by 7 with two innings to play. I'm not arguing for/against that particular play, just saying that the likelihood that a given play will cause you to win/lose a particular game -- what WPA tells you -- is relevant for that discussion.
If you had two pitchers with the same ERA and innings pitched where one was a starter and one was a closer, the closer would have a higher WPA because his innings were more valuable, which they obviously are.
It's not any different if a hitter is getting big hits late in a game, those hits are more valuable.
I don't think so. WPA only "knows" what has happened prior to a plate appearance; it doesn't know what came later. So it says a 2nd inning HR wasn't a big deal, even if it turns out to be the game winner; it gives credit to RBI guys for hitting at the "right" time (RISP), but can't credit a high-OBP guy for getting on base at the "right" time (prior to hits). So WPA really doesn't capture a player's "true" value. If you want value, you'd do as well or better by looking at good 'ole R and RBI.
Of course, you're right that WPA is an even worse tool for measuring talent.)
It's totally different. A closer is more valuable because he's used selectively in close games, not late innings. Hitting late in the game is no better than hitting early, unless you do it disproportionately in 1) close games and/or 2) RISP.
What I like WPA for is decision-making evaluations (ie Rowand, or bullpen utilization) and "the clutch." WPA tracks value as it is experienced in a ball game - the guy who hits the 2nd inning homer doesn't know at hte time that both teams' bats will go cold, but the guy hitting in the bottom of the 8th knows how big it would be to hit a blast - which I think is a pretty cool feature.
WPA gives big credit to hitters that do 1) and 2), which for a hitter is equivalent to performing in a 'closer' situation.
It doesn't make sense to acknowledge the value from a pitcher's standpoint, but disregard it from a hitter's standpoint.
It's silly.
Let's say a batter draws a leadoff walk in the ninth inning of a tied game. WPA will give equal value to each batter. If one batter is followed by Albert Pujols and the other by Neifi Perez, the first is obviously more likely to score. Pujols will get the credit when he gets the hit. The first batter doesn't get extra credit for batting in front of Pujols.
but Pujols is getting extra credit for batting behind the guy who drew the walk...
Of course it does. Teams can insert their closers in high leverage situations, but cannot do the same with hitters ("hey, let's have Pujols bat for Molina here -- it's 9th inning w/ RISP").
The first batter doesn't get extra credit for batting in front of Pujols.
Exactly. WPA gives extra credit for timely hitting with men on base, but not for getting on base in a "timely" way. A system should either extract a player from what his teammates do (like RC or BR), or take account of how a player's performance interacted with that of his teammates'. But WPA is a weird hybrid when applied to hitters.
A hitter gets extra credit for getting on base in a timely situation, but not for the fortune of being followed by good/timely hitters.
But the reason the best hitters are in the middle of the lineup is because it makes them more likely to come up in these situations, compared to batting 1st or 9th. And when they perform in these situations they would get credit.
Steroids.
I read the piece, and I don't think its very persuasive if you are trying to point out the limitations of WPA. I think WPA is a very underrated statistic by many analysts. I'm not quite sure about the implementation of it, but I think the idea is an excellent one
So the ninth-inning slugger receives four times as much WPA credit as the first-inning guy. Is this fair? Probably not.
He doesn't really explain in that piece why he feels WPA is not a good tool, other than it does not treat all hits equally, which I guess he believes would be more accurate.
Yes, Studes has become a regular Harry Hopkins for the WPA.
I was talking about Pujols, not Bonds. :)
I disagree that WPA is a "hybrid." I think one of WPA's greatest strengths is that it's very clear, methodologically and theoretically. It records the value of each event in a baseball game based on the situation in the game at the time the event occurred - value defined as improved or decreased chance of winning that specific game. It doesn't account for future events, which can be worth considering, but I think it's quite well-defined and internally consistent.
It's a hybrid in terms of whether the opportunities provided to a hitter by his teammates impacts the metric. Most sabermetric measures (RC, Lwts, BR) try to extract a player from that context (since we know players can't control it, and at least the vast majority have no ability to raise their performance "when it matters"). It's fine to put the player back in context to measure value, but WPA only does that partially. In yesterday's AZ-SD game, WPA says that CH Park "won" almost half a game (.44), while Chad Tracy "lost" about a third of a game (-.3) for his team. One problem: AZ won the game. So why does it matter greatly how many runs Tracy's teammates had scored and allowed -- and how many of his teammates were on base -- for each of his PAs, yet not matter at all that his teammates managed actually to win the game? I don't get it....
This measures baseball as it is lived by the players, managers and fans.
I agree that there are aspects of "value" that don't seem to be covered in WPA. It's not a perfect stat. But it can be summed up precisely in one sentence, and it's internally consistent. Your criticisms seem to be demanding WPA to be something it isn't - to take future events into account. That's a valid criticism of the stat, but not evidence of its lack of internal coherence.
This is false. They don't remove anything from anywhere - they assume average context. Since the average single is worth .5 runs, they give all singles that value. The problem with that, from a VALUE standpoint is that context is everything. Things don't even out if they did then you would see players with 3 win differences between their offensive + baserunning WARP and their WPA.
The cool thing about WPA is that it doesn't do anything in retrospect, but always in the moment in the game.
Well put.
You're kidding, right? Obviously, I mean extract them from their specific context.
Also, we've been through this before - people on the site love WPA, until it gives them a number they don't like. Google up last year's MVP threads, I was hammered pretty hard for saying Ortiz was offensivly more valueable than A-rod last year, though the defense, baserunning, and positional advantage made it a wash.
This criticism of WPA I don't understand. The only argument I can see--and it's not trivial--against using WPA to measure value is that it assumes an average opponent.
Where have you been? Many of us are curious about your fork project.
The argument you cite is also valid, of course, and IMO is pretty devastating. WPA factors in <u>game state</u> context -- that is, baserunners/outs/score -- but not any other context. (It's not just that it assumes an average opponent, but it also assumes average <u>teammates</u>. If I walk in front of the pitcher, I clearly haven't increased my team's chances of winning by as much as if I walk in front of ARod.) It's not clear to me that it's any less distorting to factor in one type of context but not another.
But the one described by Matt is also valid. Let's say I hit a solo home run in the first inning, and then Bruce Chen pitches 9 shutout innings (a fan can dream, can't he?), and we win 1-0. Now let's say that in his next start, Bruce Chen pitches 9 shutout innings, and I hit a solo home run in the ninth inning, and we win 1-0. WPA thinks that I did a lot more to help the team in the second game than in the first. But I did exactly the same thing in each game.
Wow. I must be a lousy writer! I do obviously feel that WPA is a good tool, for certain things. Do I feel it should be the "ultimate" measure of value? No, for many of the reasons outlined here. I agree with Matt's comments about it.
Guy, I understand the nature of your complaints, but I don't understand why you consider those "issues." When a batter draws a walk before a big hit, he doesn't know the big hit is coming. Why give him credit? WPA reflects what everyone knows at any point in time about a game, and the likely impact of each play given the current context. That's its virtue and its weakness. I do agree that the Rowand catch was a fine application of WPA thinking. As I said in that linked article, it's all about narrative, and Rowand's catch was part of the narrative.
It's true that, in its current state, it doesn't reflect the quality of the next batter, or the quality of the current pitcher, or the quality of the pitcher who might be brought in if the current batter gets on base, etc. etc. Of course, if you factor those in, you need to then "debit" the appropriate players to make the overall math work out.
I'll leave that spreadsheet to the next person. :)
Exactly. And that's a huge problem with WPA, or rather with its most common application, and how MHS uses it as his uber MVP stat.
It's basically a truism that in the same game, for the same team, all runs have approximately* equal value. If you win 4-3, each of the 4 runs is equally game-winning: take any one away, and the game is equally tied. So the way to calculate value after the fact is to replace the player with an average player, sim the season multiple times, and take the difference between actual team wins and expected team wins, and that's your value.
WPA takes the real-time expected win value of the event where you don't know what will come after, and adds it up for later use; whereas at that later time, you are capable of assessing the actual value of the event because you know what came after. And those two sets of values are very different, and it is a huge mistake to confuse them: WPA is a useful stat in real-time, but certainly not retrospectively.
*I say approximately here because in a blowout late-game situation or even a non-blowout 9th inning lead situation, there is some defensive indifference that allows some runs to score that wouldn't be freely allowed if they were likely to affect the outcome of the game. I could buy an argument that these runs are worth a bit less than other runs.
Dunno if I agree.
I think that SNWL is by far the best way to measure the value of a starting pitcher and SNWL is conceptually close to WPA.
I think Doug Drinen's old reliever reports were by far the best way I've ever seen for measuring the value of relief pitchers and they were straight WPA.
Position players? My position has always been handle with care. There are big issues relating to team quality (for WPA -- might be interesting to do something like a support neutral WPA. But that's a honking lot of work) and batting order position. (For those interested, check out the discussions on RSB and RSBA about James Tuttle's methods. Tuttle found it neccessary to normalize opportunities. He was merely doing a simple bases gained metric, but the problems will be there in any method based on situations.)
But as long as somebody else will do the heavy lifting I'm interested in the handful of players who do significantly better or worse by WPA or RPA than by (say) runs created (or any decent metric)
If it's "about narrative," then why bother to calculate it after the fact? In real time, sure. ("That hit just increased his team's chances of winning from 48% to 56%." Assuming an average opponent and average teammates.) But after the fact, why do we care what we knew at the time he got the hit?
Of course, if one believes that knowledge of context has a significant impact on individual performance -- in other words, "clutch hitting" -- then my argument doesn't hold. In that case, we could say that he got the hit because of his knowledge of context, so it would make sense to care about it. But I don't understand people to be making that argument.
Of the two ways of looking at value, neither is inherently superior, its just a matter of which angle you want to look at it from. WPA is the perfect tool if you define value in the second way, and a terrible tool if you define value in the first.
And assuming everything is the same isn't value either. Which is what RC or whathave you does. It's a step forward towards value - not the final step.
Assuming it would be a better system, what about a game where someone hit a home run in a 5-0 game, that eventually was won 6-5. Since that became the winning run, it would give much more value than WPA, though I don't think it would be as valuable as the first run, or a home run in the bottom of the ninth of a 5-5 game.
ARP, or whatever the stat is called, is more similar to WPA since it takes into account baserunners inherited for relievers, but ignores game state. WX(RL) is the most similar stat since it compares a pitcher's effect on win expectancy to an average or replacement level pitcher.
The reason WPA seems to say that the first is less valuable (or "did less to affect the team's chances of winning") is because it isn't looking at the value of "the run scored in the first inning of that game." It's looking at the value of the run scored in the first inning of an average game. But the run scored in the first inning of that game is just as valuable as the run scored in the ninth inning of that game.
Right. SNWL removes context, doesn't it? WPA is all about context.
I think Doug Drinen's old reliever reports were by far the best way I've ever seen for measuring the value of relief pitchers and they were straight WPA.
I don't remember Doug's old reports, but I believe BPro publishes essentially the same reports today on their site. They really do have great reliever stats.
Is the value of performance based upon our knowledge of the context, or upon the actual context?
I believe that it can be based on our knowledge of the context, and I believe a good rationale could be stated that one based on "current" context is more valuable than one based on "retrospective" context (though I don't feel strongly about it; I don't see why both aren't useful). Players perform and deliver in the here and now, and evaluating them based on the incremental, current impact of their performance is most in line with their reality and the reality of anyone watching the game.
Take BPro's reliever runs added/lost report. That is based on the "current" base/out situation when a reliever comes into a game and the impact he had on a situation -- relative to what reasonably could have been expected to happen, not on what did happen retroactively. I personally think this is a great insight for evaluating the impact of a reliever.
To be a truism, I think it has to be true. I disagree with this statement, meaning I don't think it's true, meaning I don't think it's a truism.
Do you disagree that a homerun in a tie game in the ninth inning does more to increase a chance of team's winning a game than a homerun in the first inning of a tie game? For example, if a player hits a homerun in a tie game in the ninth inning, he increases the chances that his team will win by roughly 50%, give or take, from around 50% to around 100%, give or take. Do you believe that a homerun in the first inning of a tie game will also increase the chance that his team will win by 50%?
My guess is that what you disagree with is the second part of the sentence - that the run scored in the ninth inning is more valuable than the run scored in the first, which is reasonable. But the idea that a run scored in the ninth is more valuable than a run scored in the first is no more unreasonable than the idea that all runs in a one run game are equally valuable. Once again, it all depends on your definition of value.
No, it wasn't -- it *is*, but it *wasn't*. It depends on when you're doing the valuing.
The run in the ninth inning only is more valuable if you do not condition on the actual score/events of the game. This occurs in real time, since we do not have foreknowledge of future events. So you can say that a late inning HR caused a greater change in the probability of the team winning the game as perceived at the time. This approach is good to use for real time strategic decisions.
But, when we analyze players performance after the fact, the results of the games *are* known, and so conditioning on the final score is only responsible approach to take. When you condition on the outcome, all runs are equally valuable.
This is incorrect.
Take the case of "ace" relief pitchers, used only in highly leveraged situations. According to your definition, all ace relief pitchers are vastly overrated, because the ninth innings of one-run games they pitch in are no more important than the first innings of the same games. Who cares if Mariano Rivera in his prime could shut a team down in the ninth inning of a one-run game? It was no more important than if he shut them down in the second game. In the other definition, it does matter, because the extent to which he affected his team's chances of winning the game were greater in the ninth inning than they would have been in the second.
Another way to look at it is that in your definition of value, no player's contribution has any value whatsoever if his team lost the game. If a guy hits four homeruns in a game that his team loses 5 to 4, his homeruns, by definition, have no value, since we know the final result, and they lost. In the other definition they would have value, because each of his homeruns increased the chances that they would win the game.
If a player hits a homerun in the first inning of a tie game, he increases his team's chances of winning by less than if he hits the same homerun in the ninth inning of a tie game. This is a fact. And given that it is a fact, it is still a fact after the game is done.
People on this site keep insisting there is only one valid definition of value, only one reasonable way of looking at this subject. And it is this that I disagree with most strongly.
Let's say there is a crippling unexpected player's strike today that cancels the rest of the season. In 2035, you crack open The Sporting News 2006 Baseball Register to refresh your memory of the season.
If all that book had was oscar Villarreal's basic stats, you might be fooled. "5 wins as a reliever? Wow! Good ERA. Not great peripherals, but he got the outs somehow. Nice little season there by Oscar the Grouch."
Of course, Villarreal has had a bad season and allowed a bunch of inherited runners to score. WPA recognizes that, as he scores a -43 (almost a whole win below average). Contrast that with VORP, where Oscar scores as one of the top 120 pitchers in the bigs because of his low RA/9 number.
I guess some of the other reliever stats will capture this as well, but WPA does a nice job.
According to your definition, all ace relief pitchers are vastly overrated, because the ninth innings of one-run games they pitch in are no more important than the first innings of the same games.
Conditioning on the result, the ninth innings of one-run games are no more valuable than the first innings. The trick, though, is identifying the first innings of one-run games in real time. Since the 9th is no less important than the first, and he can't pitch every day, why not wait until the ninth (or eighth) to decide if you want to use the player? That's why teams can, should, and do wait until the late innings to use their best relievers.
Another way to look at it is that in your definition of value, no player's contribution has any value whatsoever if his team lost the game. If a guy hits four homeruns in a game that his team loses 5 to 4, his homeruns, by definition, have no value, since we know the final result, and they lost.
Yeah, there is a sample size problem with my proposed stat. Which is why I suggested simming seasons. However, most games are reasonably close, and have at least a chance of turning on one player's universe of possible contributions.
So, I am of two hands on this ... On the one hand, the result would have been the same if he didn't hit those HR. On the other hand, I could see that his HR have some value, but they all have the same value.
People on this site keep insisting there is only one valid definition of value, only one reasonable way of looking at this subject. And it is this that I disagree with most strongly.
I'm not one of them. Never have been. But there are invalid inferences that are made from using certain definitions of value in certain contexts, and adding up WPA to give players relative credit for their contributions after the game is over is one of those. And that is what I disagree with most strongly.
From me, no. My quarrel with WPA is that it is used out of it's appropriate context, which is real time -- that's actually a quarrel with the inference not with the stat. But leverage is mostly used as a measure of how important the selected innings were when the pitcher was actually pitching. Which is fine, that's a valid inference (mostly). I do object to value = RSAA*leverage, and in one of the Sox Therapy threads there was an interesting issue brought up where pitchers who give up baserunners create their own leverage ... you bring in two guys with a 1-run lead in the 9th ... one K's the side, the other walks one and gives up a single ... the second guy has a higher leverage even though it's his fault and not a managerial decision.
I'm a little confused by this. If you are valuing runs on the final outcome, they will have different values depending on whether the game is 1-0 or 10-0.
They would be equally valuable if the outcome or their in game context were disregarded. Excepting of course the general rule where the 1st run value > 2nd run value > 3rd run value etc in an individual game.
I think he's saying that all runs in a given game are equally valuable - not all runs across a season.
The 9th is not 'no less important', rather more important! By the eighth or ninth inning you can get extra value out of your closer by using him in close games. Conversely in blowouts you can use your worst reliever because even if he gives up runs they have negligible negative value.
I do object to value = RSAA*leverage, and in one of the Sox Therapy threads there was an interesting issue brought up where pitchers who give up baserunners create their own leverage ... you bring in two guys with a 1-run lead in the 9th ... one K's the side, the other walks one and gives up a single ... the second guy has a higher leverage even though it's his fault and not a managerial decision.
That's more of a problem with leverage compared to WPA. In those two situations, there would be a difference in leverage, but the WPA for those two pitchers would be equal.
So does that mean runs in a 10-0 are worth 1/10 of a run in a 1-0 game?
If I wasn't able to choose WPA to value runs, I think it would be more accurate to give the most value to the first run scored in a game, then less to the second run, etc., regardless of outcome.
Of course almost all measures of value today use 'runs across a season.'
However, every time someone says "on average", they're talking about regression to the mean. What WPA does is that it regresses the value of certain events to the mean value of that event in all games of a certain score-inning-base-out state. Regression to the mean, basically by definition, excludes aspects of value.
(all numbers below are totally made up for the sake of argument, the exact quantity shouldn't matter)
Let's take a hitter, call him A. Rodriguez. No, that's too obvious, say Alex R. He has a unexpectedly low WPA for his VORP-production because he hit 9 HRs in tied, bottom first innings and only 2 HRs in tied, bottom ninth innings.
On average a home run in a tied 1st has less value than a home run in a tied ninth. The latter always wins the game in every case, and so has a WPA of +.5. The former only wins the game sometimes, and often has a relatively small effect, so it has a WPA of +.1.
However, let's further say that of his 9 HRs in the bottom of the first, an astounding 6 occured in games that the Blue Fork Wankees won 1-0. Each of those was actually a game winner, in retrospect an event equal value to the two walkoffs. WPA does not record this value, because it only records the average value of those first inning home runs, not their actual importance to winning in retrospect.
Basically, I'd say that WPA is a pretty good estimate of overall value. However, there will be cases where WPA misses a significant variation in value because of its regression to the mean.
It's not wholly unlike the relationship of VORP and WPA. For the most part, big VORP means big WPA. However, as in the case of the MVP race last year, that was not actually the case, because Ortiz' production was arranged such that it had much more WPA value than would be expected in a full-regression stat like VORP. however, we don't know if the regression inherent in WPA might have missed certain aspects of Rodriguez' or Ortiz' value.
(My guess is that it didn't, in this case, and the MVP race was a dead heat. But I don't know that.)
The close games in the 9th inning are more important because you know it's likely that the game will end up being close, and so the pitchers you used will have their value maximized.
It is not a fact. You're mistaken -- or, rather, your position is correct only due to an ambiguity in the language. The two "tie games" you cite aren't the same. One is a game tied at the time, and one is a game tied at the end.
Given that the home run is the difference between the scores of the two teams, it doesn't matter when it is hit. The difference between the first inning home run and the ninth inning home run is that you don't know, when the 1st one is hit, that it will be the difference. It may be a HR in a blowout, which is less valuable than a HR in a close game. But given that we're discussing the close game, it doesn't matter when it's hit.
That's the same as the Rivera situation: pitching a scoreless first inning in a 1-run game is exactly the same as pitching a scoreless ninth inning in that game. The problem is that if you use him in the first, you don't know that it will be a 1-run game.
I don't mean to get technical, but I think this is a wrong use of "regression to the mean." WPA does use average run outcomes for base/out situations to derive tables, but there is no regression to the mean involved in WPA.
Regression to the mean is a specific technique involving correlation analysis, or a description of a phenomenon in which a subject's stats move toward the overall population average. As far as I know.
Like I said, I don't mean to be technical, but when you use it in this context I don't understand what you mean.
I think this can also be applied to hitting. By giving a hitter extra credit for home runs in 1-0 wins, it's similar to, say, giving a pitcher a win when he gives of 6 runs in an 8-6 victory. Or a no decision when a pitcher is pulled after giving up 6 runs in the first inning. The result of the game shouldn't determine the value of hitting or pitching.
To me: WPA > Context Free > Outcome Based
Thanks Matt. For that clarification and for many others in your last couple posts.
But using WPA you're giving a batter who hits a walkoff homer in a tie game extra-credit for the fact that his team managed to keep the game tied. How is that different from rewarding the pitcher for a win in your example?
What I was thinking about when I wrote that is that there is quite literally only one difference in the inputs to SNWL and WPA for a starting pitcher.
However that one difference (the specific run support) is huge. Classic case of missing the forest for the trees on my part.
Giving the teammates credit for a player's walkoff home run would be similar, in a way, to the benefits a pitcher gets for a win or no decision when he pitches poorly.
If both teams have been keeping it tied, the teammates with the player hitting the home run shouldn't benefit from his home run, he should. Just like the credit in an 8-6 win by a pitcher should go to the people who won the game, the hitters.
Using outcomes you could say a 2-1 win by a pitcher is equivalent to an 8-6 win by a pitcher since they both won, but they aren't.
But what is it called, then, when a projection uses as one factor the population mean? I guess that's a method of accounting for regression to the mean. What WPA does is that it takes the exact context and replaces it with the context of the mean set of events. What's the term for that?
One thing I'd add is that WPA is good for the butterfly effect. In other words, if a guy hits a homer in the top of the 9th to take a 4-3 lead, and then his teammates proceed to tack on five more runs, I think that homer is worth more than the average run in a 9-3 win, because it's hard to be sure that all those runs would have come across if he hadn't hit the homer.
I wouldn't use WPA exlusively and assume a complete butterfly effect, but I'd rather not assume that every event would happen completely independently of previous events. I don't think there can ever be one perfect number for value, it's good to look at a whole bunch of 'em.
It would be very distorting to use for 1 game, where all the credit went to the winning team, because terrible play on the winning team would be disregarded, and good play on the losing team would be ignored.
Doug's reports had a lot more than the WPA, though the WPA stuff was the core.
He also included a great usage chart. Here's the 1999 Yankees
Don't know about you but I think there's an awful lot of useful info packed in there.
He also included the number of times a guy was brought in with runners on, the number of times a reliever went more than an inning and the number of times a reliever entered a tie game.
I put a fair amount of time into formatting that table. Waste of my time and damned annoying.
Jim, if I can't post tables in a meaningful manner I can't see posting here period. I'm serious.
Papelbon...+1.815 +1.880
Ray..........+1.433 +1.454
Zumaya.....+1.520 +1.403
Ryan.........+1.322 +1.330
Rodriguez..+1.068 +1.245
I don't know how WX is calculated, but both measures seem to agree about value, a quarter of the way through the season.
If WPA was inherently flawed in concept it would seem it should diverge more. Since I assume WX is based on season totals, it would stand to reason that on an individual game basis, WPA would be more accurate.
No, I don't think so. When you look at a lot of WPA data (like last year's season totals), you start to realize that a lot of the data doesn't seem to make any sense (Tony Clark as 3rd-best batter in the NL last year?!). And the reason is that each PA's outcome is essentially multiplied by its leverage and then summed. When you do that, you lose information both about the leverage and about the context-free outcomes.
I think the WP framework leads to lots of interesting analyses; Leverage is one of them, but I'm increasingly believing that WPA (as a performance metric) is near useless. Well, it's useless in the way that RBIs or fielding percentage is useless: there's good information in there, but lots of noise, and lots that's missed.
This is a problem specifically with Tangotiger's implmentation of LI, not the concept of leverage. Tango's LI is on a PA-by-PA basis. Doug Drinen measured a similar statistic, but his unit of time was a partial inning (basically, the length of time that a given pitcher is on the mound). He looked at the WPA of a perfect partial inning, and called it "P" (short for "perfection", I think). This implementation isn't perfect (it looks at the value gained by getting outs, but not the cost of runs, which is probably a bigger issue when using this for batters).
BTW, Tango's counter-argument here is that, in real time, a pitcher can be pulled after every PA, and so the proper unit of time is the PA. If a pitcher does raise his own Leverage, the manager does actually make a decision to stay with him in a higher-leverage situation.
Ron,
you over-snipped me. I don't care for WPA as an overall value tool (I understand it's use for relievers), but I meant the specific use of WPA in the Rowand thread was bad sabermetrics.
Care to elaborate? It seemed like a pretty good use of WPA.
If one wants a touchstone for reasonableness, compare WX to ARP or RSAA. WX should map on a little better to ARP than straight RSAA, since ARP is sort of a hybrid measure. It includes inherited runners scored against compared to replacement level.
Also, the extent to which a pitcher creates his own leverage is going to be reflected in his performance record.
Not useless. Dammit, will you people stop saying "useless"! NOTHING in the world of baseball statistics is useless, or almost nothing anyway. Something may not do what you want it to do, or may not do everything, but that doesn't make it useless.
WPA isn't best compared to RBIs or fielding percentage (both of which are extremely useful, anyway). It's best compared to GWRBI, pitcher wins or saves.
WPA is a "payoff" stat (a very sophisticated one, as opposed to a crude one like Game-Winning RBI).
Tony Clark wasn't the 3rd-best hitter in the NL - anyway, that's not what WPA tries to measure. But if you use WPA to argue that Tony Clark's hitting paid off more than all but two other National Leaguers, I think that's a reasonable (if questionable) argument.
The play was worth 0.278? In the first inning?
WX is calculated summing values for each PA, not a formula using totals?
Why do you find that implausible? If 10 runs = 1 W, we'd expect the three+ runs saved by the catch to be worth about this much. Anyway, it is correct. Did you really not even check before calling it "bad sabermetrics?"
Ron, those are great reports that Doug put together. I agree. BTW, BPro does has WPA (they call it Win Expectancy) for relievers.
What WPA does is that it takes the exact context and replaces it with the context of the mean set of events. What's the term for that?
Matt, by this do you mean that it assumes average hitters are coming up to bat next, instead of specific hitters with known performance outcomes? If so, that's just assuming average hitters. :)
Did you read the usage?
Seriously, what's your argument? That is was worth more than .278? Less than .278? It's incalcuable? It doesn't matter?
I assume you have a point here...
You must be Registered and Logged In to post comments.
<< Back to main