|
|
|
|
Baseball Primer Newsblog— The Best News Links from the Baseball Newsstand
Friday, May 04, 2012
I believe it was David Grabiner Lloyd George that said…“This WAR, like the next WAR, is the WAR to end all WAR.”
As many have noticed, we have not been updating the 2012 WAR data in-season. The reason for this is that we have been working on a major upgrade to our WAR framework. The link above gives a very detailed rundown including: how to compute each part of our WAR formula, our reasoning on making the changes, downloads for historical WAR data, charts of the various constants and factors we use, and a chart of differences between the new B-R WAR, the old B-R WAR and FanGraphs WAR.
...All in all, these are major improvements to the system and while there are dozens of details, here are the main points of difference between our old framework and the new one.
• Switch from BaseRuns for batting to an advanced wRAA metric.
• Folding ROE, infield singles, SO vs. Non-SO into wRAA.
• Excluding pitchers’ hitting and averaging by league rather than year from the league averages for wOBA and wRAA.
• Estimation of CS numbers for leagues they are missing.
• Use of Baseball Info Solutions Defensive Runs Saved from 2003-present (in our view the most advanced defensive metric).
• Use of a player-influenced runs to win conversion for both batters and pitchers based on PythagenPat.
• Use of a player-specific park factor for pitchers weighted by actual appearances in each park.
• After a preliminary WAR calculation, we fine-tune the replacement level on a playing time basis, so the total WAR in each league is very consistent year-to-year.
• I’ve changed the oWAR and dWAR formulations. oWAR is now called ndWAR for (no-defense WAR), but is the same otherwise. dWAR now contains the position component of WAR, so the Career Leaderboard is now dominated by SS, C and other great defensive players.
|
Support BBTF
Thanks to TedBerg for his generous support.
Bookmarks
You must be logged in to view your Bookmarks.
Hot Topics
Newsblog: JM Catellier: Is Pedro Martinez a First Ballot Hall of Famer? (129 - 4:46am, May 22)Last:  BrianBriansonNewsblog: OT: The Soccer Thread, May 2013 (1050 - 4:31am, May 22)Last:  Swedish ChefNewsblog: White Sox Ace Chris Sale Eats and Eats and Eats Without Gaining Any Weight (82 - 4:27am, May 22)Last: you got a STEAGLES? you're gonna need a STEAGLES.Newsblog: OT: NHL is finally back thread (359 - 4:02am, May 22)Last:  Robert in Manhattan BeachNewsblog: OT: NBA Monthly Thread - May 2013 (1056 - 3:41am, May 22)Last:  Los Angeles El Hombre of AnaheimNewsblog: LATimes: Microsoft unveils new Xbox One console (5 - 3:33am, May 22)Last: Dr. HouseNewsblog: [OTP-May] Politico: Congressional baseball game, May 1, 1926 (3836 - 2:46am, May 22)Last:  BurlyBuehrleNewsblog: Barry Bonds: Detroit Tigers' Miguel Cabrera 'the best' ... but not better than me (63 - 2:30am, May 22)Last: the Hugh Jorgan returnsNewsblog: Yanks, Manchester City awarded MLS expansion team (24 - 2:12am, May 22)Last: the Hugh Jorgan returnsNewsblog: Dollar Sign on the Glistening Muscle: Scouting Ballplayers in 1980s Playgirl (26 - 2:07am, May 22)Last: base ball chickNewsblog: OMNICHATTER for MAY 21, 2013 (121 - 1:50am, May 22)Last:  botemanNewsblog: Posnanski: Jeff Francoeur and ANT (56 - 1:24am, May 22)Last: Sunday silenceNewsblog: SB Nation: Five lost scouting reports (11 - 12:44am, May 22)Last: PerryNewsblog: BBTF SOFTBALL GAME IN NEW YORK--AUG 17 (311 - 12:07am, May 22)Last:  Rafael Bellylard: Built like a FielderNewsblog: Primer Dugout (and link of the day) 5-21-2013 (22 - 12:01am, May 22)Last: esseff
|
|
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. fra paolo Posted: May 04, 2012 at 10:03 PM (#4123857)Your new formula claims that Jose Bautista month's worth of games have been exactly as valuable to the Toronto Blue Jays as Omar Vizquel's 12 PAs of burnt toast production (.091/.161/.091). Please correct this glaring error in the formula before I throw myself off the nearest bridge, as it is simply impossible that such a thing could ever happen in this world.
Yours,
Every single Blue Jays fan.
So another reason to ignore war for pitchers? Been doing it for a while, apparently no reason to stop.
Depends, I know a lot of people who honestly believe that a great pitcher is worth more than a great position player, can't say I can prove 'em wrong.
70 years ago, sure. 40 years ago, maybe, post 1980, not even remotely close.
You would have to fully surrender yourself to the Dips alter, and fangraphs war to really believe that. Yes occassionally a great pitcher will be worth what a great position player is worth, but those are going to be exceptional years, not the norm.
...Jim McCormick. A 19th century pitcher with a record of of 265-214. His 83.2 WAR is right between George Brett and Robin Roberts.
See, I would think increased run environments would make great pitching scarcer, so getting hold of one of those top 5 to 10 guys in the league would make them even more valuable. Its not the Dips alter you have to sacrifice to, its the positional scarcity alter. Personally I agree with you that an everyday player is more valuable, but I don't think the argument the other way is incoherent or anything.
The last bullet point seems to be Sean just ####### with us. So oWAR didn't change but you change it's name, but dWAR did change but you left it's name the same. Got it. I guess.
I don't know, it's hard to quantify. I've actually been thinking about stuff like that the last few days with Longoria out. The Rays can lose him, their best position player, for a month and not see a significant change in their expected win total even if they ran slightly above replacement level Elliot Johnson out there every day. What would happen though if Alex Cobb was eaten by a shark tomorrow and then Shields or Price had their head smashed by a line drive in their next start? Other than the possibility of moving Davis back to the rotation they'd be looking at a pretty bad replacement and going from a starter that might go 7-8 innings allowing 2-3 runs to one that will often go 5-6 innings giving up 4-5 runs is a damned big difference both in the chance of winning each of those games and also the strain it puts on the bullpen.
Injury risk and the greater unpredictability in general of pitcher performance prevents teams from really voting with their wallets on the issue but I suspect that if they felt that pitchers were as reliable as position players to stay healthy and good then the highest paid players in the game would be a bunch of starting pitchers.
Pitchers really did seem to receive a boost. I hope I'm not falling victim to the "I'm not used to it so I don't like it" idea, but so far, I don't like it.
Not sure why you'd fold ROE into it but not Rdp. Both are largely a function of handedness and speed. Rdp does have an opportunity component to it that's unrelated to batter performance but that's a pretty thin reed. Anyway, doesn't matter much. Also, I liked having rROE separated out.
Excluding pitchers’ hitting and averaging by league rather than year from the league averages for wOBA and wRAA.
Excluding pitchers would seem to make sense for position player WAR but doesn't it screw with total WAR?
But more importantly, doesn't the league-averaging thing mean I can't do a straight comparison of players across leagues?
Use of a player-influenced runs to win conversion for both batters and pitchers based on PythagenPat.
Would somebody like to translate?
I’ve changed the oWAR and dWAR formulations. oWAR is now called ndWAR for (no-defense WAR), but is the same otherwise. dWAR now contains the position component of WAR, so the Career Leaderboard is now dominated by SS, C and other great defensive players.
I'm confused. Didn't oWAR used to have the position component, meaning it was offense relative to position? If that's now in dWAR (which is OK but has its downsides), then there has been a change to oWAR. Or is the position adjustment now in both which means WAR ne ndWAR + dWAR?
OK there is in fact no ndWAR but rather ndRAR. The old oWAR appears to be (as of 5 seconds ago at least) afWAR (with no idea what af stands for) but does still include the position adjustment. So the help tag explicitly states WAR ne afWAR + dWAR ... which seems an odd choice to make.
The basement community has been beating these advanced metrics down our throats for a few years now. Stating that such-and-such player was robbed of an MVP or Cy Young, or that so-and-so didn't deserve theirs. All based on these foolproof numbers. Sure they say WAR is imperfect, but not by much.
And now, the mathletes go through and change the formulas? Like mentioned above, now Zobrist was the most valuable position player in 2011. I don't recall his name being in the MVP chase last year, but I bet it would have been under this new formula.
To me, this puts a bit of a black cloud over the Sabremetric community. Like the numbers can be skewed to mean whatever you want them to mean (which is true, but we are all hoping for objectivity). How are we supposed to take solace in the numbers, and trust them, when they can change? It's like re-writing history.
I'm pretty skeptical about the defensive credit that Zobrist is getting for 2009 and 2011. While it was obvious that the previous defensive system was dead wrong about him (he was rated as below average at 2B in 2011) I don't know if a second baseman or right fielder is going to be able to save as many runs as he's getting credited with. Here are the rtot/yr (approximately 1200 innings) for him along with his actual innings played:
2009 2B: 27 (714.2)
2009 RF: 33 (329.1)
2011 2B: 26 (1058.1)
2011 RF: 25 (289.2)
Small sample sizes for the RF numbers of course but in both seasons the 2B numbers are in the upper 20s, that just seems too high. Zobrist is a great player and his versatility/the Rays roster construction leads to him being unfairly left out of the discussion about who the best 2B is because he plays RF a fair amount but according to new WAR he was the most valuable position player in the AL two of the last three seasons and that doesn't pass the smell test. Nearly 30 runs saved per 1200 innings for a 2B is just excessive.
A sense a fully unbiased and objective point is about to be made.
We actually had a very nice little discussion about runs per win a few days ago. I read up a bit on it then, unfortunately, by the time I had, the discussion pretty much died.
It's basically a way of translating runs into wins, rather than saying something silly like 10 runs equals 1 win, which may or may not be reasonably accurate depending on the run environment. The Pythagorean records we all know and love are derived from it.
Anyway, unless I messed up somewhere, the formula should be: RPW = (RPG^(1-z))*2 where RPG is for both teams, and z typically is 0.287.
Also, there is a cool graph from tango, showing RPW over time here.
That does seem rather nonsenseful.
Zach Grienke has the best season of the last 10 years.
And yet, Dennis Eckersley's 1992 still underwhelms.
The quick answer is that these numbers are always an interpretive estimate of what players contribute to winning. They're there to be criticized, not for "solace." And if somebody comes up with a better estimate, why shouldn't they abandon the old one?
The only people who should be put about by an upgrade in WAR formulas, as several posters above have noted with telegraphic snark, are those who had become addicted to looking at the single number in the WAR column and ending thought at that moment.
I know I'm considered a lone nut for saying these kinds of things, but there's no way that WAR numbers should ever have been considered truly objective. Or ever will be.
Mind you, I didn't say the new numbers were a better estimate. I will have to study the issue of whether Rick Reuschel's 1977 was more valuable than Willie Mays's 1962 before deciding :)
What.
The problem is that while 0.1 WAR is clearly insignificant, 0.9 WAR, while not necessarily conclusive, is fairly significant.
TZ: 0
UZR: -3
DRS: -11(!)
He's been bad, but is that even possible?
This would only give more fodder to the people who reject it out of hand.
How is that a problem? 0.9 to one significant digit is 1.
Would somebody like to translate?
A player's performance influences his run environment, which needs to be accounted for when making run-to-win conversions. A great pitcher can turn a 9-run environment into a 7-run environment, or a great hitter can turn it into a 9.5-run environment, and that is now accounted for...
...incorrectly, it would seem. The Book blog discussion turned toward the proper way to make these calculations, because the factors for pitchers were too small, giving them too much credit. Pitchers' numbers will stop looking so crazy once this gets ironed out.
That was my response when I saw that Brett Lawrie (94 OPS+) was tied for the AL lead in WAR for position players because he's +12 for fielding through 27 games. I suppose it's possible, but he's not even in the top 25 on Fangraphs.
If Player A has 0.5 WAR, and Player B has 1.4 WAR, then that is information I want to have. I don't want to see two 1's.
I assume by 'meaningless', you mean 'enough to get the Red Sox into the playoffs last year', and 'enough to prevent the Cardinals from winning the WS last season'.
We are discussing details on Tom's board, but it looks like a better approach would be to look at the equations for a case where a 2.50 team (with Greinke) faces a 4.50 team (average) and work out the changes in wins there. It's quite a bit more complicated, but I think it will handle the dominant pitcher performances much better and not penalize the dominant hitters as much either.
I apologize for not foreseeing this issue. I was a bit unsettled by the war_rate's we were generating, but I'd seen multiple signoffs on it, and thought it was the correct path.
Here are the runs above replacement for last year. The reason Kemp drops is because of the runs to wins values. As for Kershaw and Halladay, that is due to quality of opp, park factors and Phillies defense is considered much, much worse than the Dodgers.
+-----------+----------------+
| player_ID | runs_above_rep |
+-----------+----------------+
| kempma01 | 74.0 |
| braunry02 | 72.5 |
| vottojo01 | 57.0 |
| sandopa01 | 54.7 |
| uptonju01 | 53.6 |
+-----------+----------------+
5 rows in set (0.02 sec)
mysql> select player_ID, runs_above_rep from majors_war_pitchers where year_ID=2011 and lg_ID='NL' order by runs_above_rep desc limit 5;
+-----------+----------------+
| player_ID | runs_above_rep |
+-----------+----------------+
| hallaro01 | 77.734 |
| leecl02 | 70.278 |
| hamelco01 | 53.288 |
| kershcl01 | 51.543 |
| kenneia01 | 39.226 |
Greinke a
Starlin Castro's at the top of the NL list, despite being on pace for about 45 errors this year.
EDIT: As if on cue, Castro just made another error.
No, I mean meaningless by being well within the margin of error.
When you say '9 runs is in reality 6 runs, which is meaningless', I don't see how I am supposed to interpret that as '9 runs is within the margin of error'.
Either way though, I stated initially that 9 runs is not conclusive, which is the takeaway from something being within the margin of error. There is however a vast gulf between 'not conclusive' and 'meaningless'.
.9 WAR doesn't necessarily mean 9 runs.
Either way though, I stated initially that 9 runs is not conclusive, which is the takeaway from something being within the margin of error. There is however a vast gulf between 'not conclusive' and 'meaningless'.
I understand now, it's pedantry.
Wheeeeeeee!
Prince Fielder’s 50 home run season in 2007 (on a 288/395/618 line) ranks as less valuable than Omar Olivares’s 1996 effort, where he went 7 and 11 with a 4.89 era in 160 innings for the last-place Tigers. (3.6 to 3.8 WAR)
Lonnie Smith 8.3
Will Clark 8.0
That's certainly not what it used to say. In fact, it used to have Clark way out in front at 9.4. Can anyone explain what happened there?
Stan Musial and Hank Aaron and Albert Pujols, too, have suffered a great deal in their defensive rating. Babe Ruth, too, was viewed better under the old system, as was Barry Bonds. Barry went down, but is still positive; Ruth is now a negative. Williams and Mantle under the old system were slightly negative, I think; now they’re both big negatives in dWAR.
Besides formerly great fielders falling, like Yaz and Kaline and Hernandez, some who were valued highly for their defense have rocketed up even more, though. Mays, Brooks Robinson, Ozzie Smith, and Mark Belanger (Belanger now has as much aggregated dWAR as Smith in a much shorter career—and he was a tremendous fielder, all his contemporaries agree, I believe), with a lot of dWAR under the old system, have gained even more credit on defense under the new system. So has the redoubtable Cal Ripken, who came out very well under the old system, but he is now in the realm of the super great fielders. And he very well might be one, but the difference between him and Jeter in fielding is almost other-worldly.
We had some sort of discussion about Hornsby a few months ago, and some of us were surprised to see that WAR had his defense as better than Collins or Morgan then. Hornsby's defensive rating, in the teeth of the conventional wisdom of his time, was high under the old fielding evaluation--now, he's a great fielder, it looks like. See also Grich and Trammell. Honus at 21.7 I think went up, but no matter what the system, this guy never suffers. This will all take some learning and explaining, I think.
If you want to compare two fielders you never should have used dWAR, you should be using runs from DRS or TZR, rather than messing with the conversion to wins.
Is overall league-wide dWAR still negative?
I haven't studied it yet, so I don't know if it actually IS a disaster. But clearly it's going to be perceived as one.
Some strange results to be sure, and a lot of them. Thats never a good sign.
Boy, I tell you, though--how great was Brooks Robinson on defense? Most everyone after him on the leader board for a good ways are shortstops. He provided more dWAR than the great shortstops!I always thought Nettles and Clete were fairly close. How great was that Oriole infield when you had Grich at second to go along with Belanger and Brooks?
There will be some interesting discussion that will arise out of all this. That can't be anything but good.
I hate this. Keith Hernandez, Will Clark, JT Snow, Jeff Bagwell, Mark Grace, Don Mattingly, Wes Parker ... they all have negative dWARs. I realize that this all might make sense if you understand what the statistic really means, but come on. This statistic needs to be monkeyed with if anyone is going to trust it.
I can't speak for the recently updated version of WAR, but I know under B-ref's recently outdated system, the 1973 Orioles rated as the best fielding unit ever, with +11.2 dWAR.
I bet that infield played hell with opponent batters' BAbips. Did anything get through?
I do find it curious that that we just assume that WAR components for positions are static and can not change.
But if you want to know how good a guy was at his particular position there's Rfield isn't there?
Don Mattingly +33
Keith Hernandez +117
JT Snow actually comes out as -20.
EDIT: I've always ignored everything to the right of WAR on the B-Ref value ledger anyway (not that I mean it's not good information, but for my purposes I like the stuff to the left. The components that outline how you get to the final WAR. That's why I've never understood the complaints about what dWAR doesn't do a good job of measuring. It seems like the thing everyone is complaining about dWAR not providing is right there if they'd just pan over to the Rfield column)
This. No matter who cranks it out, WAR is always going to be an estimate -- nothing more, nothing less. If the fundamental processes that go into the formula are valid (and I think that's true here once the RPW issues and perhaps SP leverage get worked out), then the results will be good estimates.
The specific rankings of certain players is really quite irrelevant. WAR is only supposed to throw players into large bins based on estimated value. From a team perspective, you don't sweat decimal differences between players -- you only care generally what kind of production you can get out of a guy vs. how much you're paying for him. If you think that any version of WAR is more precise or more useful than that, you're misguided about the purpose and function of the stat.
Post announcing it
http://www.sports-reference.com/blog/?p=675
Assuming that one significant digit is the proper level of precision, it's not information. It's measurement noise masquerading as information.
This seems reasonable. The higher instability of year-to-year performance among pitchers relative to hitters would explain the both single-year and career outcomes. In a given year, the quasi-randomly distributed likelihood that a pitcher can have "everything go right" (health, hit luck/defensive efficiency, usage, control, etc.) and put up an MVP season makes the pool of potential MVP pitchers relatively large. Think of Mark Fidrych 1976, Mike Scott 1986, or Dontrelle Willis 2005. However, there is less outcome variance among position players. A defensive career year can help a guy luck into an MVP (Zobrist last year, Markakis 2008, Lonnie Smith 1989), but that's about it. Thus, the odds are significant that some Fidrych will have all the stars align for a single year, whereas the pool of sure-bet MVP candidates is relatively constrained to, say, the Pujols, Bonds, and A-Rods of the world. However, year-over-year, the position players are a better bet to come closer to repeating their previous performance, so only those pitchers who turn out to be similarly sure bets can do well in career WAR.
You must be Registered and Logged In to post comments.
<< Back to main