User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
Page rendered in 0.8151 seconds
48 querie(s) executed
| ||||||||
You are here > Home > Baseball Newsstand > Discussion
| ||||||||
Baseball Primer Newsblog — The Best News Links from the Baseball Newsstand Saturday, December 20, 2008Rob Neyer: Explaining some new math
Huh, I didn’t realize that positional adjustments is considered so ‘new’. |
Login to submit news.
You must be logged in to view your Bookmarks. Hot TopicsNewsblog: Carlyle’s Rubenstein Is in Talks to Acquire Baltimore Orioles
(5 - 6:35pm, Dec 07) Last: Howie Menckel Newsblog: OT - NBA Redux Thread for the End of 2023 (153 - 6:32pm, Dec 07) Last: Mellow Mouse, Benevolent Space Tyrant Newsblog: Yankees get Juan Soto in blockbuster trade with Padres (41 - 6:24pm, Dec 07) Last: Howie Menckel Newsblog: Jeimer Candelario, Reds reach 3-year, $45M deal, sources say (13 - 6:21pm, Dec 07) Last: Howie Menckel Newsblog: Guardians win Draft Lottery, securing next year's top pick (7 - 6:19pm, Dec 07) Last: Zach Newsblog: Eduardo Rodriguez signs with Diamondbacks: NL champs add to solid rotation on four-year, $80M deal, per report (3 - 6:15pm, Dec 07) Last: Walt Davis Newsblog: Who is on the 2024 Baseball Hall of Fame ballot and what’s the induction process? (406 - 6:15pm, Dec 07) Last: chisoxcollector Hall of Merit: 2024 Hall of Merit Ballot Discussion (190 - 6:01pm, Dec 07) Last: kcgard2 Newsblog: OT Soccer - World Cup Final/European Leagues Start (325 - 5:49pm, Dec 07) Last: AuntBea odeurs de parfum de distance sociale Newsblog: Reports: Astros, Victor Caratini agree to 2-year, $12M deal (7 - 5:23pm, Dec 07) Last: Tom and Shivs couples counselor Newsblog: Mookie Betts will be 'every-day second baseman' for Dodgers (38 - 4:14pm, Dec 07) Last: jacksone (AKA It's OK...) Newsblog: Red Sox trade Alex Verdugo to Yankees for three pitchers (29 - 4:14pm, Dec 07) Last: Walt Davis Newsblog: Jerry Reinsdorf meets with Nashville Mayor Freddie O'Connell (5 - 3:14pm, Dec 07) Last: Tom Nawrocki Hall of Merit: 2024 Hall of Merit Ballot Ballot (4 - 3:10pm, Dec 07) Last: Jaack Newsblog: 'I had tears, man': Brett's career on full display in MLB Network documentary (3 - 10:22am, Dec 07) Last: RoyalFlush |
|||||||
About Baseball Think Factory | Write for Us | Copyright © 1996-2021 Baseball Think Factory
User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
|
| Page rendered in 0.8151 seconds |
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
i was also confused that rob neyer seemed surprised at something that caused me, a sabermetric novice who only has a rudimentary understanding of what goes on in the community at large, to say "yeah, of course." but i can only imagine that rob is playing ignorant so as to get the question answered for his readers, sort of like how in movies where a bunch of scientists are talking there's always one who randomly doesn't know something he probably should just so the other guys can explain it to him (and thereby the audience).
here a question: a poor fielding shortstop or catcher might be "more valuable" than a better fielding first baseman (say the shortstop below replacement level and the first baseman is below average but above replacement level). depending on the exact numbers, once you adjust the shortstop might be "more valuable," but is he really more valuable than the team since he's not playing his position well?
Yet, I think that this particular thing they are talking about is new. However, Seidman either really crappily explained it in that email with Rob or it is far more complex than it seems it should be. Because after reading that, it does seem like it should be simple, but something is keeping me from really getting it.
So instead of saying player A is 30 BRAA, 10 FRAA, 10 BaseRunningAA, player B is 25/15/5, but the 25 is more valuable than the 30 because player B is a shortstop and shortstops tend to be weaker hitters than first basemen like A, you're saying that whole thing and now adding the fact that the 15 is more than 5 more valuable than the 10 FRAA for player A.
And no, I was not feigning ignorance. I've really had to be hit over the head a few times with this stuff, but now I do think I get it.
And Ivan, you may be right about VORP, but it's useful to understand that (for example) Carl Crawford probably isn't really a more valuable fielder than Carlos Beltran (even though his FRAA or his UZR might suggest that he is). -r
Justin, now that I know you're a finance major like I was, I know you understand the concept of differing marginal values. Just like the difference between a 200 OPS+ and a 190 OPS+ is more valuable than the difference between 110 and 100, the claim here is that being 7.5 runs over average defensively at short is more valuable than 7.5 runs over average at first. It's not that you're comparing him to second basemen now. It's that the 7.5 run difference is harder t accomplish at short than 1b, and hence, more valuable. You can't straightline compare the two as we've been doing lo these many years.
6. robneyer Posted: December 20, 2008 at 03:12 AM (#3034550)
What Jeff K. said.
Quoted in a Posnanski column on Bill James and now this? I can die, if not a happy man, somewhat less disgruntled...
Reading Jeff's #4 again, is it the case that using previous methodology, Player A would be 30/10/10 and Player B would be 40/0/5 because the positional adjustment was built into the offense?
The claim, and I use that term because I'm not positive I believe it yet, is that this is untrue. We aren't properly valuing their defensive contributions. Because it's harder to be +10 AA at SS than 1b, we have to place more value on it. So now, A is 30/10/10, and B is 30/15/10. And hence, B is more valuable.
I like what Rob said about Crawford and Beltran.
Maybe the problem is that I always thought positional adjustments were there and they were not. I think I always thought a +15 rating for a LF was easier to come by than +15 for a CF. Was this not the case? I mean, I never thought that if Albert Pujols was +15 at 1B that implied he could play SS and be a +15.
So a team full of Willie Bloomquist would be utterly average?
No, look at the terms. It'd be utterly Willie. :)
No, of course not. But you added that 15 to Pujols the same as you added 15 to Rey Sanchez. And yes, the team with Crawford and Beltran is saving 30 runs, but those 30 runs are not "created" equally, so to speak. I think that a good analogy would be low run environments vs. high. If Ty Cobb in 1905 is 30 BRAA, and Todd Hundley in 1998 is 30 BRAA, those are not of equal value. The 30 BRAA in 1908, given league scoring, is of course more valuable. Apply the same concept to defensive positional context.
For example: "It is a very confusing concept, but basically it just allows us to make cross-positional comparisons so someone like Crawford doesn’t have an overstated defensive value."
Now, if you're unfamiliar with positional adjustments (or hell, adjustments for park, era, league...) this is not the one I'd want to start with.
A stat like VORP is already adjusted before you see the final product, but in using UZR or Dewan's system, we would need to adjust for the difficulty level of playing certain positions; so (for example) Ryan Howard's +1 run at 1B isn't even close to a +5 by Feliz at 3B.
I don't see how this can be right. VORP is adjusted, yes, but it is only adjusted to normalize offense across positions, not defense as well. Whatever defensive stat you use to construct whatever your own personal version of VORP is, you should still have to adjust for defensive positional context, else I don't see where the advancement is.
I'm probably missing something, but lately it seems like the people who are working with defensive numbers are treating them more like gospel than they deserve to be treated at this point. I think anybody who starts promulgating a new stat has the duty to explain why the stat has value and how it came to be calculated. I'm not saying that people are necessarily failing to do that, I'm just saying that the burden is on the prosecution.
I understand the concept but it seems like you either adjust positionally for offense or defense not both. The whole point of adjusting offense for position is to give them credit for playing a harder position. It is inherently harder to play shortstop or catcher defensively, so sure, give them credit for that, but it doesn't make it inherently harder to hit a baseball when you play shortstop. You can either compare a player versus average for the position on both sides of the ball or you make a adjustment based on how much harder it is to field the position as described in the article, not both. Comparing to an average player at the position seems worlds more valid than these (seemingly)arbitrary positional modifiers. Is the point of this then just a quick, rough comparison of the relative defensive value of players at different positions?
As for marginal values that's a complex topic that perhaps should be applied to every statistic, not just defense... but I don't see how that's accomplished with what's being described here in any case.
Or from a practical standpoint, it doesn't matter where a run is saved when measuring defense as long as it is saved. As long as both player A and player B save the same number of runs relative to positional average it's not important that player B's position is harder than Player A's.
1. Take the individual components of Tango's "Fans' Scouting Report" - hands, first step, etc. - and see to what extent each component correlates with defensive stats at their given position.
2. Use all of that to estimate how well each MLB player could do at any other position.
Then, either
3a. Adjust all players to a common position, then evaluate defensive value there; or
3b. Devise a positional adjustment along the lines of what's being discussed here.
I dropped it because (a) life got in the way; (b) to do this, it helps to have defensive stats you can believe in, and there was enough disparity in defensive stats to throw off the correlations; (c) I can't tell if, to the extent there was little correlation, the lack of correlation with defensive stats was because defensive stats were shaky, or that Tango's FSR components aren't as position-independent as he'd hoped. At the time the FSR ratings - and a number of defensive stats - were still in their infancy, so it was hard to look at multiple years. And, again, (a).
If someone else wants to take it up, I won't be offended.
- - - -
FWIW, initial conclusions were:
(a) Ichiro! might be able to handle CF. (His FSR numbers and defensive stats reflected RF.)
(b) The Yankees' problem was not that Jeter was playing SS, but that the entire defense was built for offense. Jeter could be an A or A- fielder at other positions, instead of the C or so he was at SS; but nobody else on the roster was performing like someone who could handle SS.
The problem is, if I understand the issue, is that before you would look at Pujols as a +15 defender at 1B and a hypothetical SS who is a +15 defender at SS and conclude they are of equal value defensively. Now, suppose that SS is the same hitter as Pujols, the old methodology would proclaim them of equal value. Intuitively, this is not correct and always bothered me.
That is a claim that there is different distribution of skills at different positions, e.g. that the bell-curves are very different. That may be correct, but there's nothing in what I've seen research-wise on this that supports that, imo.
The problem is, if I understand the issue, is that before you would look at Pujols as a +15 defender at 1B and a hypothetical SS who is a +15 defender at SS and conclude they are of equal value defensively. Now, suppose that SS is the same hitter as Pujols, the old methodology would proclaim them of equal value. Intuitively, this is not correct and always bothered me.
That's not the case. If they had the same offensive numbers then the SS would have a VORP (or RAA, or whatever) significantly higher because the positional baseline offensive production would be much different. And thus, would appear to be a significantly more valuable player.
What's interesting here is that we are looking at defense in both an absolute and a relative sense.
The positional adjustment described here is giving us a total value for the player on the defensive end of the spectrum (it's comparing the player's positional value and the skill at the position relative to peers).
This makes sense to me, but it also reflects a managerial choice. For example, we can say that Carl Crawford was worth less defensively in 2008 than (say) Carlos Beltran because he was deployed in LF instead of CF. What we can't say, though, is that he's an inferior defensive player true-talent wise because we don't know. We can only use the adjustment to look at the actual.
So, let's take that same mindset to the offensive side.
Arguably, if we're doing this to address what the player actually contributed during the season---based on where a team actually deployed the player---we should also adjust offensive numbers to reflect lineup position shouldn't we? That reflects an actual contribution, too.
It seems to me that this might make a difference, though, if you are trying to put a dollar value on the worth of the players. For instance, suppose a team has a fixed sum of money to fill two slots, a shortstop and a first baseman, and needs a method to determine the best use of budget to fill both positions. In theory the team's valuation should consider that the same "runs saved" defensive contribution from the shortstop may be very rare, although it is relatively easy to find among 1st basemen. Like someone stated above, this is a form of marginal valuation.
However, based on what I can tell, it would seem difficult to come up with solid numbers to assess the defensive value differences by position.
Yes, but using offensive numbers to set the baseline is an indirect way to get at the difference in difficulty between SS and 1B. IOW, a 130+ SS will be a much better hitter than the average SS, while a 130+ 1B will be not very far above average for a 1B. The reason for doing that is to try to get at the fact that it's a lot harder to play SS than it is to play 1B.
Tango's suggestion is to stop trying to measure defense indirectly like this. Instead, try to come up with the relative value, in runs, of the difficulty of playing each position. Think of it as reflecting the number of plays made and the relative difficulty of those plays. That way we're measuring apples against apples rather than using oranges to estimate apples.
I see this approach as way too doctrinaire. Value isn't based on difficulty directly, but merely indirectly to the extent that difficulty of a position affects availability of players. We measure the availability already, there's no reason to go to a more rigid method that less takes into account actual conditions.
It's similar to crude oil and gold. Historically, the ratio between gold and oil prices trends towards 15:1. That's useful information, but most of the time, you're not in an "average" situation and you have to take into account conditions on the ground.
And that's what the normal approach does in baseball. Talent is not distributed in the same way in each era.
It's not comparing oranges to estimate apples at all. We're not trying to measure relative difficulty, but relative value. Players only have value compared to players that could theoretically replace them, not constructs of difficulty.
Traditionally, we've always compared hitters to a position specific baseline. With VORP, it's marginal value over a replacement level second baseman, where replacement level 2B offense is calculated off of the actual results of how ML second baseman hit that year. This is true at each position.
So, generally, we'd say that a player who is +10 VORP at SS is equal in value to a player who is +10 VORP at second base. You see people make arguments like this all the time.
The problem. however, is that there's an implicit assumption in doing things that way that the sum of the defensive differences between any two positions are equal to the offensive differences within a given year. That just is hardly ever true, and leads to incorrect valuations of players.
Let's use an example from last year. NL second baseman hit .271/.338/.408 last year, while NL shortstops hit .276/.334/.404. There's hardly any difference between the offensive level of the two positions, so a position-adjusted offensive number would return similar values for players who had the same line at SS and 2B. Let's say that we said that a player was +10 runs better than the average SS offensively, and average defensively for a shortstop, so we'd call him +10 runs overall. Now, we have a player who is +10 runs better than an average 2B offensively, but +5 runs better than an average defensive second baseman, so we'd call him +15 runs better overall.
In this instance, the offense plus defense would conclude that the second baseman was more valuable. However, we know that an average defensive SS is a much better player than an average defensive 2B. That gets lost in the shuffle when you compare them to the offensive average of each position and the two positions produce similar offensive output.
That's why a lot of us have gotten away from position-adjusted offensive numbers, such as VORP, and are breaking things down by component. Compare offense to average hitter (not average for position, just league average), then add a defensive metric, then toss in a fixed position adjustment that doesn't fluctuate based on yearly performance, and you get runs above/below an average player. It's just a better way to do things.
That's a bug, not a feature. Baseline value isn't fixed.
One of the problems that these "arbitrary" positional modifiers attempt to address is the fact that offensive and defensive skills are likely somewhat correlated.
To put it another way, most folks here would agree that RF > LF > 1B on the defensive spectrum. But if you look at 2008, AL RFs outhit both AL LFs and AL 1B by a decent margin.
If you use VORP, you either (a) tweak replacement level by basing it on something other than average offensive performance at the position or (b) wind up with VORP telling you that a 1B with an X OPS+ is more valuable than a RF with an X OPS+. The latter conclusion directly contradicts the normal defensive spectrum.
Defensive positional modifiers get away from that, by measuring (albeit perhaps arbitrarily) the value of playing RF v. 1B completely apart from the offensive production of those two positions. If a full-time 1B and RF post the same OPS+, the RF is 5 runs more valuable overall according to positional adjustments.
Of course, you could also use these positional adjustments to set replacement level. Start with a baseline replacement level of X RC/600 PA, adjust that based on defensive difficulty, and you have your positional replacement level to use with VORP. The math is the same; it's the definition of replacement level (or average, depending on what you're measuring) that changes.
In regards to the talent level at certain positions, it is far more fixed than variable - fluctuations within the talent distribution will never exceed some fundamental laws. LF will never be equal defensively to CF. 2B will never be equal defensively to SS. When the metrics miss this important difference, they're wrong.
I think David Cameron's post responds to this pretty well. I'll just add that I think you've implicitly defined the class of "players that could theoretically replace them" in a way that's too limited and that would cause us to miss important aspects of value. As Bill James pointed out early on, a defender's ability to save runs depends who we compare him to. Compared to a replacement SS, Mark Belanger saved quite a few. Compared to having Boog Powell at SS, he saved a roughly infinite number.
It's that latter point which is lost by looking solely to "replacement level" substitutes. The fact is that Boog Powell can't play SS, while Belanger could. That gives Belanger real value which isn't captured by comparing him solely to others who belong to the class "can play SS". Using a positional adjustment based on defensive difficulty is a way to capture that value.
This is what I keep coming back to in this internal struggle of mine.
Except that you shouldn't be using average to set the baseline. You should be using the bottom end regulars to set the baseline. This is generally a much better (but not perfect) proxy for replacement level.
If not, star-gluts at a position will throw you off.
I would also think you should be using all of MLB, not just the arbitrary AL/NL to make the comparison.
We compare him to those that could replace him, other players that could reasonably play shortstop. We compare his defense to those that can reasonably play shortstop and then we compare his offense to those that can reasonably play shortstop.
We make comparisons of this type in every aspect in our daily lives without any problems. When I buy a 4-door sedan, I don't have trouble making valuation compared to other 4-door sedans I could reasonably buy, not to $500,000 sports cars or ride-on mowers. When I buy soup and it's the worst soup available, I don't evaluate them on price as being "less than saffron" but "tastier than feces" but instead to other soups I could buy.
What we're talking about here is rigid and inflexible, based on the conceit that value is based on some absolute difference in difficulty.
But the relative availability of talent is variable. Value is not based on difficulty.
This is putting the cart before the horse. When has replacement level CF ever been better than LF/RF in recent years? Or SS better than 2B? Where is the logic in comparing a player to a fictional construct of a player instead of actual players in order to correct a problem that does not exist?
My understanding is that Tango's method (and really, he "invented" this, brilliantly so, with strong underlying data) actually means you compare SS as hitters to 1B as hitters, and then merely adjust that, rather than comparing SS to SS.
Tango and I have talked about this many times, and basically he and I disagree which is a better treatment. I am ready to be convinced, but the argument Dave makes and others convinces me more that it isn't correct.
The baseline of SS production versus 1B production varies seasonally. Right, if you want to see which salary was better, that might matter, but I want to know who was a better player, and I haven't seen any evidence that a stat like OPD does a worse job than the O+D+Pos. It could for some other uses, but I don't think it works for a seasonal comparison because it uses fixed numbers rather than the season variation (which is a little painful for me to write because I don't use single season PFs - I think they are different.)
I recognize this makes some numbers funky, but I am not sure why it throws me "off". Off what?
I strongly disagree. There are differences in defensive performance by league.
No, of course not. But you added that 15 to Pujols the same as you added 15 to Rey Sanchez. And yes, the team with Crawford and Beltran is saving 30 runs, but those 30 runs are not "created" equally, so to speak. I think that a good analogy would be low run environments vs. high. If Ty Cobb in 1905 is 30 BRAA, and Todd Hundley in 1998 is 30 BRAA, those are not of equal value. The 30 BRAA in 1908, given league scoring, is of course more valuable. Apply the same concept to defensive positional context.
I don't see how this comparison holds up. I understand 30 BRAA in a 2 runs/game environment has more value than 30 BRAA in a 4 runs/game environment. But how are the same number of defensive runs in the same scoring environment any more valuable?
I would also add that this isn't true, the game changes. RF in the 19th century was the equivalent of the modern DH. . . 3B used to be more valuable than 2B. 1B used to be more valuable than LF or RF.
In the era of turf parks, SS, 2B and CF became much tougher defensively than they are now. If you don't believe look at what those guys hit during that time-frame, compared with other eras. Those managers and GMs weren't dumb, you needed better fielders (and thus worse hitters) to be able to handle the position.
Who knows how the game will change over the next 100 years? You need a system that can adjust to changing conditions on the fly, not one that is rigid.
A: Okay, this guy is a second baseman, and he's 10 runs better than the average second baseman. So that's his value.
B: But why are you comparing him just to the second basemen? What about the shortstops? Pretty much all of them could play second base too. Shouldn't you be comparing him to them also?
I've had that conversation in my own head a lot, so this new way makes a lot of sense to me.
But this misses something very important: it's more difficult to play SS than it is to play any other position. There's more defensive responsibility there. We need to capture that difficulty somehow. If we use offense as a proxy for capturing that distinction, we're trying to describe an apple by evaluating an orange. Ideally, we'd measure that extra responsibility directly rather than indirectly.
This is NOT a dispute about the need to compare SS and 2B, it's about the proper way to make that comparison. Current measures use offensive performance as a proxy; that's the indirect way (I'd even say it's wrong when, as Palmer does, it uses average performance as a baseline rather than replacement value). Tango's trying to get a direct way to measure something everybody agrees needs to be measured.
Also, overall, I am not 100% convinced it is easier to play SS. The double play goes a long way to close the fielding gap (I think). It's a differnt discussion that I haven't pursued, but we need to think longer about that. Defensive stats (like mine or MGLs) don't properly account for DP turning "difficulty".
I see this said a lot, but I don't think it's true, and it gets at something important for this debate. If you look at the number of plays being made by 2B and 3B, they don't change much over the years. To take a purely arbitrary example, in 1906 and 1907, Johnny Evers was making 200 plays per year more than Harry Steinfeldt. That pretty strongly suggests to me that managers were mis-evaluating the defensive spectrum, not that the spectrum itself has changed.
If we limit ourselves to estimating defensive value by offensive performance, we'd miss the fact that 2B were actually making many more plays. It's only by using a direct measure like Tango does that we capture this distinction.
Johnny Evers plays were easy. 2B is a very easy position when you aren't expected to turn a DP. I played 2B in little league because I had no arm. The 3B were ALWAYS better players, before turning the DP became a huge part of the position. They were hitting and running all the time, making the pivot much less important.
Look at how 2B hit back then. If 2B wasn't easier, those managers sure were giving up a lot of offense at 3B. Were they all idiots? Players were faster, because there weren't power hitters. You needed a great arm to play 3B. Etc. It's not very hard to see how they were more valuable if you look carefully.
Those aren't remotely close to restatements of the same question. They might if you added salary in your evaluation for the MVP, but I suspect you do not.
The only explanation: IT'S A TRAP!
That makes sense. I would certainly disagree, though, that we almost always do the former and hardly ever have any interest in the latter. We pretty clearly have great interest in both.
I'm happy to give my opinions. :)
We, of course, don't have any way to know this. And see below.
Absolutely not. Managers were trying to judge positional difficulty merely by watching. That turns out to be very hard to do. It's like trying to come up with the odds for poker simply by playing the game (an example I think MGL gave once). In the absence of hard data, this is a very difficult problem. Making it even harder to evaluate is the fact that not all plays are created equal (as you noted). It is true that the plays a 3B makes are harder, on average, than those a 2B makes. But we know today that the extra difficulty pretty much balances out the fact that a 2B makes so many more. As Tango's numbers show, the two positions turn out equal. That's the real benefit of his approach, namely that we realize something we might not have recognized.
The managers then weren't stupid, any more than physicists in 1900 were stupid. They just were lower down on the learning curve.
Clearly not, and I should have been more precise. 2B have many more assists than 3B (more PO too). 1B have lots more PO, but many fewer assists.
If people are using the hit and run, sacrifice and steal nearly every time a runner is on 1B, one would have to admit the turning the DP skill drops in value.
And if 3B/2B are equal in the modern game, is it that far of a leap to see that in the deadball era 3B had more value?
What about the prevalance of turf in the 1970s and 1980s. There are very clear signs that more defensive ability was needed to play up the middle in that era - how do the adjustments proposed deal with that.
The game of baseball is absolutely not static. It evolves. Over time certain skills become more valuable than others. If the adjustments don't account for this, they are missing something.
I like the direct positional adjustments, but, I agree it definitely must be recalculated on a regular basis (not assumed to be some some of universal constant.
What really convinces me is this example. Say, there were 6 prime age ARod clones in a league at one time putting up a 150 OPS+ and 10 FRAA. And one Albert Pujols, putting up a 150 OPS+ and 10 FRAA, and a bunch of normal 115 OPS+ 1Bs with in aggregate average defense.
If you adjust for positional offense, Pujols will look better than the ARod clones. If you compare to league average offense (then add defense) Pujols will look equal to the clones.
But, we all intuitively know that the ARods could move to 1B, and be better fielders than Pujols. And, Pujols would be a train wreck at SS. To get the relative values right, you have to directly adjust for "degree of difficulty". Your good defenders at prime positions can always move down the spectrum. Your good defenders at 1B/RF/LF generally can't move up the spectrum.
I know you aren't saying we should do it this way, but I think it's obvious that you cannot do it this way. I can't imagine any system that uses league average to calibrate the baseline will work. That's what I meant earlier when I said you'd be 'off'.
If you look at the bell curve, the 6 A-Rods are way to the right. Everyone gets this.
So if you are using the bottom 20% let's say to calibrate replacement level at a position (DanR uses the bottom 3/8); the 5 extra A-Rods, only move you from using number 25-30 to number 20-25, which isn't nearly as big of a difference as using #10 instead of #15 - which is where you get moved to if you are using 'average' as your baseline.
You have to account for the fact that stars aren't always evenly distributed by position.
1) Batting Value over Replacement level hitters (all positions treated equally).
2) Fielding Value over Replacement level fielders at the same position. Generally the baseline (replacement level) here is average, although Nate Silver suggests for SS it's a little below average. Tango thinks the method Nate used to come to this conclusion is somewhat flawed.
3) A fielding position constant - this is where the SS moves ahead of the 1B, assuming both are equally valuable based on #1 and #2 (snapper's example in post 60). This fluctuates over time at different position depending on the types of players in the league, the parks, the baseballs, manager strategies that are in vogue, etc..
We have #1 figured out pretty well. #2 is improving, and will get better over time.
The question is how to come up with 3.
DanR does it by taking the bottom 3/8 of regulars at a position. If I understand correctly Tango is trying to find this without using batting and this would be ideal - if it were to fluctuate based on the evolution of the game.
But I think they are both coming up with similar numbers anyway, that's the gist of post 44 right?
Am I missing something - I'm trying to make sure I understand exactly where we are right now.
This is just assuming your conclusion. The question we're trying to answer is whether, in fact, the game was different then (at least in this particular way; there's no doubt it was different in lots of other ways). The key point I'd make is this: despite the claims that 3B was a more important defensive position then, the ratio of assists made by 2B and 3B was not any different then than it is now. That suggests pretty strongly that the relative importance hasn't changed that much. I do agree that there were fewer double plays, but there were also lots more errors (and more by 3B than by 2B). How it all adds up can't be precisely known. My point is that we shouldn't just accept the claim, we should measure it.
Pete Palmer uses league average as the baseline.
I agree that these are relevant factors (I'd add baserunning and arm also, though we aren't really talking about those here). Where I disagree is how to calculate #3. IMO, we shouldn't be using offensive metrics to estimate defensive value. We should be measuring defense by itself. That's what Tango's system attempts to do.
I completely agree. That's why we need a LOT more work on the dead ball era.
1) Dan sees the SS to 2B/3B as around .8 or .9 wins, while Tango has it as .5 wins
2) Dan sees the CF to corner gap as around .6 wins, Tango as a full win difference.
These discrepancies aren't enormous, of course, and it is somewhat encouraging that two such different methodologies come to relatively similar conclusions. Still, they aren't insignificant either, especially when looking over a long career, so one would in the end like to know which, if either, was "right".
We call this theoretical time the early 2000s.
This is evidence for 3B being tougher. SS make more errors than anyone, right? With lousy gloves and faster runners, 3B has much less room for error (pun intended) than 2B does.
I don't think raw numbers of plays are the way to measure it at all. Degree of difficulty is a much more important factor in the equation.
We have. If you assume managers act rationally, you can tell a lot by how players that played a position hit. There is a clear tradeoff between fielding and hitting availability at each position. The tougher the baseline of fielding ability needed to play the position, the worse the players chosen generally hit. I think this is a much more robust way of looking at the issue than just the number of assists and putouts players made.
Another example would be Matt Holliday for the A's. I've been thinking that his addition to the team would pretty large considering his replacement of Jack Cust. Next year he's replacing a valuable hitter who loses value in both fielding and baserunning. Now that the A's can hide Cust's glove at DH with somebody above average at fielding and baserunning would probably add at least a couple wins.
My question can be simplified to this... Is it more valuable to have a a plus baserunner at a position with a lower baserunning replacement value? Carl Crawford or Rickey Henderson would be other good examples. I guess you could just include it into offense for that position.
***
As fielders, yes.
It doesn't necessarily have to be Willie. It's always nice to have a real face to the issue. Willie plays all the positions, and he looks average, and he has overall, pretty average numbers. Plus, my readers voted him the worst player in baseball two years ago, so I have a special affinity for this guy. If you want to use someone else, Melvin Mora, or whoever, feel free to bring a better name forward. I used to call this player "Hubie Raines", in honor of Hubie Brooks and Tim Raines.
This was first brought forward in 2003/2004, and one of the first articles on the subject is here:
http://tangotiger.net/UZR9903TT.html
My blog is filled with such "multiple position" comparison. Only recently have I included the lefthanded issue, which brought the CF down a peg, and in-line with 2B/3B.
I go out of my way to say that they are not gospel and that lots of work needs to be done here.
I "promulgate" on my blog. If you think more people should cite their references more, you should take it up with them. In this case, you should shoot the messenger! Seriously, as Bill James once said, if it sounds like you are walking into the middle of a conversation, it's because you are. It would be nice if we have everything in one spot. It's just not always possible for us amateurs to find that time to get everything summarized.
A good article and followup can be had here. We just need some time. I know I would enjoy the discussion with Chris and Dan R and several others.
Now, let me comment on the rest...
Untrue.
The idea behind what VORP does is to "neutralize" the offensive disparity among positions so that you can simply add in the fielding value relative to position (which is what SuperVORP is). The end result is that you don't have a positional bias.
Otherwise, if you simply do: offense above average (without regard for position) and defense above positional average, then the average 1B will be far higher than the average SS (by some 20-25 runs or more).
Chris with OPD (offense plus defense) ensures that the average at each position is exactly equal. He does that because his presumption is that a position is a position, like QB, RB, DT. I don't do that because baseball players are alot more fluid in their movement, like Left Defensemen and Right Defensemen, or Left Wing and Right Wing in hockey. No one in the NHL would measure Centers against other Centers. You WOULD do Forward against Forward. So, you really have "pools" of positions: C, IF, OF, 1B/DH. (In baseball, the pool on the right includes the pool on the left.)
Anyway, like I said, we can have a long discussion about this.
Right, my position entirely.
People may also be surprised to know that the average RF hitter was WORSE than the average CF hitter in the 1950s. Would it make any sense that the average CF (off+def) equals the average RF in this case? You'd have to believe that the average RF was a better fielder in the 1950s.
To say nothing of when Brian Giles gets the bad luck of having Barry Bonds in his NL LF pool, and the next year he has the good luck of not having him in the NL RF pool.
And if you consider the DH/1B issue?
How about high school? Would you want the average SS there to be equal to the average 2B? OPD sets that as a requirement. Now, Chris may say that the SS from one team is competing with the SS of the other team. But, when he makes his list, he includes all players, and so if you have an average high school SS and an average high school 2B, they both come out equal in Chris' list.
That's why I can't go along with positional adjustments that make it a requirement that the average at each position is always identical under all circumstances.
These adjustments apply only to the time period it was based upon (2003-2008). If for example the 1970s was filled with a ton of fantastic fielding SS, I'd have different adjustments. So, Dan's presumption here is in fact not a presumption of my system.
***
If there's anything else that I need to directly reference, please let me know.
***
In summary, I ask the question: "How does this player field compared to Willie Bloomquist, or some composite who is average in all tools respect (speed, strength, agility, etc), and who is equally experienced at all positions?"
Since this is the common baseline that we are comparing all baseball players against, we can make the apples-to-apples comparison. And that's what those positional adjustments do.
Agreed.
While I absolutely agree that number of plays isn't enough by itself, the fact is that 2B in the deadball era were making 200 more assists each year than 3B were. That suggests that we'd have to consider 3B WAAAAY more difficult in order for the positions to be even.
Maybe they were. But the right way to test that is not by their relative offensive contributions; that's too indirect a measure. The right way is Tango's way. As he says, his estimates are era-dependent. Until someone does a good study of the deadball era which directly measures the difference, I'm going to remain agnostic on the question whether 3B was, in fact, more valuable defensively.
It's not so much a matter of rationality, it's a matter of the data they put into their reasoning process. Sure they're rational (though Bill Bergen is hard to explain on that basis), but they didn't have PBP data to actually measure what was happening on the field. They had to rely solely on what they saw. As I said above, it's very hard to sort out the relative importance of two positions when one makes a lot more plays but the other's plays are harder. Those are incommensurate factors and people in general aren't very good at weighting them without more systematic methods than simple observation.
Well that's certainly NOT my position, nor is it Tango's. The measure is not the number of assists made; I just mentioned that to show that the number of plays made at a position hasn't changed much over the years. That's suggestive but not conclusive. As I've agreed, it's important to account for difficulty as well. I just want to do that directly, rather than use offensive ability as a proxy. That's a bad choice because it depends on the accuracy (not the rationality) of a manager's observations.
An average 2B creates/saves runs the same as an average SS as compared to the league average 2B or SS as applicable. I don't think that's quite the same. I have, in this argument before, simply posted the raw values rather than adjusted against average (but then people talk about zero being the baseline - you can't please everyone , and usually not anyone). If you use zero, you can clealry see that a SS isn't "equal" to a 2B.
Untrue.
The idea behind what VORP does is to "neutralize" the offensive disparity among positions so that you can simply add in the fielding value relative to position (which is what SuperVORP is). The end result is that you don't have a positional bias.
I understand that. But your last sentence is what I understand to be the point of what you're trying to do (and yes, I realize the irony of arguing with you about what your point was. :) ) VORP's attempt is to completely neutralize the context of position. The problem is that we shouldn't neutralize that context. A 2b with a VORP of 50 is more valuable than a RF with a VORP of 50, because VORP has normalized out a part that makes him more valuable.
Here's the progression, with the last part being my understanding of what you're trying to do:
Old - No adjustments whatsoever
VORP - Adjust hitting by position, add AdjHitting+Defense+Baserunning
New - Adjust hitting by position, add defensive positional modifier, add AdjHitting+AdjDefense+Baserunning
No?
Ok, so let me ask the question: if you have a 2B who hits as well as the average 2B and who fields as well as the average 2B and you have a SS who hits as well as the average SS and who fields as well as the average SS, will they both, when you produce an "overall" list of all nonpitchers, be listed side-by-side as equals?
This is exactly what I can't follow. Are we talking about a scenario where what would happen if Frank Thomas played SS? Or are we more interested in what would happen if player X went down for 30 games?
Because we shouldn't need a construct for the latter situation. Take the average ( offence and defence ) of all players who have played games at Position X, but appeared there less than 50 times that year. That SHOULD be your replacement level.
Once that is defined, we should not need positional adjustments, as the replacement level should give us an idea of the available talent in the market ( This is making a very big assumption that the baseball player movement is governed by an efficient market ).
Or if Chipper Jones plays SS in a shift, is it counted as a game at SS?
If Willie Bloomquist is -2 wins per 700 PA as a hitter, I don't care what position he plays.
If he is -0.75 wins as a fielder, relative to the average SS, I then give him +0.75 wins because the average SS is that much better than an average overall fielder (Hubie Raines, Willie Mora, or whatever composite you want to come up with).
If he is +1.25 wins as a 1B relative to the average 1B, that's no great feat. I give a -1.25 adjustment to that.
In the end, the fielding + position_adjustment is really what his fielding is, relative to the composite average Hubie Raines.
While we have a composite average hitter (that's the league average on 189,000 PA), we don't have such an animal on the fielding side. So, I have to "make one up".
Not to belabor the point, but this is where I am getting a disconnect. I might just be a little dense today, so a little patience please.
Why do we need a fictional Hubie Raines/Willie Bloomquist figure? Every team has their version of that figure, who is plugged into holes opened up by injury. Ryan Freel or Omar Infante or Eric Bruntlett or Willie Harris. The list goes on.
What we do know however is the average production of these characters over a season at various positions in terms of solid numbers without names. So averaging all these characters over a season should give us a replacement level production at the position.
Assume this were not true. Then if say Chase Utley went down, and I don't have a replacement level player, I will trade for one ( say Iguchi if he were available), and thus set the replacement level value at 2B. Now the market of available players at a position should set its replacement level. So then you can directly compare 2Bs v 2Bs..or if you want a hypothetical replacementlevel figure, you average out the replacement level production at all positions, and normalise the replacement level for each position.
The way I understand it, you are not approaching it in the way I posited above, so maybe my disconnect.
But where do the numbers come from? Why is the adjustment from SS to 1B 20 runs? What data was analyzed to come up with that? How much variance is there?
Reposting what JoeHova said as well, because I thought it warranted saying twice:
I'm probably missing something, but lately it seems like the people who are working with defensive numbers are treating them more like gospel than they deserve to be treated at this point. I think anybody who starts promulgating a new stat has the duty to explain why the stat has value and how it came to be calculated. I'm not saying that people are necessarily failing to do that, I'm just saying that the burden is on the prosecution.
Can anyone point me to whatever study or studies these numbers are based on? I've been reading Tango's stuff for years, and he's done alot of great work, but I can't just buy these numbers that have such precision attached to them (7.5 runs? Why not just 7 or 8? Is your precision really that good?) as gospel without understanding how they were developed.
Not true is it, atleast not semantically? Shouldn't the value of a SS who is average but has lower replacement level be higher than a 2B who was average but has a higher replacement level, atleast to the team?
As individual players in no team context, your statement should have validity.
EDIT : Not to clutter up the thread, in response to #82. I was of the opinion that you wanted to compare non-pitchers across positions. I thought that was the whole idea behind positional adjustments.
Why on earth would anyone do this? He's a SS. You can't treat him as a 3B . . . you need to figure out his actual value as a SS, anything else is a wildly inaccurate shortcut, IMO.
Why are his peers only NL 2B? Why not MLB 2B? Why not MLB middle-infielders? Why not MLB infielders? Why not all MLB position players?
And, what about all the players who played multiple positions? Who are Blake DeWitt's peers? Does all of his offensive performance count as a "peer" for other 2B and 3B? Or only his time hitting while fielding those individual positions?
Those aren't rhetorical attacking questions, I'm sincerely interested in the way your position would address those questions.
But Jeter's TALL!
is this not making the same "adjustment" from different start points, that in the "new" method you are double counting (or penalising)?
1. Offense
2. Position
3. Fielding
Some ways to combine things...
Offense + Position = VORP (hitting valued by position)
Position + Fielding = Defense
VORP + Fielding = Total Value
Offense + Defense = Total Value
It's all just fun with the associative property (with some difficult details and philosophies to work out.)
Fortunately, these players are extremely few, and their PAs are rarely evenly split *and* players that play multiple positions generally play "similar" positions. I categorize a player like that *for offense* with the group he played the most defense as. His Defensive Innings is what I use to assign his offensive performance above average. Yes, there are very few for whom this will result in a few runs "off", but not many. His defense is balanced against the position he played it against. So Blake DeWitt's defense as a 3B is compared only to other NL 3B (and so on and so forth).
The error bars of offensive and defensive measurement are certainyl wide enough that if OPS says DeWitt is +30 runs above average, and wOBA says +28 runs and VORP says +26 runs above average, we're entering the realm of a quibble over a fineness none of the metrics can attain. YES, OPD could be slightly more refined in this respect *BUT* one of my key rules in working to develop a metric like this is that it is verifiable by ANYONE. My calculations are completely transparent and can be duplicated by anyone. The extra complications required to achieve VORP (or other adjustments of breaking ABs by position played) just don't add a layer of accuracy that, IMO, justify the extra work for the vast majority (what - 700 of 750 players?), particularly in most cases the differnces will be less than a run (due mostly to the fact that players that play many positions usually don't get many PAs). There are a few exceptions, but I don't think they justify the legwork.
And as there is some error in all methods, OPD stands up (confession time: I am changing the calculation due to an issue Tango and terpnats mentioned at THE BOOK BLOG) to all of them within a run or two. I was undervaluing doubles, but that's being corrected.
And this is why I can't accept it.
If we know that the average CF is a better hitter than the average RF, and we "know" that the average CF is a better fielder than the average RF, then I can't accept that the average CF is equal to the average RF. But in your system, that's what you end up with, because the average at each position will always rank as side-by-side equals (given equal playing time).
How do you reconcile this issue in high school?
But, hypothetically, what if GM's went crazy and decided to play suboptimal players at 2B? Say 2Bs are now AA-caliber, so that Uggla is a superstar in relation to the rest of the pack. Does he deserve credit for that? Yes, he is a relative advantage in one of those football style pro/con checklists when comparing teams. But is HE more valuable, or have the Marlins just made a better decision than other teams. If the Braves move their SS Escobar to 2B and bring up a replacement-level SS, their team improves and Escobar rates as way more valuable. But is he? Or are the Braves just smarter? Isn't the real upgrade from the crappy AA-level 2B to the passable SS? Escobar is who he is.
I understand that, if your goal is to account for what happened last year, you can compare only like positions to like positions and still have it all "add up". But 1) that becomes of little use once you move away from the "accounting for last year" scenario (What are our team's strengths and weaknesses? Who is more valuable to our franchise? If we're "timelining" and thus not assuming that every year of baseball history is zero-sum, who is greater, Player X from 1950 or Player Y from 2000?, etc. -- all of these questions are better answered by the more abstract method than by the "accounting" method), and 2) although you can make things add up that way, I don't see why it's actually preferable to do so.
¹ Although you don't, actually, right? You just need to field nine players. If anything, that argues further for the second positional adjustment, IMO, since if you followed that argument to its conclusion, it would lead you to "Uggla's identity is one of the hitters on the team", not "Uggla's identity is a second baseman."
² And as a bonus, said player will likely be a better defensive 2B than he was a SS.
You must be Registered and Logged In to post comments.
<< Back to main