|
|
|
|
Baseball Primer Newsblog— The Best News Links from the Baseball Newsstand
Wednesday, July 28, 2010
Baseball Prospectus has published an article by Colin Wyers today that may be one of the most important pieces written about fielding measurement in the last decade. The full piece is available only to BP subscribers, but let me briefly recap some of the topics Colin covers.
Colin reiterates the point that uncertainty in fielding measurements is something that can be tackled with bigger sample sizes, i.e., more season of data. Bias, on the other hand, is persistent. It does not decrease with larger sample sizes of fielding data. He mentions two types of bias: that related to park/scorer and that related to the fielder’s range.
He then outlines a clever method for using data like putouts and assists in order to develop a fielding metric for infielders that should be much less subject to those two sources of bias than our current advanced metrics like Ultimate Zone Rating (UZR), Plus/Minus, and TotalZone. His metric very likely has greater uncertainty than the advanced fielding metrics that use ball-in-play data to determine which fielder had the best chance to field a batted ball. However, at some point, larger sample sizes should decrease the effect of the uncertainty, such that the reduction in bias using Colin’s method will actually produce more accurate measures of fielding. Is Colin’s method better after two seasons? Three seasons? Five seasons? Because we don’t yet know the size of the park-scorer bias or range bias, we don’t know exactly at what point that occurs.
Thanks to Seth.
|
Support BBTF
Thanks to robinred for his generous support.
Bookmarks
You must be logged in to view your Bookmarks.
Hot Topics
Newsblog: YESNetwork: A look at five Yankees' cases for enshrinement in Monument Park (2 - 8:14am, May 26)Last: Jolly Old St. Neck Wound, Moral IdiotNewsblog: Maddon on Red Sox beaning Luke Scott: 'I think it's ridiculous, I think it's absurd, idiotic' (5 - 8:03am, May 26)Last: Obi One Kenobi NilNewsblog: HP: Baseball is leaving the human factor behind (60 - 7:55am, May 26)Last: Designated Sitter (GGC)Newsblog: Wilmoth: Nate McLouth Designated For Assignment (13 - 7:52am, May 26)Last: RussNewsblog: OT: NBA Monthly Thread, May 2012 (1835 - 7:45am, May 26)Last:  thokNewsblog: The Hall of Very Good: Former Cards Slugger Critical of "LaRussa's Regime" (6 - 7:16am, May 26)Last: Shooty: Applying to be Fearless LeaderNewsblog: Matschulat: Did I Miss The "Paul Konerko Is So Overrated OMG" Bandwagon? (30 - 7:15am, May 26)Last: baudibNewsblog: CSN to host ‘Phillies at the Beach’ on Memorial Day (19 - 7:11am, May 26)Last: GodNewsblog: T.R. Sullivan: Of Frank Robinson, Milt Pappas and Jim Palmer (10 - 7:09am, May 26)Last: GodNewsblog: Bud Selig -- No need for more MLB replay for now - ESPN (88 - 6:12am, May 26)Last: LassusNewsblog: Himrich’s Top Ten Target Field Foods (8 - 2:43am, May 26)Last: Long John McCaine Mutiny on the Bounty (scott)Newsblog: Boston.com: Curt Schilling’s 38 Studios lays off all staff (119 - 1:28am, May 26)Last:  Swedish ChefHall of Merit: Most Meritorious Player: 1973 Discussion (15 - 12:13am, May 26)Last: DanGHall of Merit: Most Meritorious Player: 1972 Ballot (28 - 11:25pm, May 25)Last: lieiamSox Therapy: A Winning Ballclub? (20 - 11:24pm, May 25)Last: Dan
|
|
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. TDF, situational idiot Posted: July 28, 2010 at 06:10 PM (#3601707)Or in other words, any scout with a clipboard could tell you a lot more than this metric about what shortstop to sign for next season.
That's true for batting and pitching and baserunning, too, but we still find a lot of meaning in measuring players' abilities in those parts of the spectrum over periods ranging from three years to a 20-year career.
This is not a solution (yet, by any means) for accurately quantifying a player's true fielding ability at any given moment, but it's a necessary and very important step on that road. Before, we had established that there was bias, but the only tool we had to try to measure the bias was a blunt one in WOWY. This is a sharper tool in the tool chest for measuring bias. Once we can measure the bias, we can start to eliminate it or adjust for it.
And I should have added, not that there's anything wrong with that. Especially if a player's reputation doesn't seem to match what he accomplished over a given 5-year span. Do some organizations/managers judge fielders better than others? is the kind of question this could help explore.
What about the scout in KC who thought Betancourt was good?
That's true for batting and pitching and baserunning, too, but we still find a lot of meaning in measuring players' abilities in those parts of the spectrum over periods ranging from three years to a 20-year career.
But we're lead to believe that a season's worth of data for a pitcher or hitter can be somewhat predictive. Every defensive metric I've seen discussed talks about the need for 2-3 years of data to tell us anything. If it takes that much time to accumulate enough data on a fielder for it to tell us anything, how can any of the metrics tell us anything about a player's defensive ability today?
neither bias nor variance is the concern in this situation but rather the mean squared error (MSE) or similar measure of total error. You will generally be happy to trade a little bias for a big gain in variance reduction. The point estimates aren't that big of a deal and it's more important how confident you are in saying "this guy is close enough to average to not worry about."
Now, not having read the article, I don't know how you judge bias in this situation anyway. We have no true value at the individual level to compare our estimate to (except perhaps plays made). We have some team-level measures (runs allowed) but that needs adjustment for pitching.
My guess is you guys want to look into techniques like "generalized regression" which is a rather simple but odd technique used for "benchmarking" or "calibrating" survey weights (so you need to look into sample survey statistics to find it) so they add up to population totals using various optimization criteria. I'm still not sure that will work.
It's not a question of how long it takes to tell us anything. Even one play tells us something, just not very much. It's a question of how much error you're willing to tolerate in your measurement.
My statement (really a WAG at this point) was that 2-4 seasons of Colin's system would have equivalent error to UZR or Plus/Minus or (insert your favorite advanced fielding metric here), and that for larger samples, Colin's system would probably be better, and for smaller samples, UZR or whatever might be better. I was not saying anything with respect to how many seasons of fielding data you would need to get a good read on a fielder's ability.
Fielding measurements are hard to make, and Colin's system is an important step on the road in trying to improve them. I did not make any claim, nor did Colin, as far as I know, that his system is better or worse than a scout's measurements. On that subject, though, a few thoughts:
1. What's the average error in runs on a scout's estimate of fielding ability?
2. Which scouts are better than others at estimating fielding ability?
3. How many scouts' opinions do you have access to for any given player? I have access to none, outside of something like the amateur scouting from Tango's Fans Scouting Report, which may be quite valuable. I'd love to have more of that information, but I think it's a fair assumption that we as a public community are never going to have access to that information. We would still like to estimate fielding ability, though, wouldn't we?
That's the crux of the ongoing discussion on this topic. The working assumption had been that UZR and similar systems introduced little or no bias in exchange for a big gain in variance reduction.
Colin's work here is very helpful in quantifying the size of the bias. I estimated, for instance, in the case of Ozzie Smith, that the range bias in systems like TotalZone was on the order of 5 runs per season. That's where I came up with my guess that the tradeoff in variance reduction was a losing cause at somewhere around and after the 3-season mark. That's a very rough number, not to be taken as some sort of gospel, but that's the kind of thing that Colin's work is moving us toward.
Michael Kay was AMAZED by Betancourt's play a few days ago. Ken Singleton wagged his head in agreement.
I'd use a video frame-counter off a real-time replay. But yes, it could maybe work.
Again, I do not say they aren't allowed to do as they choose; of course they are. But scholarship can't be taken seriously unless it's universally available for review, critique, and response.
So I guess nothing ever published only in a book or a journal can be taken seriously as scholarship, since by definition none of it is universally available.
You can always access these via a library somewhere. Maybe not your local one, but somewhere.
You must be Registered and Logged In to post comments.
<< Back to main