User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
Page rendered in 0.3045 seconds
41 querie(s) executed
Dialed In — Wednesday, November 02, 2005D - ## ! D- ## !I love the defense. As long as I have been stumbling around the internets, I have been arguing long and loud about how much defense was undervalued by Statheads. However, when I started I was arguing some things I would never say now, but I was at one point just as ignorant as Bill Plaschke. Looking back, I was pretty amazed at the Re-Education of Chris Dial from November of 1997 to February of 1998. In June of 1998 , I devised some methodology to evaluate both sides of the ball using Zone Rating data, or about the same thing as SuperLwts/UZR. I went through a few permutations of offense and defense, and began using Jim Furtado’s Extrapolated runs, and did a ton of work with Dale Stephenson (Google that name for the work of a top-notch sabermatrician) in determining runs for defense. Dan, we still need to get Dale to post his Peak Lists here. Last year, I wrote a couple of “Who should be MVP?” articles. Right now the methodology link is dead, but it is pretty close to the above rsb post. I wanted to improve what I had done. So I did some data-mining. I went back to the source – STATS - to better determine what ZR chances occurred at each position, and worked backwards from there to calculate defensive runs. In doing so, I was able to come up with good averages for balls in play based on 3000-7000 defensive games (up to 60000 innings and 7000-22000 chances depending upon the position), and effectively draw a baseline of where a fielder’s production will lie based on his zone rating. Converting to runs is simple enough, once you figure out chances. So I have done all this, worked on my calculations, and generated defensive runs. These runs saved (RS) are above average and specific to the individual’s playing time. Yes, the first critique is: I have to do one of two things: normalize everyone to the same number of chances (the average) and indicate that the rate would result in so many more (or less) outs and runs. Or I use the average out conversion rate and subtract actual outs from average player outs. I’d certainly prefer to do it the second way, but I can’t. So I do it the first way. I have some seasons where I can do it the second way. Working through the math, the difference between defensive plays at shortstop (the position with the most plays on average) converted to outs is plus or minus four plays. That’s three runs. So the first way is going to be within three runs of the second way from best to worst in 525 chances. I took all the positions, made a nice spreadsheet, and have it ready to calculate runs saved above (or below) average. The basic calculation is: This yields a Player’s RS(cal), where (cal) is every inning of every game. We then subtract the league average RS(cal), and adjust those runs by the player’s actual playing time. That yields Runs saved compared to what a player converting outs at X rate given an equal number of chances normalized to the playing time. I know – how much does the normalization affect the data – as noted above, not very much. With these formulas, you can generate good defensive value numbers on your own, anytime you need to. The results are going to be robust too. What about UZR? MGL has said that UZR *has* to be better than ZR because it is an extension of ZR. Well, it isn’t exactly. MGL converts all of the data from STATS zones to a different grid (Project Scoresheet) that is far less discriminating. Envision this: This doesn’t turn out too terrible because Zone T is the responsibility of second basemen in ZR. This is why UZR misses 2B the most (I think). Half of the groundballs hit in that portion of the PS 34 zone are in ZR by default at full credit, whereas in UZR they are not. And so it goes around the diamond. In addition, the PS 4M zone (up the middle to the 2B side of second) is split in ZR. Half of it is the responsibility of the 2B, but the other half is not. The 3B/SS side of the field sees the same problems. As I pointed out in Mike Emeigh’s great eight part series on Jeter’s defense, there was a zone assignment error from Project Scoresheet that rendered one season of Jeter’s data unusable. It’s a big issue. In a nutshell, UZR is a very nice system. However, I believe the proprietary nature of the STATS’ raw data compels the data user to “tweak” it, and in my opinion the use of Project Scoresheet zones makes it less accurate than generating runs saved from ZR, as I have done. |
BookmarksYou must be logged in to view your Bookmarks. Hot TopicsSteve Austin is not a Baseball Player
(159 - 12:27am, Jul 07) Last: Infinite Yost (Voxter) Defensive Replacement Level Defined (41 - 1:20pm, Mar 14) Last: Foghorn Leghorn Reconciliation - Getting Defensive Stats and Statheads Back Together (30 - 1:42pm, Apr 28) Last: GuyM Handicapping the NL East (77 - 2:02pm, Oct 15) Last: The Interdimensional Council of Rickey!'s Landing Buerhle a Great Move (79 - 8:43am, Feb 04) Last: Foghorn Leghorn Weekly DRS Update (Defensive Stats Thru July 19, 2010) (3 - 2:47pm, Sep 27) Last: Home Run Teal & Black Black Black Gone! You Have Got To Be Kidding Me (8 - 3:52am, May 01) Last: Harris Weekly DRS Update (Defensive Stats Thru July 4, 2010) (2 - 4:05pm, Jul 11) Last: NewGrass Weekly DRS Update (Defensive Stats Thru Jun 29, 2010) (5 - 12:47pm, Jul 04) Last: Harveys Wallbangers Weekly DRS Update (Defensive Stats Thru Jun 13, 2010) (15 - 1:51am, Jun 16) Last: Chris Dial Weekly DRS Update (Defensive Stats through games of June 6, 2010) (17 - 7:08pm, Jun 14) Last: Foghorn Leghorn Daily Dose of Defense (41 - 8:31pm, Jun 04) Last: Tango 2009 NL OPD (Offense Plus Defense) (37 - 11:22pm, Feb 17) Last: Foghorn Leghorn NOT authorized by Major League Baseball or its Member Teams (40 - 7:32pm, Feb 16) Last: GregQ 2009 AL OPD (Offense Plus Defense) (35 - 9:05pm, Jan 05) Last: Foghorn Leghorn |
|||||||
About Baseball Think Factory | Write for Us | Copyright © 1996-2021 Baseball Think Factory
User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
|
| Page rendered in 0.3045 seconds |
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
Well, that's what we don't know - how many OOZ plays there are. But take it to 400 in-zone BIP if you want, with a .750 average ZR. The in-zone guy will catch 300 of those; the OOZ guy will get 5 plays OOZ but if he gives up more than one ball in-zone to catch those five OOZ he'll still have a lower ZR.
Jump the average ZR higher, to .850. The in-zone guy makes 340 plays. The OOZ guys still gets 5 balls OOZ, but if he gives up even one in-zone ball to catch those 5 OOZ balls, his ZR will drop below the in-zone guy's.
-- MWE
Ooof. No, that assumes the *other* method is right, when it uses the larger zones. Of course, maybe that's why you had quotes around it.
My method that he references is slightly different, but maybe it shouldn't be.
... here is the CF DA data from 1988-1992. Parsing the explanation, NEB comes out of how many 2B and 3B were hit to a fielder's zone, compared to the league average for that position.
If you just search Google groups for "Dale Stephenson" "fielding runs" and/or "defensive average", you'll find all kinds of great stuff on the topic. It's presented pretty well and I really wish someone was still distributing data this way.
Are we confident in pegging an individual hitter's run contribution within a margin of 5? Fairly so ... but I don't know that anyone would argue that a batter that was +10 was indisputably better than one that was +5. There's a margin of error around everything they do.
Now you know why I pimp rsbb so much.
When Giants walked the Earth, and all that.
Szym and I are working on presenting the data like that all the time. My niext piece will be up shortly.
Stupid awards articles... (I kid! I started it, and I love it when Ant writes.)
No, we aren't. I think the margin is right around 5 runs.
I have generally contended that methodology was about half as close as offense (3-4 and 6-8). But that's mostly due to the limitations of not being able to reach the minutia of all teh context that exists on every BIP. It simply cannot be done.
Solves more trouble than you can imagine.
Life is not quite long enough to dig through all the interesting stuff done there; you can lose days just wading through stuff written 12 or 13 years ago ...
You must be Registered and Logged In to post comments.
<< Back to main