Primate Studies— Where BTF's Members Investigate the Grand Old Game
Wednesday, November 13, 2002
And the Beat Goes On: Derek Jeter and the State of Fielding Analysis in Sabermetrics - Part 4
Next up: Context-Adjusted Defense
The Saga Continues: Context-Adjusted Defense
In 1999, the Big Bad Baseball Annual included a lengthy article by Charles Saeger, outlining another approach to evaluating fielding performance. This system was first published to rec.sport.baseball in 1994, revised extensively before the BBBA article in 1999, and revised again since the original publication. Recent innovations have been based on inferences that Charlie has drawn from play-by-play data that I have provided to him.
The details of Context-Adjusted Defense (CAD) were recently described in a Primer article. Charlie starts with three basic categories for each position - fielder outs (range), runners removed from the basepaths, and errors. Each of these is placed into a specific context. Fielder outs are placed into the context of the team’s hits allowed (estimated GB singles allowed for infielders, 2B, 3B, and estimated FB singles allowed for outfielders), then adjusted for contextual elements that affect the number of opportunities that a player sees - GB/FB rates, LHP/RHP, and ballpark. The LHP/RHP adjustments are position-specific, based on inferences drawn from the play-by-play data that show that LHB and RHB not only hit the ball to different sides of the diamond but at a different rate to each position on each side of the diamond. Similarly, runners removed from the basepaths are placed into the context of the estimated number of runners available to be removed, then adjusted for contextual elements that affect the number of opportunities a fielder may have to remove baserunners (GB/FB rates and total number of BIP). Error rates are directly compared to the league rate for the position (theoretically, a player would make errors at the same rate regardless of the number of opportunties he had to make such errors). In addition to these three basic categories, Charlie will also add an extra category based on information specific to a position that provides additional information about fielding skill at a position. As in DFTs, fielder opportunities at a position in each category are first determined at the team level, then allocated to individual fielders proportionately based on playing time. The reader is referred to the linked article for calculation details.
In CAD, shortstops are evaluated in four categories - range (using assists to measure fielder outs), DP ability (runners removed), error rate, and combined 2B/SS putout rate. This last is based on the observations by Davenport and James that many middle infield putouts can be made by either fielder, thus the combined rate of putouts is a better indicator of fielding skill than the number of putouts by an individual fielder. Putouts are not included in the range estimates for middle infielders in CAD for that reason, and are thus evaluated separately. The outs saved in each category are converted to runs, then to a defensive winning percentage based on the estimated level of defensive responsibility for the position and the percentage of playing time at the position. The regular shortstops for 1998-2000 are ranked in Table 6, with the net number of plays made and number of runs saved in each category shown, followed by the total, the defensive responsibility for the position, and the DWP. Fielding Runs and DFTs are also shown for comparison purposes. (These rankings are based on the method as it was described in the Primer article; more changes are in process as I write.)
Table 6. Regular SS Rankings, Context-Adjusted Defense, 1998-2000
|
1998 |
Team |
Range |
RRuns |
DP |
DPRuns |
Err |
ErrRuns |
PO |
PORuns |
TotRuns |
BaseR |
DWP |
FR |
DFT |
|
Vizquel, O |
CLE |
-1.47 |
-0.35 |
-5.48 |
-1.37 |
15.49 |
9.88 |
18.96 |
2.18 |
10.35 |
121.30 |
0.541 |
7.44 |
11 |
|
Stocker, K |
TBA |
7.29 |
1.75 |
10.24 |
2.55 |
3.76 |
2.40 |
5.68 |
0.65 |
7.35 |
88.89 |
0.540 |
11.63 |
14 |
|
Gonzalez, A |
TOR |
25.46 |
6.10 |
3.58 |
0.89 |
2.80 |
1.79 |
-3.70 |
-0.43 |
8.36 |
112.85 |
0.536 |
-9.40 |
5 |
|
Cruz, D |
DET |
3.11 |
0.74 |
9.12 |
2.27 |
7.07 |
4.51 |
-1.89 |
-0.22 |
7.31 |
114.03 |
0.531 |
16.10 |
17 |
|
Jeter, D |
NYA |
-7.28 |
-1.75 |
-9.06 |
-2.26 |
8.77 |
5.60 |
6.61 |
0.76 |
2.35 |
119.28 |
0.510 |
-20.02 |
-3 |
|
DiSarcina, G |
ANA |
-9.85 |
-2.36 |
0.34 |
0.08 |
5.67 |
3.62 |
4.54 |
0.52 |
1.87 |
125.88 |
0.507 |
-3.63 |
3 |
|
Bordick, M |
BAL |
-19.71 |
-4.72 |
-9.13 |
-2.28 |
12.58 |
8.02 |
-7.28 |
-0.84 |
0.19 |
122.03 |
0.501 |
18.26 |
4 |
|
Meares, P |
MIN |
-3.24 |
-0.78 |
7.15 |
1.78 |
-4.13 |
-2.64 |
-0.40 |
-0.05 |
-1.67 |
111.82 |
0.492 |
-7.68 |
0 |
|
Rodriguez, A |
SEA |
-12.06 |
-2.89 |
-10.20 |
-2.54 |
2.98 |
1.90 |
5.29 |
0.61 |
-2.92 |
127.85 |
0.488 |
2.06 |
0 |
|
Garciaparra, N |
BOS |
9.88 |
2.37 |
-20.78 |
-5.18 |
-5.78 |
-3.69 |
14.94 |
1.72 |
-4.79 |
112.34 |
0.478 |
-15.26 |
-11 |
|
Tejada, M |
OAK |
7.40 |
1.77 |
9.89 |
2.47 |
-11.24 |
-7.17 |
-6.78 |
-0.78 |
-3.71 |
84.19 |
0.477 |
2.16 |
0 |
|
Caruso, M |
CHA |
-8.69 |
-2.08 |
9.06 |
2.26 |
-17.39 |
-11.09 |
-12.23 |
-1.41 |
-12.32 |
108.85 |
0.440 |
-7.33 |
-16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1999 |
Team |
Range |
RRuns |
DP |
DPRuns |
Err |
ErrRuns |
PO |
PORuns |
TotRuns |
BaseR |
DWP |
FR |
DFT |
|
Bordick, M |
BAL |
25.22 |
6.09 |
18.61 |
4.69 |
14.02 |
9.00 |
33.49 |
3.80 |
23.57 |
185.25 |
0.560 |
35.38 |
23 |
|
Sanchez, R |
KCA |
21.67 |
5.23 |
19.00 |
4.79 |
7.63 |
4.90 |
7.67 |
0.87 |
15.79 |
144.26 |
0.552 |
31.94 |
24 |
|
Cruz, D |
DET |
10.52 |
2.54 |
4.14 |
1.04 |
8.39 |
5.39 |
-5.06 |
-0.57 |
8.40 |
160.76 |
0.525 |
4.68 |
20 |
|
Batista, T |
TOR |
10.46 |
2.52 |
3.71 |
0.93 |
2.30 |
1.47 |
0.36 |
0.04 |
4.97 |
105.12 |
0.523 |
10.18 |
13 |
|
Garciaparra, N |
BOS |
16.91 |
4.08 |
-15.83 |
-3.99 |
1.48 |
0.95 |
22.85 |
2.59 |
3.64 |
140.24 |
0.513 |
-7.53 |
0 |
|
Tejada, M |
OAK |
10.05 |
2.42 |
1.01 |
0.25 |
2.29 |
1.47 |
0.93 |
0.11 |
4.26 |
178.39 |
0.512 |
7.13 |
4 |
|
Rodriguez, A |
SEA |
-9.88 |
-2.38 |
11.77 |
2.96 |
3.51 |
2.26 |
-1.25 |
-0.14 |
2.70 |
134.19 |
0.510 |
7.23 |
3 |
|
Guzman, C |
MIN |
10.17 |
2.45 |
1.45 |
0.37 |
-6.66 |
-4.28 |
-3.31 |
-0.38 |
-1.83 |
129.56 |
0.493 |
-7.20 |
-3 |
|
Vizquel, O |
CLE |
-6.62 |
-1.60 |
-14.82 |
-3.73 |
3.83 |
2.46 |
-10.30 |
-1.17 |
-4.04 |
156.60 |
0.487 |
1.57 |
-7 |
|
Clayton, R |
TEX |
6.05 |
1.46 |
-1.35 |
-0.34 |
-6.17 |
-3.96 |
-9.98 |
-1.13 |
-3.98 |
149.77 |
0.487 |
3.10 |
-7 |
|
Jeter, D |
NYA |
-19.25 |
-4.65 |
-21.97 |
-5.54 |
4.97 |
3.19 |
-11.27 |
-1.28 |
-8.27 |
167.16 |
0.475 |
-33.55 |
-13 |
|
Caruso, M |
CHA |
-16.82 |
-4.06 |
0.59 |
0.15 |
-7.77 |
-4.99 |
-4.13 |
-0.47 |
-9.37 |
128.41 |
0.462 |
-19.92 |
-10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2000 |
Team |
Range |
RRuns |
DP |
DPRuns |
Err |
ErrRuns |
PO |
PORuns |
TotRuns |
BaseR |
DWP |
FR |
DFT |
|
Sanchez, R |
KCA |
28.48 |
6.88 |
10.11 |
2.55 |
13.87 |
8.92 |
-15.88 |
-1.78 |
16.56 |
163.63 |
0.548 |
15.18 |
28 |
|
Rodriguez, A |
SEA |
11.65 |
2.81 |
23.87 |
6.02 |
7.87 |
5.06 |
12.35 |
1.38 |
15.27 |
157.83 |
0.546 |
8.05 |
17 |
|
Martinez, F |
TBA |
23.88 |
5.77 |
6.79 |
1.71 |
1.51 |
0.97 |
8.12 |
0.91 |
9.36 |
126.77 |
0.536 |
31.20 |
9 |
|
Cruz, D |
DET |
0.44 |
0.11 |
7.47 |
1.88 |
5.91 |
3.80 |
-33.69 |
-3.78 |
2.01 |
182.39 |
0.505 |
3.95 |
13 |
|
Clayton, R |
TEX |
-9.26 |
-2.24 |
0.20 |
0.05 |
2.82 |
1.81 |
15.78 |
1.77 |
1.39 |
146.74 |
0.505 |
-2.59 |
-4 |
|
Valentin, J |
CHA |
17.18 |
4.15 |
19.74 |
4.98 |
-16.87 |
-10.84 |
0.98 |
0.11 |
-1.61 |
148.27 |
Mike Emeigh
Posted: November 13, 2002 at 06:00 AM | 27 comment(s)
Login to Bookmark
Related News:
|
Support BBTF
Thanks to tshipman for his generous support.
Bookmarks
You must be logged in to view your Bookmarks.
Hot Topics
What do you do with Deacon White? (17 - 1:12pm, Dec 23)Last: Alex KingLoser Scores (15 - 12:05am, Oct 18)Last: mkt42Nine (Year) Men Out: Free El Duque! (67 - 10:46am, May 09)Last: DanGWho is Shyam Das? (4 - 8:52pm, Feb 23)Last: RoyalsRetro (AG#1F)Greg Spira, RIP (45 - 10:22pm, Jan 09)Last: Jonathan SpiraNorthern California Symposium on Statistics and Operations Research in Sports, October 16, 2010 (5 - 12:50am, Sep 18)Last: balamarMike Morgan, the Nexus of the Baseball Universe? (37 - 12:33pm, Jun 23)Last: The Keith Law Blog Blah Blah (battlekow)Sabermetrics, Scouting, and the Science of Baseball – May 21 and 22, 2011 (2 - 8:03pm, May 16)Last: Diamond ResearchRetrosheet Semi-Annual Site Update! (4 - 4:07pm, Nov 18)Last: SweatpantsWhat Might Work in the World Series, 2010 Edition (5 - 3:27pm, Nov 12)Last: fra paoloPredicting the 2010 Playoffs (11 - 5:21pm, Oct 20)Last: TomHSABR 40: Impressions of a First-Time Attendee (5 - 11:12pm, Aug 19)Last: Joe Bivens, Minor GeniusSt. Louis Cardinals Midseason Report (12 - 12:42am, Aug 10)Last: bjhankeNapoleon Lajoie: Definition of Grace (9 - 12:38am, Jul 01)Last: Hang down your head, Tom FoleyYouth Baseball Hitting Drills: Shine the Light (5 - 6:47am, Mar 11)Last: Pat Rapper's Delight
|
|
|
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. Charles Saeger Posted: November 13, 2002 at 02:03 AM (#607178)Also, the putout run rate looks high; it should be half the assist rate.
-- MWE
-- MWE
BTW, what was the correlation between DFTs and chances again? You said it was something around r=0.50 while AIM chatting the other night, but I don't remember offhand.
Althopugh their numbers do not correspond exactly to yours, STATS 2001 fielding numbers show the Yanks once again finishing last in DP per DP opportunity: 105 out of 203, or 51.7%, vs AL average of 58.4% (NL was at 59.1%). This is for all infield DPs, not just those started GB6 or GB4.
The correlation between DFT and SS fieldable opportunities is r=0.526.
Bob Allen, thanks for the 2001 update on DPs. How does STATS determine a DP opportunity? The opps counts seem a bit low to me.
-- MWE
Apropos of the MVP discussion thread, FWIW, ARod and Tejada are fairly close to each other in 1998 and 1999, and ARod comes out *much* better in 2000. Of course, the $22 million man was still with Seattle then; I haven't looked at his Texas numbers at all.
-- MWE
-- MWE
WRT Bordick: Davenport's fielding numbers for 2002 show Bordick with FRAA of 32 runs. His last five seasons (based on the V.2003 of DFTs, which has some changes from V.2002 which is what I evaluated in the article) are 11, 34 -12 (-11 BAL/-1 NYM), -1, and 32.
-- MWE
For my Superlwts, for DP defensive value, I also use DP's per DP opp, where a DP opp is runner on first and less than 2 outs. I can't remember if I except any situtaions (like bags loaded) from DP opps. If a Dp is turned I give equal credit to the fielder and to the pivot man. This 50/50 credit is arbitrary. My "feeling" is that the pivot man has more influence over whether the DP is turned or not. I've never looked into this though. And for DP opps, I don't think that I make a distinction between ground balls that are initially thrown to second and ground balls that are inititally thrown to some other base.
Anyway, my questions are:
1) What credit split would you use for the fielder and the pivot man when the throw goes to second and either the DP is turned or it isn't?
2) If a fielder throws to first or some other base (and not to second) in a DP situation, should we not consider this an opp for the fielder? IOW, if a fielder does not even attempt a DP, is it becuase no one would have had a chance for a DP (because of where and how hard the ball was hit and/or becuase the runner was in motion or very fast, etc.), or because that particular fielder could not at least get the force at second?
You have to exclude SH from this - also failed sac attempts (which might be part of the difference, now that I think of it, between the opps I report and the opps that STATS reports; I got the SH but not the failed sac attempts).
Anyway, my questions are:
1) What credit split would you use for the fielder and the pivot man when the throw goes to second and either the DP is turned or it isn't?
For me, this is the same situation as for the ball in the hole that either the SS or 3B could field. I don't know who should have fielded it, so they both get full blame for not fielding it. Same thing here - both the SS and the 2B get full blame for not turning the DP. My subjective feel, as I noted earlier, it's likely that in the Yankees' case the problem is Jeter and in Boston's case it's the 2Bs.
2) If a fielder throws to first or some other base (and not to second) in a DP situation, should we not consider this an opp for the fielder?
I count it, again because I generally don't *know* whether or not the DP was possible.
BTW, I think that my UZR runs should have been included in the charts for comparison purposes! ;)
That's in part 6. Patience, grasshopper - one method at a time :)
-- MWE
My understanding of Win Shares, CAD and DFTs is that they place defensive contribution in a team context. Thus, just as the sum of offensive accomplishments of players on a good hitting team will be greater than the sum of offensive accomplishments of a bad team, so will defenses be rated.
As I read these articles, the point is not that team defense has been incorrectly measured. At least, that argument has yet to be advanced. If so, then measuring individual defense within one team becomes a zero-sum game. In other words, if Derek Jeter is underrated by CAD, DFT or WS, then Brosius or Bernie or somebody must be overrated, since the overall team has been measured reasonably well.
Is this correct, or are you arguing that team defense has been measured as suboptimally as individual defense?
A good challenge will be learning how to spot these teams. No one knows how to do this yet.
Here's another interesting point about subjective evaluation. I've thought about this before but it just popped into my head again after reading Slappy's post. We (sabers) generally eschew (debunk, poo-poo, disdain, mock, etc.) subjective evaluation when it comes to things like offense and even defense, and rightfully so - to some extent. However...
Since regression is an important part of an objective evaluation process (for ability or projections that is), it is critical that we estimate the population that a players comes from well (sorry about the awful syntax). (See the discussions about regression and population on some of the Clutch Hits [argh!] threads.) Without getting into all of the mathematical details, the reason we need to know about that population is that that determines what number we regress a sample stat towards. If we are given a player with a .300 BA in 100 AB's and we know nothing else about that player, we regress the .300 towards the mean BA of all players. If we find out that our player is a LHB, then we regress the .300 towards the mean BA of all LHB's. If we find out that our player is 6'2", 185 lbs., then we uyse the BA of all such players, etc.
Now here's where subjective evaluation comes into play. Subjective evaluation can give us a better idea of what population a player comes from (in order to determine what number to regress his sample stats towards) - no here's the important part - as long as that subjective evaluation is completely independent of any sample data (the subjective evaluation must either be NOT subject to much if any sampling error - like watching a player run several times is not subject to much sampling error - some but not much OR the sampling error in the subjective evaluation must be independent of the sampling error in the objective evaluation - i.e., the sample stats.
For example, let's say we have a normalized UZR or ZR for a particular OF, say Torri Hunter, that is 1.02 (2% above average) for one season. Normally, we would regress the 1.02 towards 1.00 to estimate his true number. Since our sample is only 1 year, we regress say 50%, to give him a true number of 1.01.
Now what if in observing Hunter, we notice that he gets good jumps on the ball (we think), he runs like the wind, and he makes tough catches look easy (like Andruw Jones). Even if we are "wrong" about some of these observations, doesn't it change the population that Hunter is from? Before, without any observation, he is just a center fielder, so we use the mean UZR of a (starting) CF (around 1.00) to determine what to regress his 1.02 towards. But if we notice all of the above things, we now have to say that the population he comes from is not only all CF's, but CF's who run fast, get good jumps on the ball, etc. So what is the mean (true) UZR of that population? We don't know for sure, but it might be 1.01 or 1.02!. So instead of regressing Hunter's sample UZR of 1.02 towards 1.00, to get an estimate of 1.01 for his true UZR, we might regress the 1.02 towards 1.01 or even 1.02 to get an estimate of 1.15 or even 1.02 for Hunter's true UZR!
This is basically the mathematical explanation for how and why we can indeed use subjective evaluations ALONG WITH objective, sample measures to better estimate players' true abilities (i.e.projections), especially for abilities that are hard to measure through objective means (for various reasons - e.g., small sample sizes, lots of sampling error), and especially when the subjective evaluation is reliable enough to make some significant inferences about the population that a player comes from!
You can even do it with hitting! For example, let's say that you ave the same hitter as above - .300 in 100 AB's. You know nothing else about him, so you regress the .300 towards the mean BA of all players, say .260. Since it is only 100 AB's, you regress maybe 80%, to get an estimate of his true BA of around .268. But what if you watch the guy (let's forget about his handedness, hieght and weight, etc.) and say, "Boy this guy "looks" like a great hitter! He has a sweet swing, etc." Now you may have a different population! Assuming that your observations are not based upon (completely independent of) the fact that he hit .300 in those AB's - which is hard to do of course - you no longer have a hitter who comes from a population of all hitters - he now comes from a population of all hitters who "look" like great hitters! So instead of regressing the .300 towards .260, you might regress it towards .280 or something like that. The tricky part, of course, is to somehow make the subjective evaluations independent of the sample data, in order for the above to be valid. That is tough in hitting, which is why we generally don't like to use much if any subjective evaluation when it comes to hitting, fo rour projections. For example, if we are looking at a player's sample HR total and trying to determine his true HR rate (i.e. HR projection), how can we separate observation and sample performance? If he hits 50 HR in 500 AB's, he is going to "look" like a great HR hitter, right?
But since in order to use subjective evaluation (observation) to change our population, and hence, the number we regress to, the subjective evlaution must be INDEPENDENT of the sample data (the 50 HR's), we cannot change our population in this case even though we observed that this guy "looks" like a great HR hitter, because that observation is obviously going to be influenced by the fact that he hit 50 HR's. Now if we gave a tape of the guy to a person who didn't know that he hit 50 HR's, and on the tape were this player's swing and misses, and our observer (presumably a competent scout), and he remarked "Wow, this guy has a powerful swing! I think he is a great HR hitter," THEN we could legitimately change the population from whence we think this player came!
Anyway, just something (important, IMO) to think about...
I just want to repeat the most important part again, for those who skim long posts - The tricky part, of course, is to somehow make the subjective evaluations independent of the sample data, in order for the above to be valid.
Not necessarily. We are watching Hunter, and comparing him to some baseline for an outfielder that exists *in our mind*. If we watched Andruw Jones for a while, or Darin Erstad, or Jim Edmonds, or Doug Glanville, we could easily draw the same conclusion about them as we do about Hunter. It might very well be a requirement for *every* center fielder that he get a good jump on the ball, run like the wind, and make tough chances look easy, in which case the base population hasn't changed at all.
-- MWE
In practice, what we are doing when we regress a player like Hunter's (whom we observe to have certain positive characterisitics that not ALL Cf'ers possess) sample defensive stats, is saying "Even though the average, say ZR, of a starting center fielder is .73, that includes a few "lumbering" CF'ers, perhaps a few CF'ers whom we "know" (independent of their ZR - hopefully) should not even be playing CF (perhaps the team has no other CF'er), so since we "know" that Hunter is probably not one of those types (lumbering, doesn't really belong there), we cannot use .73 as the "mean of the populaton" ZR to regress his sample ZR to. We would need to estimate the mean ZR of all CF'ers who are not lumbering, etc., which would presumably be greater than .73...
When it comes to the DP, shouldn't the LH/RH thing also apply? For example, what is Jeter's rate at turning the DP with a RH hitter and LH hitter? What is the league rate? Breaking those down with him as the pivot man and not, what are the breakdowns?
If Jeter's ball distribution is really skewed one way, isn't it possible that the DPs are also skewed?
Futhermore, the more balls hit up the middle or directly at him would make it a little easier to turn the DP, than a ball in the hole. So, break it down even further: what is Jeter's rate at turning the DP, compared to the league when
1 - LH/RH
2 - pivot / not-pivot
3 - hole / direct / up the middle
( I realize we lose sample size along the way.)
And the gain in doing this is...what? How more accurate are we likely to be if we try to split a population of 30 or so players - all of whom are in the top 1% if not in the top 0.5% of the entire population of baseball players (which would include minor leaguers, amateur players, beer leaguers, etc.) - into two or more groups? Aren't we at the point where we're trying to adjust for differences that are likely to be "smaller" than random year-to-year performance variation? Brock Hanke, in the 1998 BBBA, called this the Law of Diminishing Returns as applied to baseball analysis, and I think he's right.
-- MWE
Funny thing is, I'm actually conceding what non-sabers often say - namely that subjective observation/evaluation is an important part of a proper analysis of a player's ability. I'm just giving a mathematical explantion as to why they may be right...
My answer is that the distinction between CFs based on the skills that MGL has cited has already been made before the players every get to CF - players who don't have the requisite skills aren't playing CF. The variation in those skills within the group is almost certain to be *far* smaller than the variation between those inside the group and those outside the group, to the extent that it does not matter whether you account for it or not, because you cannot see the differences between members in the group from outside the group.
-- MWE
You must be Registered and Logged In to post comments.
<< Back to main