User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
Page rendered in 0.9115 seconds
41 querie(s) executed
You are here > Home > Hall of Merit > Discussion
| ||||||||
Hall of Merit — A Look at Baseball's All-Time Best Wednesday, January 03, 2018Most Meritorious Player: 2010 ResultsCongratulations to Josh Hamilton, our 2010 Most Meritorious Player Player Name pts ballots 1sts Josh Hamilton 161 12 4 Robinson Cano 143 11 4 Albert Pujols 131 12 1 Evan Longoria 127 11 0 Joey Votto 116 11 0 Roy Halladay 107 11 2 Jose Bautista 80 10 0 Carl Crawford 71 9 0 Ubaldo Jimenez 66 9 0 Miguel Cabrera 64 9 0 Felix Hernandez 56 7 1 Adrian Beltre 52 8 0 Troy Tulowitzki 33 7 0 Josh Johnson 33 6 0 Adam Wainwright 30 7 0 Joe Mauer 19 6 0 Brett Gardner 18 4 0 Matt Holliday 17 3 0 Adrian Gonzalez 16 2 0 Aubrey Huff 13 3 0 Tim Hudson 13 2 0 Jason Heyward 12 1 0 Kevin Youkilis 11 1 0 Shin-Soo Choo 9 3 0 Jayson Werth 9 2 0 Paul Konerko 6 1 0 Ryan Zimmerman 5 2 0 Chase Utley 5 1 0 Carlos Gonzalez 4 2 0 Justin Morneau 4 1 0 Cliff Lee 3 2 0 Clay Buchholz 3 1 0 CC Sabathia 1 1 0 Andrew McCutchen 1 1 0 David Price 1 1 0 |
BookmarksYou must be logged in to view your Bookmarks. Hot TopicsMock Hall of Fame 2024 Contemporary Baseball Ballot - Managers, Executives and Umpires
(28 - 10:54pm, Dec 03) Last: cardsfanboy 2024 Hall of Merit Ballot Discussion (170 - 7:45pm, Dec 03) Last: Chris Cobb Hall of Merit Book Club (16 - 6:06pm, Dec 01) Last: ERROR---Jolly Old St. Nick Most Meritorious Player: 2023 Results (2 - 5:01pm, Nov 29) Last: DL from MN Most Meritorious Player: 2023 Ballot (12 - 5:45pm, Nov 28) Last: kcgard2 Most Meritorious Player: 2023 Discussion (14 - 5:22pm, Nov 16) Last: Bleed the Freak Reranking First Basemen: Results (55 - 11:31pm, Nov 07) Last: Chris Cobb Mock Hall of Fame Discussion Thread: Contemporary Baseball - Managers, Executives and Umpires 2023 (15 - 8:23pm, Oct 30) Last: Srul Itza Reranking Pitchers 1893-1923: Results (7 - 9:28am, Oct 17) Last: Chris Cobb Ranking the Hall of Merit Pitchers (1893-1923) - Discussion (68 - 1:25pm, Oct 14) Last: DL from MN Reranking Pitchers 1893-1923: Ballot (13 - 2:22pm, Oct 12) Last: DL from MN Reranking Pitchers 1893-1923: Discussion (39 - 10:42am, Oct 12) Last: Guapo Reranking Shortstops: Results (7 - 8:15am, Sep 30) Last: kcgard2 Reranking First Basemen: Ballot (18 - 10:13am, Sep 11) Last: DL from MN Reranking First Basemen: Discussion Thread (111 - 5:08pm, Sep 01) Last: Chris Cobb |
|||||||
About Baseball Think Factory | Write for Us | Copyright © 1996-2021 Baseball Think Factory
User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
|
| Page rendered in 0.9115 seconds |
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
1. DL from MN Posted: January 03, 2018 at 04:19 PM (#5600346)Fred Lynn, Ron Guidry and Dwight Gooden are MMP plus-one.
Pujols 11
Votto 7
M Cabrera 7
Halladay 6
Utley 6
McCutchen 5
Cano 5
Beltre 5
Mauer 5
Tulowitzki 4
Wainwright 4
Holliday 4
Barry Bonds 7 18
Henry Aaron 0 16
Willie Mays 7 16
Mike Schmidt 2 13
Roger Clemens 0 13
Rickey Henderson 1 12
Honus Wagner 4 11
Albert Pujols 3 11
Mickey Mantle 3 11
Frank Robinson 1 11
Eddie Mathews 0 11
Cal Ripken 3 8
Joe Morgan 3 7
Mike Trout 4 6
Ty Cobb 3 5
We aren't finished covering Cobb, Wagner or Trout. Pujols has 2011 left.
This is the most Hank Aaron thing ever. I love it so much.
I definitely need to get a job that starts later in the morning so I can stay up late and watch Angels games. This is a generational talent who at age 26, may already be HoM-worthy.
1. Chris Fluit said that I did not include defense in my 2010 ballot. I did include defense!
2. DL asked about Halladay, Longoria and Cano's RPA values:
Halladay was just outside with a +24.16 runs. Longoria was at + 21.66 runs and Cano was at +14.32 runs.
He appears on no one else's ballot, let alone so high as #4 - and that is another reason that I do not take other people's evaluation of defense seriously!
I am also a bit baffled by the high rating for Cano. He was only the fourth most valuable player and 5th most valuable, if you include Sabathia, on his own team! Was it for his 'defense'? I have him as a plus on defense for 2010, but only to the tune of +.48 runs.
My RPA's, for more than two decades have included defense for each player and the defense behind each pitcher, but for the purpose of this historical rating evaluations for a single season, I would need more than the overall defensive rating that was averaged over three years (from my prior ratings) in order to go back and separate out each player's defense in each single season.
Cano: .319/.381/.534 (696 PA, 141 OPS+)
Uggla: .287/.369/.508 (674 PA, 131 OPS+)
Now those are pretty similar lines, so I could see how an evaluation system might rate them similarly. So why the difference? By any defensive metric (or just eyeballing, for that matter), Cano was by far the superior fielder.
Just a couple other comparisons:
- by WAA, Cano (5.6) beats Uggla (2.3)
- by WPA, Cano (4.07) beats Uggla (2.71)
Florida played in a strong pitcher's park.
The huge difference between the two parks mean that the OPS numbers are heavily skewed against Uggla.
As for defense, yes Uggla was below average at -1.62 runs on defense, but his offense was HUGE!
More details:
Cano walked 33 times fewer than Uggla and hit into 19 GDP's while Uggla hit into only 9. OBP has a multiplier effect. It is not linear.
In addition, Cano stole 3 bases while being caught or picked off 6 times! Uggla stole 4 bases but was only caught stealing or picked off two time.
My RPA formula WORKS! WAR and all its variants are fatally flawed.
Catcher: Yadier Molina +15.53 runs; Humberto Quintero +11.60 runs; Russell Martin +10.58 runs.
First Base: Aubrey Huff +4.22 runs: Daric Barton + 3.58 runs; Lyle Overbay +3.51 runs
Second Base: Dustin Pedroia + 3.16 runs; Aaron Hill +3.02 runs
Third Base: Casey Blake +5.71 runs; Jose Lopez +3.98 runs
Shortstop: Josh Wilson 4.87 runs; Cliff Pennington +4.62 runs; Ramon Santiago +4.20 runs
Left Field: Carl Crawford +12.28 runs; Brett Gardner +9.38 runs
Centerfield: Drew Stubbs +7.86 runs; BJ Upton + 7.72 runs; Julio Borbon +7.13 runs
Right Field: Jason Heyward +10.23 runs; Wil Venable +9.63 runs; Justin Upton +9.02 runs
You can clearly see that catcher and the three outfield positions are much more critical on defense, than the four infield positions.
The outfield is where balls that fall in, drop for doubles and triples, wheres as balls hit up the middle, on the ground can, usually, only go for a single.
First base and third base are slightly more critical than the middle infield because balls hit down the line end up as doubles.
I know that this goes counter to all you think that you know, but these are the FACTS!
I'm just guessing, but my bet is that it's a tough, tough thing to be playing in the same time and league as Willie Mays.
OPS+ includes park effects
As I stated, the internals of their performances, with the far fewer walks and far greater GDPs and caught stealing and picked off in regards to Cano, may account for a good portion of the difference. Even in taking the extra base, Uggla was thrown out twice and Cano three times. In RPA, the extra out involved in the GDP is added to Cano's plate appearances and the runner removed is subtracted from Cano's on-base numbers. The times thrown out on the base paths also removes an on-base occurrence for each runner removed. These adjustments affect the half of the RPA formula involving the 'set-up' value for the hitters that follow in the lineup.
What are the OPS+ park factors for NY and FLA in 2010? Can you let us know?
Here's the RPA park factors:
Yankees at Yankee stadium: .1423 RPA
Yankees at visitor park: .1227 RPA A drop off of almost 20 points at the visitor park!
All Visitors at Yankee Stadium: .1271 RPA
All visitors performances against the Yankees at the visitors park: .1150 RPA A drop-off of over 12 points!
The total drop off in performance, from Yankee Stadium to the visitors park, gives a park factor of .8823 to drop the performance at Yankee Stadium to league level of 1.000
Florida at Home: .1178 RPA
Florida at the away park: .1212 RPA
Visitor at Florida: .1125 RPA
All visitor performances against Florida at the visitor's park: .1218 RPA
The Florida park variant for 2010 was 1.0551 which increases the offensive performances, at Florida, to league level 1.000
In any case, when all the data was processed, Uggla was at .164 offensive RPA and Cano at .140 RPA. That's a big difference. Cano's was very good, but Uggla's was excellent.
https://www.baseball-reference.com/leagues/AL/2010-misc.shtml
https://www.baseball-reference.com/leagues/NL/2010-misc.shtml
Yankee Stadium is a 106/104 and Miami is a 103/103. They're almost the same.
Their formula is described here: https://www.baseball-reference.com/about/parkadjust.shtml
It is utterly ridiculous that the Yankee stadium and Florida park adjustments that you posted were so similar. One was a big hitters park in 2010 and the other a pitchers park in 2010, but you would not know that from baseball-reference!
Baseball-reference is a wonderful source of player and team data, both current and historical, but analysis is clearly not their game!
That overly simplistic park factor makes ALL their player ratings bogus!
We have come a long way from the neanderthal era of player performance ratings. That is something that we can all take pride in, but we, obviously, have a long way to go! YIKES!
Robinson Cano's top two offensive seasons were 2012 @ .152 RPA and 2013 @ .150 RPA.
While I have almost always had good to very good RPA annual ratings for Cano, he's been highly over-rated as to his actual value, for his entire career.
One side note and 'pet peeve': I still marvel over the 'baseball establishment' lowering the value of a fabulous hitter in David Ortiz, simply because he was a DH. In fact, while Ortiz was with the Bosox, his production at DH was often the biggest reason for their success, simply because most other teams had no understanding of the value of the DH position, by putting over-the-hill hitters and assorted 'junk' at that position. Prior to the 2016 season I told the Philadelphia SABR meeting that Mitch Moreland was no answer to the loss of David Ortiz. In fact, after I recently completed my analysis of the 2016 season, the Bosox were lucky to win their division. The Yankees were a far, far better team!
I don't see why. They're adjusting on runs scored. You're adjusting on RPAs calculated. If RPA doesn't correlate to runs, then isn't the onus on you to show how runs scored is destroying information?
Pretty sure that A) their park factors are based on actual runs scored and B) unless something has changed of which I'm unaware, Runs Created is not a linear formula.
RPA uses all available data and for 162 games because it uses the home and away games as well as all the available data points for each. Each team has about 6,000 plate appearances per season and their opponents, home and away, also have 6,000 plate appearances in those 162 games. That's a lot of data to analyze and to come to a much more accurate park factor.
RPA uses single season park factor data ONLY!
I explained, in one of my first books, why using averaged multi-year park factors is a huge mistake, by using a particular season that threw the stadium park variations wildly out of line.
RPA does not work backwards from runs scored, simply because RPA was tested as a PREDICTIVE formula for run scored. It ignores the runs scored completely! It takes the internal game data, for those 162 games so as to come up with offensive production predictions of team run scoring that correlate excellently with actual team run scoring. The test is to be able to PREDICT performance. There is no predictive test for going in the other direction.
My park factors are useful because they are predictive of performance in terms of runs scored by a team over a full season. Are there exceptions, that fall outside of good predictive amounts for each team, every year, under the RPA process? Of course! That is true for any predictive system. However, the majority of team predictive runs scoring does very well with the RPA process.
In my historical ratings for pre-1990 teams, I was forced to use the unsatisfactory method of park factors based on run scoring alone. It is not something that I could have avoided, but it is not something that I would recommend as a method of analysis.
No, it's an average of 162 data points.
In the OPS figure is onbase%, but that onbase percentage leaves out critical adjustments:
Uggla's 9 GDP's and his 2 Caught Stealing and 2 Thrown out on the base paths removed 13 base runners and added 9 extra outs.
In addition, his two IBB's should not be counted in onbase%
This results in a plate appearance figure of 589 AB's + 78 BB+HBP-IBB + 9 AB's (for the GDP out created) = 676 Plate appearances
Effective onbase% = 169 hits + 78 - 13 runners removed = 234 / 676 = .346 EOB% for Uggla.
Cano's 19 GDP's + 6 CSPO + 3 TO's removed 28 base runners and added 19 extra outs.
Cano's 14 IBB's should not be counted in his OB%.
This results in a plate appearance figure of 626 AB's + 51 BB+HBP-IBB + 19 AB's for the extra out created by the GDPs = 696 plate appearances.
Cano's effective OB% = 200 hits + 51 BB+HBP-IBB - 28 runners removed (6 cspo + 3 TO + 19 GDP) = 223/696 = .320 EOB% which is 26 points lower than Uggla's .346 EOB%.
OBP is more important than slugging%, in that scoring for others, after them in the lineup, is raised or lowered by the onbase% of those batters in front of them. Ob% has a multiplier effect and changes the value of singles, doubles, triples, etc...
Linear just means each factor is multiplied by a weight. There's no multiplying or dividing of the various factors by each other. I'm not sure what you mean by changing the value of specific events dynamically. You clearly state that you are not changing event values based on in-game situations, so what is causing the event values to change?
"How do you take a season line for a player with 600 AB's, under runs created, and make the values of the individual events 'flow' with the on-base production?"
What do you mean by "flow" since again, you clearly state that you are not doing a WPA type in-game weighting system?
"In the OPS figure is onbase%, but that onbase percentage leaves out critical adjustments:
Uggla's 9 GDP's and his 2 Caught Stealing and 2 Thrown out on the base paths removed 13 base runners and added 9 extra outs."
I agree that OPS leaves out GDP and SB/CS data, but all the major WAR/WS systems include GDP and Baserunning in the Offensive calculation. For example, on BBRef, Cano beats Uggla 6.4-5.3 just in offensive WAR. In Offensive WS, Cano beats Uggla 24.3-22.2. In gWAR, its 5.6-4.9. In wRC+ (used for the offensive component of fWAR, its 143-135. In WPA, which takes into account the actual game situations, its 4.07-2.71. What value did Uggla bring that your system is catching, but every other system is missing? Or is this entirely that you are using drastically different park factors than every other system?
"RPA uses single season park factor data ONLY!"
You imply that single-season park factor data would have less various than multi-season park factor data. This makes no sense. On top of that Sun life's 2010 park factors are virtually the same on BBRef for 2010. It played as a mild hitters park during 2010 and the surrounding seasons.
"Dan Uggla's top two offensive seasons were in 2008 @ .160 RPA and 2010 @ .164 RPA.
Robinson Cano's top two offensive seasons were 2012 @ .152 RPA and 2013 @ .150 RPA."
Again, what is RPA? What does .160 RPA mean in baseball terms? Is it a measure of Wins? Runs? Compared to Average? Replacement? Zero? The only thing I come up with is Runs over Positional Average, but your numbers seem really low if that's what that is.
I also don't see how a GDP is all the batter's fault. Some of the credit goes to the defense for turning it, some of the blame goes to the baserunner. I think you're double counting a situation where the batter is really only responsible for the out they make. If the player ahead of them in the lineup had struck out before the Cano groundout why would that be better? Uggla struck out almost twice the rate as Cano. I think that's why Cano has more GDP. With runners on base Cano was able to put the ball in play and Uggla was not. In what world does a strikeout say that a player is a better batter than the guy who put the ball in play?
Where do you give Cano credit for those 14 IBBs? That is an out avoided and a baserunner with the potential to score runs. I would assume that figures into RPA. How do you break down the "non-intentional" intentional walk where the batter is just pitched around? That would kill Barry Bonds' late career productivity if you ignored those.
No ballpark is on an island, even the ballpark that is fully enclosed. Everything is relative! Teams do not play all 162 games in the same park. If scoring is higher at the other parks, the park variant will make the enclosed park a pitchers park and vice versa. In any case, even in an enclosed park, temperature, barometric pressure, air conditioning and even external sun/darkness and fan attendance changing backgrounds and noise affect the play on the field.
Weather is huge! Parks often play hugely differently year-to-year, depending on the weather. In one of my first books I showed how lucky the NY Mets were when the were unsuccessful in getting any interest in trading Howard Johnson, after HoJO's supposedly terrible 1998 season, when he hit .230.
HoJo was #10 in the MVP in 1997 and #5 in 1999, so what happened in 1998? In 1998 all the eastern USA was cold in the Spring and even into the summer. Nowhere was this more evident whenever the Mets were at home. Shea Stradium was terribly affected. Once I adjusted the park variant on a year-to-year basis, HoJo's RPA ratings were a straight line improvement from 1997 to 1998 to 1999. HoJo did not have a bad year in 1998. And the Mets were very lucky that no team knew HoJo's real 1998 value.
Sometines the wind blows in. Sometimes the wind blows out. Sometimes it rains, but not enough to postpone the game. Some days it is blistering hot. Some days there is a high sky, that affects defensive ability. Some days the ball 'flies'. Some days it just does not fly.
Weather varies greatly from year to year.
Howard Johnson retired in 1995. But I can see how in your system he could have produced value given the weather patterns in the late 1990s.
In Barry Bonds last season, 2007, he had an incredible .204 RPA!!! That's a Hall-of-fame rating! It is one of the facts that I used to proclaim that Barry Bonds was being blacklisted. There isn't a team on this planet that wouldn't salivate to have that kind of offensive power in their lineup! There was only one single player in all of MLB who had a higher RPA that 2007 season: Alex Rodriguez, and that was only by a single point at .205 RPA!!!
If I included IBB's, what we we do with the 8th place hitters who would be given IBB's, so as to get to the pitcher? That would really give an unfortunate and untrue value to their performances! An IBB is just a plate appearance that did not take place.
As for gdp opportunities vs. strikeouts, I think that your point iis well taken and something that I've thought about for some time, but do not have a clear answer. Strikeouts are a lot better than hitting into GDPS, which are the ultimate rally killer, but there is also the question of moving runners along when putting the ball in play. That is why I've long wanted to do the situational ratings, but I have always pulled back, not only due to lack of time, but also about their accuracy simply because situational events, by definition, are small subsets of data with, therefore, huge swings year-to-year that are inherent every time you split up events into smaller and smaller bits of data. Like when this hitter is 6 for 24 when playing at night or when facing the Mets or on Tuesdays, etc...
Here's the RPA's and the HR's for each. The reason that I include HR's is because that has a huge effect on the onbase%. There is no on onbase after a home run, including the base runners on, when the homer was hit. The HR LOWERS the effective onbase % for all the hitters following and, therefore, lowers the value of all hits after the HR. The HR has huge value for the runners already on base and that is where RPA gives it its due credit.
Here's the 6 players you listed, for the 2010 season:
For the Yankees:
Jeter: A horrible .106 RPA with 10 Homers
ARod: A pedestrian (for ARod) .137 RPA with 30 HR's
Teixeira: A good but pedestrian .139 RPA with 33 HR's
For Florida:
Gaby Sanchez: .132 RPA with 17 HRs
Hanley Ramirez: A terrific .160 RPA with 21 HRs
Jorge Cantu: a horrible .109 RPA (but still better than Jeter) with 10 HRs
I would contend, from those data points that Uggla probably had more opportunities to hit into GDPs, but partially avoided them by hitting the ball in the air and by striking out. Hitting the ball in the air is the main way to avoid GDPs.
None of these data points actually tell us who came up with a runner on first with less than 2 outs more often.
Jeter: 136 Singles, 63 BBs plus 4 IBBs and 9 HBPs = 212 times on 1B
ARod: 80 Singles, 59, 1, 3= 143
Tex: 85, 93,6,13 = 197
Total: 552
Gaby Sanchez: 97,57,2,5 = 161
Hanley: 112,64,12,7 = 195
Cantu: 80,29,1,6 = 116
Total: 472
Now, I don't know how many of these occurred with less than 2 outs, plus some of these innings ended prior to Cano/Uggla batting and other guys also batted in front of them at various points in the season. But without analyzing play-by-play data, I'd say its very likely that Cano had significantly more GDP opportunities than Uggla in 2010.
Not only does HR's clear the bases but triples also clear the bases, while putting the hitter onbase. Effective Onbase% (EOB%) in RPA takes that into account. Also a double also puts the hitter on base, but also removes all base runners from 2nd and 3rd base and sometimes even from first base. Likewise a single will remove a runner from third base, and often a runner from 2nd base. RPA does its best to account for those removals of runners as well as the addition of the hitter to the base path in regards to singles, doubles and triples in the EOB%.
IBB is a subset of BB's. You can't add them together. That would be double-counting. You must subtract IBB from BB's.
Gaby Sanchez, 97,55,5, +4 reached on an error (ROE), -19 hrs -14 GDP -2 caught stealing, picked off and thrown out (CSPOTO). = 126
Hanley: 112, 52,7 + 7 ROE, -21 hrs - 14 GDP - 12 CSPOTO = 131
Cantu+Helms: 99,47,10 +7 ROE -14 hrs -18 GDP -3 CSPOTO = 128
Total = 385
Jeter: 136, 59,9, + 6 ROE -10 hrs, -22 GDP, -7 CSPOTO = 171
ARod: 80,58,3, +4 ROE -30 hrs, -7 GDP - 5 CSPOTO = 103
Teix: 85,87,13, +6 ROE -33 hrs - 15 GDP -4 CSPOTO = 139
Total = 413
Yes, a very small 28 more opportunities in Cano's 696 Plate appearances and 601 AB's, which can easily be accounted for by the fact that Yankee Stadium was a hitter friendly park and those 28 extra opportunities accounted for an increase of how many actual GDPs for Cano? Cao had 19 GDPs in those 413 opportunities. A rate of 1 GDP for each 21.74 opportunities. Ergo. it only accounts for no more than a little over 1 extra GDP as compared to Uggla!! And that single GDP can easily be accounted for by the fact that Yankee Stadium was a hitters park!
Uggla was a better hitter than Cano in 2010.
I have no idea what question you are trying to answer with effective on base percentage.
That said, BBRef gives Cano -2 DP runs and Uggla an even 0 DP runs for 2010, so clearly Uggla was better at avoiding DPs than Cano, but probably not by as much as the 10 DPs difference would indicate. As DL has pointed out though, GDPs are definitely not worth a full double counting of the outs.
Clearly every other method out there (of which I'm aware anyway) disagrees with you. That doesn't mean you are wrong, but I'd need to know your methodology to judge for myself.
"I have no idea what question you are trying to answer with effective on base percentage."
Ditto. It seems on the surface to be adding complication for complication's sake. Though, like RPA I don't know how its calculated, so maybe it will make more sense once he explains it. It certainly seems as though he is using an extreme park effect for Sun Life though.
This stat is adjusted for opportunities, Cano was 2 runs below average and Uggla was average. RPA is saying there is a huge difference between the two players due to park effects and double plays. BBREF is saying there is essentially no difference between the two players due to park effects or double plays.
There is no difference in run scoring between pitching around a batter and intentionally walking him. I wouldn't take IBB out of any assessment. That player earned the walk. There are base-out situations that make a walk worth less than average but that applies equally to intentional and non-intentional walks in those situations.
I think Fangraphs and Bill James are saying the same thing only they don't break it out as neatly as BBRef.
"I wouldn't take IBB out of any assessment."
I think he was saying that IBBs are already included in BB on BBRef so its double-counting to add them again. I think that's a true statement, but I haven't verified it.
"RPA uses the full season 600 AB's dynamically to determine the value of individual hits. A single, e.g., is .29 runs, but that value changes within the 600 AB's by the effective onbase% for half the AB's. A hitter has two responsibilities: 1. to the runners on base and 2. to the batters following. It is the value of the hits for the batters following in the lineup that change depending on the on base% of this hitter. That is why the dynamic change in value only affects 1/2 of the data. As for situational park effects for individual situations, that is another matter. I dealt with that about 20+ years ago, rating all the individual situations for all 30 parks, over several seasons, and have been thinking about doing an individual player situational rating for some time. It's another big job but I've got a lot of other 'balls to juggle' at this time."
So you use one set of coefficients for half the batter's at bats and a different set for the other half? How do you decide which half of his stats are applied to which set of coefficients?
If IBB are in BB, then just don't include an IBB term in the equation. (H+BB+HBP)/(AB+BB+HBP)
Florida had 80 wins in 2019. My RPA predicted, based on the data alone, without seeing the final record, that Florida should have won about 79.5 wins in 2010.
The Yankees had 95 wins, and the RPA predicted just over 93 wins, based on the season data alone, in 2010.
While not all results, or even most, should be expected to come that accurate, I find it remarkable how close my method came! And it could not have come anywhere close, if I had not used the park factors in RPA.
I should note that I balance out the results, offense + defense + pitching over the entire league so that the league win/loss value is as close to .500 as possible. That 'balancing' affects every player and every team with the same factor. Therefore, any changes kept every team's wins the same in relations to all other teams.
"Florida had 80 wins in 2019. My RPA predicted, based on the data alone, without seeing the final record, that Florida should have won about 79.5 wins in 2010.
The Yankees had 95 wins, and the RPA predicted just over 93 wins, based on the season data alone, in 2010.
While not all results, or even most, should be expected to come that accurate, I find it remarkable how close my method came! And it could not have come anywhere close, if I had not used the park factors in RPA."
Assuming you are applying similar park factors to both offense and defense, it shouldn't make to0 big of a difference in terms of predicting wins (I'm assuming by pyth method) whether the park factor used was accurate. For example if a team in a neutral park scored 700 runs and allowed 600, pyth predicts 92.3 wins. If you applied 120 park factor to both sides, it still rounds to 92.3 wins. If you change to 90 park factor, it still rounds to 92.3 wins. 90 to 100 to 120 is a pretty extreme park factor swing and it really doesn't affect win prediction much at all.
When I read, in on of the Sabermetric publication, that "we all know how great Ozzie is, but we can't prove it in the data". When I did my defensive study, in the method shown, Ozzie (OF COURSE!) ended up as a phenomenal defensive shortstop, even at Ozzie's old age year, near the end of his career, that I used for the study.
Another example was the unfortunate shifting of Cal Ripken from SS to 3B. My RPA showed that Cal Ripken was still an excellent shortstop and moving him to 3B destroyed both his defensive and offensive advantage for the Orioles! The shift was a big mistake.
He was playing on the carpet in Toronto and it distorted the impression of his defensive prowess.
I'm a New Yorker and my co-workers and friends were often huge Mets fans. When the Mets acquired Alomar, they couldn't believe me when I said that Almoar was not a great defensive second baseman. That belief of Alomar's defensive prowess quickly changed after they got to see him play every day and he showed he was not the spectacular defensive player they had been led to expect.
Carl Crawford's +12.28 runs in LF was based upon these numbers:
The opponent LF @TB allowed an average of .148 runs (per RPA) for every ball hit into LF.
Carl Crawford allowed a value of .132 runs per the 236 balls (143 outs, 59 singles and 34 doubles) hit into LF, while he was playing LF @TB. That's an advantage of .016 runs per every ball hit into LF @TB
The opponent LF @the opponent's park allowed an average of .155 runs per ball hit into LF.
Carl Crawford allowed a value of .121 runs per the 250 balls (158 outs, 63 singles, 29 doubles) hit into LF, while playing LF at the opponent's park. That's an advantage of .034 runs per every ball hit into LF.
That's a total of 486 balls hit into Crawford's defensive area. Even with 486 data points, it is possible to get lucky, just as with some years hitters get lucky, but RPA treats defense just as it treats offense for a batter with 486 balls put in play. A credit for the hitter is a debit for the defender and vice versa. It should all average out over a full season, but doesn't always, as we all know.
Dustin Pedroia missed half the season in 2019 but still managed to lead the MLB in defensive value at 2B.
The opponent 2B, at Fenway, had an RPA defensive value of .065. Notice the huge difference in that value from the value of each ball hit into LF in Crawford's example. It is much harder to be outstanding on defense in the infield because the overall value for each ball that is hit on the ground is so low.
Pedroia's RPA on 135 (112 outs and 23 singles) balls hit to 2B area, @ Fenway, was .049, an advantage of .016 runs per ball for Pedroia.
The opponent 2B, at the opponent's park, had an RPA value of .074.
Pedroia's RPA on 77 (61 outs, 15.5 singles, .5 doubles - the .5 results from balls hit into the area between Pedroia and the first basemen, where their is shared responsibility) balls, for an RPA of .061, an advantage of .013 for Pedroia. Since this only involves 212 balls put into play, over that half season, it is likely that the differences, per ball, home and away would narrow a bit for Pedroia over the full season.
The opponent's RPA, at the particular park, is based upon the full season data for all the opponent players at that position, at that park.
1) A player has an RPA on offense and defense for every park they played in for a given year and that's all added together for their season total?
2) If a ball is hit to LF for example, the LF and the hitter are the only players credited or blamed for what happens?
Pedroia certainly turned some double plays and had more outs per ball hit to him. I think overall when you add things up a the team level your system looks to be reasonable but I think your accounting for double plays is putting credit in the wrong player buckets which is causing some distortions. Outfielders almost never turn double plays.
Are you saying that a good defensive LF is more valuable than a good defensive 2B? Or are you saying that a good LF stands out more because the other LF general aren't great defenders?
From what I've read on RPA, it seems like on offense, it falls more into the BaseRuns/RC category (although it takes a much different route to get there) in that it tries to account for the dynamic nature of an offense; ie the 2 basic components of getting on base and then advancing/scoring the runners on base.
In summary, I guess what I'm saying is that it depends what question(s) you are trying to answer.
1) Measures like rbat, rbaser, rdp (BBRef) or wOBA, wSB, wRC+ (Fangraphs) that use linear weights to determine offensive value tell us that Cano was probably (remember, whatever system we are using is only generating an estimate of value) a better offensive player in isolation than Uggla in 2010. I'd like to see what Fangraphs or BBRef think an offensive confidence interval would look like for these 2 players, but I'm comfortable enough with the measurements in linear weights to say that its more likely Cano was the better offensive player. When adding defense into the equation, the differences become large enough that I am close to 100% confident Cano was the better overall player.
2) RPA is telling us (IMO) that Fredi Gonzalez probably did a better job of deploying his offensive resources (or at least Uggla and those around him in the order) than Joe Girardi did with Cano and those around him.
However, all three OF positions have a huge difference from the infield positions in the value of balls hit through the position. It does not matter which way the ball is hit. If it is hit in the air, the ball has a much higher value in terms of run production.
I'll have to pour through my software to answer your question about the defensive GDP. My software is huge, with many, many thousands of lines. It has been a long time (several years) since I wrote the defensive portion. I assume that I gave a ball hit to Pedroia, the same way that I handled offensive GDP's, accounting for the full result, although I have no current recollection. At this point, I can only say that I assume that I did. Wish that I could be clearer.
One more thing: I do not care about errors. If the batter hit a grounder, and the infielder threw the ball away, with the batter ending up on 2nd base, my software credits it as a double against the fielder. It is where the batter ends up that matters, not what the official scorer recorded. As such, my software runs through every play, in the downloaded Retrosheet database, to determine the actual result. The same goes for the OFer. If the OF botches the ball, even if it were officially called a single, but the batter went all the way around the bases, it is scored by RPA as a home run against the OFer.
I assume that the Yankee intention is to play Stanton in LF.
He's not a good defensive RF, while Judge appears to be a good RF.
Judge was outstanding with a +6.55 runs in 2017.
Here's Stanton's bad defensive runs situation for the last three years for RF:
2015: -3.25 runs
2016: -7.49 runs
2017: -2.94 runs
He'll be greatly benefited by playing LF, but if he can't cut it there, despite having weaker defensive players at that position, then he's a DH in the making.
It would be a mistake to move Judge from RF. I assume that they are NOT contemplating doing that.
Byron Buxton was an incredible +14.84 runs on defense in CF in 2017.
One of the best, if not the best player available as a free agent this year is Lorenzo Cain.
Cain's offensive RPA, in 2017 was a very strong .149.
Here's Cain's defensive numbers for the last 5 seasons:
2013: +7.04 runs
2014: +8.26 runs
2015: +8.23 runs
2016: +9.35 runs
2017: +5.52 runs
"The age adjustment is NOT for evaluating past performance. As stated, my RPA ratings, each year, for players at each position, are for the coming season, after evaluating the player's past two seasons. As such I add or subtract points for the age of the player."
So you're voting on players for 2011 MMP based on your projections for the 2011 season? We know what happened in 2011 already. We don't need projections.
This would explain the Dan Uggla being rated better than Robinson Cano issue from 2010. He's rating 2010 based on 2008 & 2009 plus an age adjustment. Based on a combo of 2008 & 2009, I would rate Uggla better than Cano as well (even without using his park factors and minimized 2B defense). Problem is that we were voting on what they did in 2010.
Problems with defense and positional adjustments aside, surely MMP voters are required to vote on what the players did in the year for which we are voting!
You must be Registered and Logged In to post comments.
<< Back to main