User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
Page rendered in 1.4525 seconds
75 querie(s) executed
|
| |||||||||
Baseball Primer Newsblog — The Best News Links from the Baseball Newsstand Saturday, November 21, 2009Marchman: Why the Mets are going to sign John LackeySomewhere along the line...footage from Jerry Lewis’ unfilmed “The Lackey” will be discovered.
Repoz
Posted: November 21, 2009 at 01:25 AM | 43 comment(s)
Related News: General, NY Mets, Projections, Rumors |
My BookmarksYou must be logged in to view your Bookmarks. Hot TopicsNewsblog: SNY: Megdal: Have Mets eschewed cheaper options? (18 - 4:11am, Feb 09) Last: Omar'sBlackCloud Newsblog: SNY: Salfino: Yanks prove money does equal winning (23 - 4:05am, Feb 09) Last: Omar'sBlackCloud Newsblog: AP: Brewers erecting a statue of Selig outside Miller Park (73 - 3:54am, Feb 09) Last: Omar'sBlackCloud Newsblog: ATH: Should Albert Belle be in the MLB Hall of Fame? (39 - 2:54am, Feb 09) Last: Joshua Gibsons Ruth (Voxter) Newsblog: THT: Jaffe: Ranking MLB stadiums (that I’ve been to)
(157 - 2:52am, Feb 09) Last: vortex of dissipation |
||||||||
|
About Baseball Think Factory | Write for Us | Copyright © 1996-2008 Baseball Think Factory
User Comments, Suggestions, or Complaints | Privacy Policy | Terms of Service | Advertising
|
| Page rendered in 1.4525 seconds | |||||||
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
Rk Player ERA+ SO/BB W IP L W-L% BB SO ERA1 Jose Rijo 123 2.45 111 1786.0 87 .561 634 1556 3.16
2 Fergie Jenkins 122 3.72 149 2303.2 109 .578 489 1820 3.13
3 Jimmy Key 122 2.40 103 1479.0 68 .602 345 827 3.41
5 Mark Buehrle 122 2.53 135 2061.0 97 .582 470 1188 3.80
6 Bartolo Colon 121 2.13 100 1388.2 62 .617 525 1120 3.86
8 John Candelaria 120 2.65 122 1799.0 80 .604 413 1095 3.10
9 Jim Maloney 118 2.06 132 1755.0 75 .638 749 1544 3.10
10 John Smoltz 118 2.56 129 2060.1 102 .558 690 1769 3.40
11 Dean Chance 118 2.08 128 2147.1 115 .527 739 1534 2.92
12 David Cone 118 2.47 95 1521.0 65 .594 574 1418 3.14
13 Jon Matlack 118 2.55 112 2037.2 108 .509 533 1357 3.05
14 Don Drysdale 118 3.25 124 2059.0 96 .564 457 1485 2.86
15 Andy Pettitte 118 2.07 128 1584.1 70 .646 529 1095 3.93
17 John Lackey 117 2.72 102 1501.0 71 .590 441 1201 3.81
18 Steve Carlton 117 2.16 148 2451.0 119 .554 889 1916 3.10
19 Matt Morris 117 2.61 101 1377.1 62 .620 378 986 3.61
20 Gaylord Perry 117 2.87 95 1686.0 84 .531 430 1234 2.95
21 Gary Nolan 117 2.52 110 1674.2 70 .611 413 1039 3.08
22 Dwight Gooden 116 2.88 157 2169.2 85 .649 651 1875 3.10
23 Rick Reuschel 116 2.21 114 1834.1 101 .530 508 1122 3.43
25 Tommy John 115 2.02 111 2012.2 103 .519 591 1195 3.00
Rk Player ERA+ SO/BB W IP L W-L% BB SO ERA
26 Frank Tanana 115 2.53 135 2356.2 130 .509 650 1647 3.25
27 Alex Fernandez 115 2.27 107 1760.1 87 .552 552 1252 3.74
28 Kevin Millwood 114 2.70 107 1559.1 75 .588 474 1280 3.76
29 Jack McDowell 114 2.16 119 1753.1 77 .607 564 1216 3.73
30 Jim Kaat 114 2.66 141 2173.2 114 .553 530 1410 3.22
31 Freddy Garcia 114 2.22 99 1427.1 62 .615 481 1067 3.93
32 Bob Welch 114 2.29 115 1820.2 86 .572 565 1292 3.14
33 Doug Drabek 114 2.36 108 1732.0 88 .551 447 1053 3.21
34 Charles Nagy 114 2.22 89 1354.0 65 .578 404 897 3.93
35 Dennis Eckersley 113 2.56 145 2295.0 117 .553 581 1490 3.59
36 Frank Viola 113 2.42 137 2107.2 110 .555 608 1469 3.70
37 Sid Fernandez 113 2.42 98 1590.2 79 .554 603 1458 3.15
38 Vida Blue 112 2.04 170 2664.2 120 .586 859 1753 3.12
39 Brad Radke 111 3.09 116 1868.1 110 .513 364 1124 4.32
40 Chris Short 111 2.27 109 1726.1 87 .556 559 1270 3.12
41 Dennis Leonard 111 2.21 120 1798.2 84 .588 506 1119 3.52
42 Denny Neagle 111 2.64 90 1311.0 60 .600 378 998 3.82
Provided by Baseball-Reference.com: View Play Index Tool Used
Generated 11/21/2009.
(EDIT) That's actually 38 names, I didn't notice it didn't re-rank after I deleted the few guys.
Alternatively, you could look at ages 25-30, rather than career totals through 30. Of course, there would still be big differences in innings totals, especially across eras.
Player ERA+ IP SO/BB W L D/E+ D/IP D/KBB D/W D/LJose Rijo 123 1786 2.45 111 87 6 285 -0.27 9 16
Jimmy Key 122 1479 2.4 103 68 5 -22 -0.32 1 -3
Mark Buehrle 122 2061 2.53 135 97 5 560 -0.19 33 26
Bartolo Colon 121 1388.2 2.13 100 62 4 -112.8 -0.59 -2 -9
John Candelaria 120 1799 2.65 122 80 3 298 -0.07 20 9
John Smoltz 118 2060.1 2.56 129 102 1 559.1 -0.16 27 31
Dean Chance 118 2147.1 2.08 128 115 1 646.1 -0.64 26 44
David Cone 118 1521 2.47 95 65 1 20 -0.25 -7 -6
Jon Matlack 118 2037.2 2.55 112 108 1 536.2 -0.17 10 37
Don Drysdale 118 2059 3.25 124 96 1 558 0.53 22 25
Andy Pettitte 118 1584.1 2.07 128 70 1 83.1 -0.65 26 -1
Steve Carlton 117 2451 2.16 148 119 0 950 -0.56 46 48
Matt Morris 117 1377.1 2.61 101 62 0 -123.9 -0.11 -1 -9
Gaylord Perry 117 1686 2.87 95 84 0 185 0.15 -7 13
Gary Nolan 117 1674.2 2.52 110 70 0 173.2 -0.2 8 -1
Dwight Gooden 116 2169.2 2.88 157 85 -1 668.2 0.16 55 14
Rick Reuschel 116 1834.1 2.21 114 101 -1 333.1 -0.51 12 30
Tommy John 115 2012.2 2.02 111 103 -2 511.2 -0.7 9 32
Frank Tanana 115 2356.2 2.53 135 130 -2 855.2 -0.19 33 59
Alex Fernandez 115 1760.1 2.27 107 87 -2 259.1 -0.45 5 16
Kevin Millwood 114 1559.1 2.7 107 75 -3 58.1 -0.02 5 4
Jack McDowell 114 1753.1 2.16 119 77 -3 252.1 -0.56 17 6
Jim Kaat 114 2173.2 2.66 141 114 -3 672.2 -0.06 39 43
Freddy Garcia 114 1427.1 2.22 99 62 -3 -73.9 -0.5 -3 -9
Bob Welch 114 1820.2 2.29 115 86 -3 319.2 -0.43 13 15
Doug Drabek 114 1732 2.36 108 88 -3 231 -0.36 6 17
Charles Nagy 114 1354 2.22 89 65 -3 -147 -0.5 -13 -6
Dennis Eckersle 113 2295 2.56 145 117 -4 794 -0.16 43 46
Frank Viola 113 2107.2 2.42 137 110 -4 606.2 -0.3 35 39
Sid Fernandez 113 1590.2 2.42 98 79 -4 89.2 -0.3 -4 8
Vida Blue 112 2664.2 2.04 170 120 -5 1163.2 -0.68 68 49
Brad Radke 111 1868.1 3.09 116 110 -6 367.1 0.37 14 39
Chris Short 111 1726.1 2.27 109 87 -6 225.1 -0.45 7 16
Dennis Leonard 111 1798.2 2.21 120 84 -6 297.2 -0.51 18 13
Denny Neagle 111 1311 2.64 90 60 -6 -190 -0.08 -12 -11
Except Marchman's point seems to be that Lackey's likely to tire out/get injured. I don't see how guys who threw considerably more innings but stayed healthy are anything but perfectly valid comps.
Rk Player SO W ERA+ SO/BB To From Age G GS L W-L% IP BB ERA1 Fergie Jenkins 1424 121 123 4.05 1968 1973 25-30 236 234 87 .582 1817.2 352 3.16
2 Steve Carlton 1330 101 115 2.06 1970 1975 25-30 228 226 85 .543 1713.0 647 3.26
3 David Cone 1329 90 121 2.57 1988 1993 25-30 203 193 59 .604 1399.0 517 3.06
4 Don Drysdale 1303 111 118 3.48 1962 1967 25-30 247 243 86 .563 1815.0 374 2.75
5 John Smoltz 1246 87 128 2.98 1992 1997 25-30 190 190 60 .592 1327.1 418 3.22
6 Gaylord Perry 1162 91 121 3.00 1964 1969 25-30 245 194 77 .542 1567.0 387 2.83
7 Don Sutton 1156 104 120 2.93 1970 1975 25-30 217 216 66 .612 1585.0 394 2.87
8 Jim Bunning 1139 98 123 2.62 1957 1962 25-30 235 205 68 .590 1514.2 435 3.27
9 Frank Viola 1109 108 122 2.74 1985 1990 25-30 215 215 73 .597 1514.0 405 3.41
10 Chris Short 1099 92 118 2.54 1963 1968 25-30 240 199 66 .582 1457.0 432 2.84
11 Andy Messersmith 1067 92 125 2.01 1971 1976 25-30 206 199 65 .586 1517.1 530 2.70
12 Bartolo Colon 1054 96 125 2.20 1998 2003 25-30 194 194 55 .636 1294.2 480 3.73
13 Bert Blyleven 1030 72 120 2.53 1976 1981 25-30 191 189 63 .533 1389.1 407 3.15
14 Jim Kaat 1010 104 116 3.01 1964 1969 25-30 234 216 73 .588 1525.2 335 3.03
15 Catfish Hunter 989 128 121 2.38 1971 1976 25-30 227 226 64 .667 1770.1 416 2.80
16 John Lackey 981 83 121 2.87 2004 2009 25-30 183 182 51 .619 1188.2 342 3.69
17 Robin Roberts 976 126 118 3.28 1952 1957 25-30 251 223 92 .578 1865.1 298 3.30
18 Jack McDowell 952 97 117 2.31 1991 1996 25-30 188 188 58 .626 1361.2 413 3.72
19 Bob Welch 933 80 115 2.41 1982 1987 25-30 189 187 62 .563 1273.0 387 3.12
20 Eddie Plank 892 129 119 2.20 1901 1906 25-30 223 213 79 .620 1812.1 406 2.59
21 Josh Beckett 889 80 117 3.41 2005 2009 25-29 151 151 42 .656 970.2 261 3.92
22 Jerry Koosman 875 79 124 2.26 1968 1973 25-30 192 178 66 .545 1308.1 387 2.82
23 Doug Drabek 857 90 117 2.44 1988 1993 25-30 204 202 68 .570 1424.0 351 3.04
24 Tim Hudson 854 88 126 2.24 2001 2006 25-30 194 194 52 .629 1312.1 382 3.47
25 Dave McNally 851 117 117 2.01 1968 1973 25-30 220 219 65 .643 1569.0 424 2.91
26 Denny Neagle 834 83 118 2.87 1994 1999 25-30 174 172 48 .634 1123.1 291 3.66
27 Andy Pettitte 819 95 117 2.08 1997 2002 25-30 184 183 53 .642 1188.1 394 3.91
28 Burt Hooton 773 85 117 2.06 1975 1980 25-30 194 191 59 .590 1339.1 376 3.04
29 Mark Buehrle 772 82 121 2.75 2004 2009 25-30 197 197 62 .569 1319.0 281 3.86
30 Charles Nagy 762 77 119 2.40 1992 1997 25-30 160 160 46 .626 1097.0 317 3.81
31 Larry Jackson 699 76 116 2.04 1956 1961 25-30 257 149 61 .555 1242.2 343 3.56
32 Jimmy Key 698 85 121 2.65 1986 1991 25-30 186 185 57 .599 1204.1 263 3.42
33 Doc White 674 107 122 2.23 1904 1909 25-30 205 180 66 .618 1472.1 302 1.98
34 Larry Jansen 649 96 120 2.29 1947 1951 26-30 200 170 57 .627 1338.1 284 3.33
35 Jesse Tannehill 553 96 122 2.96 1901 1905 26-30 160 148 51 .653 1276.1 187 2.38
36 Don Newcombe 528 76 119 2.36 1951 1956 25-30 141 128 29 .724 918.0 224 3.39
Provided by Baseball-Reference.com: View Play Index Tool Used
Generated 11/21/2009.
Player ERA+ IP SO/BB W L D/E+ D/IP D/KBB D/W D/LJimmy Key 122 1479 2.4 103 68 5 -22 -0.32 1 -3
Bartolo Colon 121 1388.2 2.13 100 62 4 -112.8 -0.59 -2 -9
David Cone 118 1521 2.47 95 65 1 20 -0.25 -7 -6
Andy Pettitte 118 1584.1 2.07 128 70 1 83.1 -0.65 26 -1
Matt Morris 117 1377.1 2.61 101 62 0 -123.9 -0.11 -1 -9
Gaylord Perry 117 1686 2.87 95 84 0 185 0.15 -7 13
Gary Nolan 117 1674.2 2.52 110 70 0 173.2 -0.2 8 -1
Alex Fernandez 115 1760.1 2.27 107 87 -2 259.1 -0.45 5 16
Kevin Millwood 114 1559.1 2.7 107 75 -3 58.1 -0.02 5 4
Jack McDowell 114 1753.1 2.16 119 77 -3 252.1 -0.56 17 6
Freddy Garcia 114 1427.1 2.22 99 62 -3 -73.9 -0.5 -3 -9
Doug Drabek 114 1732 2.36 108 88 -3 231 -0.36 6 17
Charles Nagy 114 1354 2.22 89 65 -3 -147 -0.5 -13 -6
Sid Fernandez 113 1590.2 2.42 98 79 -4 89.2 -0.3 -4 8
Chris Short 111 1726.1 2.27 109 87 -6 225.1 -0.45 7 16
Denny Neagle 111 1311 2.64 90 60 -6 -190 -0.08 -12 -11
Seems to me the best comps on that list are Key, Cone, Morris, Millwood, Garcia, El Sid, and then Nolan/Perry.
Player ERA+ IP SO/BB W L D/E+ D/IP D/KBB D/W D/L / ERA+ IPJimmy Key 122 1479 2.4 103 68 5 -22 -0.32 1 -3 122 1112
Bartolo Colon* 121 1388.2 2.13 100 62 4 -112.8 -0.59 -2 -9 97 688
David Cone 118 1521 2.47 95 65 1 20 -0.25 -7 -6 122 1377
Andy Pettitte* 118 1584.1 2.07 128 70 1 83.1 -0.65 26 -1 113 1342
Matt Morris 117 1377.1 2.61 101 62 0 -123.9 -0.11 -1 -9 86 428
Gaylord Perry 117 1686 2.87 95 84 0 185 0.15 -7 13 118 3664
Gary Nolan 117 1674.2 2.52 110 70 0 173.2 -0.2 8 -1 0 0
Alex Fernandez 115 1760.1 2.27 107 87 -2 259.1 -0.45 5 16 0 0
Kevin Millwood* 114 1559.1 2.7 107 75 -3 58.1 -0.02 5 4 100 755
Jack McDowell 114 1753.1 2.16 119 77 -3 252.1 -0.56 17 6 86 135
Freddy Garcia* 114 1427.1 2.22 99 62 -3 -73.9 -0.5 -3 -9 99 345
Doug Drabek 114 1732 2.36 108 88 -3 231 -0.36 6 17 84 803
Charles Nagy 114 1354 2.22 89 65 -3 -147 -0.5 -13 -6 82 600
Sid Fernandez 113 1590.2 2.42 98 79 -4 89.2 -0.3 -4 8 102 276
Chris Short 111 1726.1 2.27 109 87 -6 225.1 -0.45 7 16 88 477
Denny Neagle 111 1311 2.64 90 60 -6 -190 -0.08 -12 -11 95 579
2 guys who remain excellent over many IP, from each extreme of the spectrum in Vinay's list sorted by Ks (Cone and Key)
1 guy who sucked in about 2 years IP (Morris)
3 guys who were slightly above average for starter ERA+, two done after few innings (Garcia, El Sid) and one still going strong (Millwood)
and then Gaylord Perry and Gary Nolan, who are the complete extremes.
Given we're talking about pitchers over the age of 31, that is not a bad ####### list of comps.
25% -- established performance
25% -- average or worse performance
50% -- disaster
Marchman might have cherry-picked but the gist seems to be correct.
Note, that doesn't necessarily mean Lackey would be a bad signing at, say, 4/$60. At his peak, fangraphs had him at worth $20-23 (and that was in older dollars) and last year he was still worth $17.6. (I'm not a particularly big fan of these numbers but they at least give an independent valuation). So you could repeat the above table as something like
25% -- $70-80 M value
25% -- $40 M value
50% -- $10 M value
Still pretty ugly as that would give you an expected value of about $35 M, not a good expected return on a $60 M contract.
By the way, is there an easy way to take a P-I table and then look at the performance of those players going forward? I've been wanting that forever. Or did you have to calculate that from each player's record.
read with the knowledge of how the Mets handle injuries, gave me pause.
DB
On bb-ref, you can take a look at comps (by similarity score) for a given age, and then look at the performance of those players going forward. On looking at Lackey's comps, I'd say the percentages are more like:
15% -- established performance
55% -- average or worse performance
30% -- disaster
Using your weights, this would come out to a similar expected return of $37 M on his contract. It's definitely not worth it to the Mets on a 4- or 5-year basis. It's really only worth it for the Mets if you believe that Lackey will help them win in 2010 or 2011 while Wright, Beltran, Reyes, Santana, and Rodriguez remain at or near peak.
When you look at the age 31-35 performance of the comps, it is unimpressive. One, Bunning, the one Hall of Famer, significantly improved versus his pre age 30 seasons. Flanagan improved slightly (99 ERA+ to 104 ERA+). The rest arguably declined and delivered league-average performance or worse and ate (or in the case of Millwood will eat) fewer than 1000 IP.
Lackey's closest comps through age 30 have scores from 949-977. They are, in descending order, Kevin Millwood (ERA+ of 114 through age 30), Mike Flanagan (99), Bartolo Colon (121), Freddy Garcia (113), John Smiley (103), Matt Morris (116), Jim Bunning (118), Bill Gullickson (101), Brad Penny (104), and Pat Hentgen (113).
Five of the seven retired pitchers pitched through their age 35 seasons; Smiley pitched two years and Morris only three. (Penny's age 31 season was 173 IP at 94 ERA+. Millwood (754 IP at 100 ERA+) and Garcia (345 IP at 99 ERA+) just concluded their age 34 seasons.)
And this is one reason why 1-year fliers, even though they don't solve the hole in your rotation long-term, are attractive.
Unless things have changed, those are not era-adjusted and as such, especially for a guy like Lackey, the comps are next to useless.
I don't see why Marchman chose K/BB rates instead of K/9 rates, other than the convenience of using one rate instead of two (K/9 and BB/9). On the list shown in [3], there is a weak correlation between the K/BB and K/9 data. Lackey's K/9 rate of 7.2 puts him 9th of 38, with a range from 5.0 to 8.4 and a median of 6.4.
The K/9 rate shows where Marchman's comparison set for Lackey is less balanced. All 5 have a similar WHIP (Drabek-1.18, Morris-1.27, Lackey-1.31, Fernandez-1.28, McDowell-1.28). All 5 have a similar BB/9 (Drabek-2.32, Morris-2.47, Lackey-2.64, Fernandez-2.82, McDowell-2.90). However, Lackey's strikeout rate is nowhere near the others: Drabek's K/9 rate was 5.47, Matt Morris 6.44, Lackey 7.2, Alex Fernandez 6.40 and Jack McDowell 6.24.
In 2009, 78 MLB pitchers pitched 162+ innings. The median K/9 rate was about 7.0; 12 had rates of 5 or less. The median K/BB rate was 2.33, the same or worse than three of the four Marchman comps, and 33 of the 78 pitchers had K/BB rates worse than 2.16, i.e., what Marchman quoted as Jack McDowell's multiyear average.
I would suspect but cannot prove that the K/9 rate is the clearer predictor of performance decline than K/BB. (Of course, it is clearer to look at separate K/9 and BB/9 rates than K/BB.) The K/9 rate should drop as a pitcher ages and the BB/9 rate should rise but less drastically if the power/change of pace "skills" age quicker than the control "skills." Does the K/BB rate fluctuate from year to year over a typically pitcher's career? I don't have a good feel for it.
"not era-adjusted ... next to useless" ?!?
Did you even take a look at Lackey's comps? Five, including his closest, pitched in the 2000s. 7 of 10 pitched in the 1990s or 2000s. Hell, the 1st, 3rd, 4th and 5th closest comps are on your final list in [13]! 4 of 10!
By comparison, in your first list, 29 of the 38 pitchers played before the year 2000. Sure, you _tried_ to control for era using ERA+, but did that really control for all the things that have changed that affect the longevity of a pitcher? These include training regimens, nutrition, medical procedures (note Tommy John was on the list), use of video, five man rotations, etc.
In your final list in [13], 10 of 16 pitchers never pitched in the 2000s. 3 pitched in the 1960s. Gaylord Perry through a spitball. Again, does that list "control for the era" any better than Lackey's similarity score comps? 12 of these comps have a K/9 rate lower than Lackey, between 5.0 and 6.9. A different subset of 12 have a BB/9 rate higher than Lackey, from 2.7 to 3.4.
After looking at all this data, I think Millwood probably is a pretty good comp. While ERA+ trend and peak through age 30 are not exactly the same, the ERA+, WHIP, K/9 and BB/9 rates through age 30 are pretty close.
Lackey comps through age 30:
<br>
<br>Sim Player From To Years<br>
John Lackey 2002-2009<br>
977 Kevin Millwood 1997-2005<br>
967 Mike Flanagan 1975-1982<br>
962 Bartolo Colon 1997-2003<br>
961 Freddy Garcia 1999-2005<br>
956 Matt Morris 1997-2005<br>
956 John Smiley 1986-1995<br>
951 * Jim Bunning 1955-1962<br>
950 Bill Gullickson 1979-1989<br>
949 Brad Penny 2000-2008<br>
949 Pat Hentgen 1991-1999<br>
I'm a Lackey fan, and that made me laugh.
By comparison, in your first list, 29 of the 38 pitchers played before the year 2000. Sure, you _tried_ to control for era using ERA+, but did that really control for all the things that have changed that affect the longevity of a pitcher? These include training regimens, nutrition, medical procedures (note Tommy John was on the list), use of video, five man rotations, etc.
In your final list in [13], 10 of 16 pitchers never pitched in the 2000s. 3 pitched in the 1960s. Gaylord Perry through a spitball. Again, does that list "control for the era" any better than Lackey's similarity score comps? 12 of these comps have a K/9 rate lower than Lackey, between 5.0 and 6.9. A different subset of 12 have a BB/9 rate higher than Lackey, from 2.7 to 3.4.
Well, I would argue the exact opposite. The fact that an inordinate number of comps on the list come from the last 10 years, when the whole point of similarity scores is that they are supposed to look at guys across all eras and say "These are the dudes a lot like that dude, and so looking at what these dudes did will be instructive to figuring out what this dude might do." (ignoring the HOF-worthiness discussion that actually led to SS), is just further proof that they aren't era-adjusted. I mean, we all know they aren't, but even if you didn't, if you looked at a list purporting to find comparables from the past and 7 of 10 guys were exact contemporaries or damn close, you'd say "These aren't era-adjusted."
Because what that kind of extreme overrepresentation of contemporaries shows you is that you're essentially seeing era differences overwhelm actual player differences. And that's pretty true. The difference between Tanana and Lackey's raw numbers is more due to the fact they pitched in different eras than it is due to their personal/talent differences. Which is, of course, why we adjust for era. The issue with SS not doing so is that you have huge era effects (the increase in strikeouts being a major one here, along with the decrease in IP per year) that shape the careers/numbers of every player that played in the era, and so if you don't adjust them out, you end up with the only guys having a chance at being similar being those who were contemporaries.
And so yes, 10 of 16 pitchers (when looking at all pitchers in the expansion era) not pitching in the 2000s *is* adjusted for era, especially moreso than similarity scores that are explicitly *not adjusted for era*. We can reasonably say that "pitched in the 2000s" covers a period of about 15-20 debut years. So of the ~50 year expansion era, ~15-20 of those would result in guys who pitched in the 2000s. This actually overstates the number because of the qualifications Marchman used (kind of.) For a guy right now to have pitched in the 2000s and yet be around long enough to compile those baselines, he'd have had to debut some time between 1984/1985 and 2003 or so, but those are the edges. The vast bulk of the representative population probably debuted between 1990 and 2001, inclusive. Anyway, 15-20 of 50 means a properly balanced for era list of 16 should have between 4 and 5.33 pitchers who pitched in the 2000s.
As for Gaylord Perry through a spitball
So?
12 of these comps have a K/9 rate lower than Lackey, between 5.0 and 6.9. A different subset of 12 have a BB/9 rate higher than Lackey, from 2.7 to 3.4.
What, are you asking for a list where exactly 8 are above and exactly 8 below in each and every category?
No, what I want is a set of starting pitchers who are similar to Lackey through age 30. (I'd rather have a couple good comps than many lousy comps.)
By similar, I mean pitchers who have achieved similar results, had a similar workload in similar environments who have achieved their results through similar skills. ERA+ controls for similarity of rate results across different run scoring environments. IP controls for the quantity of those results and somewhat for workload. SO/BB tells you next to nothing about whether the set of pitchers has similar skills, i.e. high strikeout power pitcher, low walk control pitcher, ground ball or fly ball pitcher. Neither IP nor ERA+ addresses changes in training, medicine, days of rest, pitch counts, etc., and bullpen usage across eras.
No. The significant differences between Tanana and Lackey in assessing longevity are:
(a) Tanana's first full major league was at age 20 in 1974, but Lackey came up at age 23 in 2002.
(b) Tanana pitched 2,350 innings through age 30, but Lackey has pitched only 1,500 innings through age 30;
(c) Tanana pitched about 7.20 innings per start through age 30, Lackey only 6.44; and
(d) Tanana completed 116 games through age 30, Lackey only 14.
It's not the use of ERA+ that shows that these two men are quite different from an actuarial perspective. Tanana is 35th on the career list for innings pitched (at 4,188 innings of 106 ERA+ pitching.) Does Lackey's IP history indicate that he is or will be the workhorse that Tanana was? That's not a bet I would take. (As you probably know, "similarity scores" are calculated to adjust for differences in innings pitched.) To your credit, your final list narrowed down the list of comps using IP.
1. I should have written Perry "threw" a spitball. I was tired.
2. Written correctly, it goes to my point about similar skills.
Gaylord Perry is 6th on the career leaderboard for innings pitched at 5,350. The reputation and ability to throw a spitball undoubtedly contributed significantly to his longevity. Lackey doesn't throw a spitball. Therefore, they do not have similar skills. Therefore, Perry is a poor comp for Lackey.
Similar skills can be far better assessed through component statistics--such as strikeout rate, walk rate, and groundball rate--than through ERA+. The skills are the means to the end and pitchers with difference skills will obviously age differently, even starting from the same or similar levels of effectiveness as gauged by ERA+.
David Cone's K/9 rate through age 30 was 8.4 and his BB/9 rate through age 30 was 3.4. For Jimmy Key, it was 5.0 and 2.1, respectively. Lackey is 7.2 and 2.6. To roughly adjust for era, Cone was more of (or more effective) a strikeout pitcher than Lackey than the raw numbers show, as shown by his more frequent appearances on annual leaderboards. Similarly, Key was a more effective control pitcher, i.e., at limiting walks.
As [20] noted, Morris sucked in 2 years because he was injured at age 29. So his pre age-30 health history makes him a poor comp for Lackey.
Sid Fernandez has similar K/9 and BB/9 rates through age 30 as his contemporary David Cone: 8.3 and 3.4. However, he was an extreme flyball pitcher, far more so than Lackey is, for example.
That brings you to Garcia (6.7 and 3.0) and Millwood (7.4 and 2.7), who, by the way, are the third and first most similar pitchers to Lackey through age 30, with sim scores of 961 and 977, respectively. Interesting. ;)
1. There's little evidence that "similar players" are useful for projecting future performance. Systems like ZiPS and PECOTA (and maybe others now) make some use of sims (based on the last 3-4 years of performance, not career to date) in projecting future performance but they have a minimal effect. In short, sims are a fun toy, not a serious too. As such, quit getting worked up about them.
2. Sim scores don't adjust for era at all and, as a tool, are close to useless. (As a toy ...) Sometimes they come up with reasonable comps, sometimes they don't. P-I will almost certainly come up with a better set of comps. Still, for these sorts of questions, it's probably better to look at sims over a limited age range (and possibly limited era range) rather than whole career.
3. But P-I is far from perfect. If we really wanted to era-adjust, we'd probably have to look at Lackey's K-rate vs. league average and compare that to others (for example). You can't do that in P-I.
4. Both Jeff and Bob are in some sense correct about the effects of era adjustment. You don't really want a lot of older comps because so many of the underlying factors of the game have changed since then you should feel uncomfortable making the assumption that aging patterns weren't dramatically affected. That is, even if Gary Nolan (in context) was similar to Lackey through a given age, it's still a huge assumption to think that means Nolan's aging is indicative of Lackey's future.
The ideal would be to have players from the same era because whatever era effects there are will be zeroed out. The problems there of course are that you necessarily will have a small sample size from which to draw your comps and, being recent players, you will have little/no future performance information on the comps. In short, you're screwed both ways and the best you can do (within the context of people bloviating on the web) is try to balance the two in your argument.
5. If we make a mistake, it's in insisting the comps be too similar. 1-2 good comps, and nothing else, do you absolutely no good. If you fine-tune your comps enough, you'll end up with nobody. It's not at all clear why you'd expect (for example) a 2B with a career OPS+ of 114 through age 30 to age substantially differently than one with a 100 OPS+. You don't look at your tailored set of comps and say "the average future performance was X, my guy is going to be X." You look at a broader set of comps and say "these guys tended to decline by about 10% in OPS+ and 15% in playing time." There would be reasons to think the 114 OPS+ 2B won't see as big a decline in playing time (because he'll likely still be an average or better hitter for his position) but the pool of "2B who started from ages 28-30" is likely to be more useful than "the 2 guys who are a good match based on a combination of OPS+, HR rate, BB rate and K rate."
5a. Actually my preferred approach is similar to what Jeff did. Start with the broad set, pick up the general pattern. Narrow the focus to see if the pattern shifts substantially. Narrow again (or in a different way) and see if something different emerges. If you keep coming up with fairly similar results, you have more confidence.
6. Tanana is a lousy comp for ANYBODY except maybe Doc Gooden. Tanana was looking like a future HoFer before his arm came up lame at 24 and fell off at 25. He managed to come back as more of a finesse innings-eating lefty ... who oddly got a nice boost to his K/9 rate in his early 30s. Anyway, Tanana 19-23 had a 131 ERA+, 7.8 K/9 and 3.2 K/BB. Tanana 24-29 had a 100 ERA+ with 5 K/9 and a 2.0 K/BB. Completely different pitcher.
6a. Now, if you have a 29-year-old finesse lefty and you're curious as to how he might do from ages 30-33, then Tanana (ages 25-29) might be a perfectly good comp for that guy.
(a) Tanana's first full major league was at age 20 in 1974, but Lackey came up at age 23 in 2002.
(b) Tanana pitched 2,350 innings through age 30, but Lackey has pitched only 1,500 innings through age 30;
(c) Tanana pitched about 7.20 innings per start through age 30, Lackey only 6.44; and
(d) Tanana completed 116 games through age 30, Lackey only 14.
Other than the fact that Tanana came up at age 20 vs. 23 for Lackey (and possibly even that), every single one of these goes exactly to my point. The biggest difference in the raw numbers between Lackey and Tanana is that Tanana pitched in a time where starters went deeper in games, got more decisions, and threw more complete games. Those factors, the extra innings worked, combine to create a huge differential in the raw numbers, where looking at the rate stats shows much more similar pitchers. The ERA+ is 2 points apart. The K/BB ratio is .19 apart. The K/9 ratios are 7.2/6.3 and the BB/9 ratios are 2.9/2.5 (Lackey/Tanana.)
Similar skills can be far better assessed through component statistics--such as strikeout rate, walk rate, and groundball rate--than through ERA+. The skills are the means to the end and pitchers with difference skills will obviously age differently, even starting from the same or similar levels of effectiveness as gauged by ERA+.
I don't at all disagree. I didn't pick ERA+, Marchman did. But I don't see how this applies to your point about era-adjustments. I'm saying that the components of similarity scores should be era-adjusted, you're saying that it's better that they're not because, from what I understand, you would rather deal with a list comprised of worse comps (or less comps) and not have to deal with era adjusting than have a bigger pool and better comps but have to do the admittedly-imperfect era adjusting. I don't think your stance is ridiculous or anything, but I do think it's inferior.
Jeff:
Thanks for a good discussion.
I'm not advocating for similarity scores. Their implementation on bb-ref is very nice because you can compare the 10 most "similar" players to your subject at age X, and then with one click you can see the totals for the same players from age X+1 through the end of their careers (or to the present for active players). I wouldn't dismiss any data source out of hand, as it may be useful in generating or refining one's hypothesis. For example, had you looked at the sim score data, you'd see that the IP to age 30 for the 10 comps ranged from 1377 IP -1644 IP, in other words, +/- 120 to 140 IP around Lackey's 1501. You may have approached your first IP constraint differently from merely setting IP > 1200.
I agree. ERA+ is well suited to assess results across different run-scoring environments.
However, I repeat, that to assess skills, not results, one needs to at component statistics, and then adjust those for the era in which a pitcher played, in order to get a sense how Lackey's skills compare to the pool of comps. (You could eyeball it using the annual leaderboards, or use P-I to generate the data for starting pitchers on a year-by-year basis and calculate the mean and standard deviations manually.)
I don't think Lackey will age as well as Cone or Key did, but they are not an unreasonable ceiling for your expectations of Lackey. Among comps, I still think Millwood represents the most likely trajectory for Lackey: solid but uneven #2 performance, declining over time, without much "ace" type upside.
FYI - Here's the Transaction Oracle post from Millwood's 12/2005 signing
John Smiley
Frank Viola
Harvey Haddix
Bruce Hurst
Bobby Ojeda
Mickey Lolich
Billy Pierce
Jon Matlack
Mike Flanagan
Charlie Leibrandt
This is all, of course, just one fan's opinion, but I think the Mets should go after Lackey because it makes them a better team. I don't think Lackey alone makes the Mets serious contenders, but I do think that Lackey combined with a serious upgrade at one of the corner outfield positions (i.e. Matt Holliday) does. I think the Beltran window should be a factor in motivating the Mets front office to do all that it can to make the team into a contender for 2010.
DB
It depends on the money, of course, but at something like 6/120, which seems high to me, I'd rather have Holliday than Lackey at 5/80. The former seems much more likely to help in 2010 and 2011 without the fairly serious risk of punting 2012-2014.
You must be Registered and Logged In to post comments.
<< Back to main