Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Baseball Newsstand > Baseball Primer Newsblog > Discussion
Baseball Primer Newsblog
— The Best News Links from the Baseball Newsstand

Sunday, July 26, 2009

WSJ: Marchman: A Computer Cracks the Cooperstown Code

I guess this is good news for all you Andres Galarraga fan out there.

Now that Rickey Henderson and Jim Rice have been inducted into the National Baseball Hall of Fame, fans the country over can return to the only thing they like as much as watching games: Arguing over who should join them in Cooperstown. Sadly, machine learning may have just taken much of the steam out of such arguments.

According to research by Lloyd Smith, a professor of computer science at Missouri State University, and James Downey, an assistant professor of management information systems at the University of Central Arkansas, whether a player is elected to the Hall of Fame by veteran baseball writers is an entirely predictable outcome based on a few statistics.

...Of the 1,592 players considered by their study—anyone who retired between 1950 and 2002 and met several other criteria such as having played 10 years in the majors—the model was able to accurately identify whether they had been elected 98.7% of the time. Before the 2009 class was voted in, the model gave Mr. Henderson a 97.2% chance of being elected. On the flip side, Mo Vaughn had a 1.3% chance.

There is, though, good news for fans in the upcoming elections. Just as the public does, the model sees such borderline cases as Edgar Martinez and Mike Mussina as having essentially 50/50 chances of making Cooperstown. So let barroom debates commence.

Repoz Posted: July 26, 2009 at 11:04 PM | 22 comment(s) Login to Bookmark
  Tags: hall of fame, history, projections, sabermetrics

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

   1. Swedish Chef Posted: July 26, 2009 at 11:09 PM (#3267709)
I thought they had finally quantified fear.
   2. cardsfanboy Posted: July 26, 2009 at 11:27 PM (#3267725)
isn't this basically just a different version of the hof standards? Of course the article didn't list Rices chances(or I missed it)
   3. Athletic Supporter leads the nation in drifters Posted: July 26, 2009 at 11:38 PM (#3267736)
...Of the 1,592 players considered by their study—anyone who retired between 1950 and 2002 and met several other criteria such as having played 10 years in the majors—the model was able to accurately identify whether they had been elected 98.7% of the time. Before the 2009 class was voted in, the model gave Mr. Henderson a 97.2% chance of being elected. On the flip side, Mo Vaughn had a 1.3% chance.

This doesn't seem that impressive. The model missed 21 or so people, or one every couple of years. How many players of the recent HoF classes have been even questionable? (I'm assuming this is BBWAA-only.)
   4. PreservedFish Posted: July 26, 2009 at 11:56 PM (#3267754)
98.7 sounds pretty good to me.
   5. puck Posted: July 27, 2009 at 12:00 AM (#3267756)
Todd Helton, 22.3%. I think that makes sense as like Larry Walker, he's low in the counting stats given that he's associated with Coors Field. Assuming he puts in a few more decent years, he'll be an odd case in that voters will have to deal with the Coors field issue, and there's also kind of a glut of great 1st basemen to compare him to--McGwire, Bagwell and Thomas had their peaks before his, and then Pujols hit first base just as his power started to go.
   6. JRVJ (formerly Delta Socrates) Posted: July 27, 2009 at 12:21 AM (#3267772)
I think Mussina has a much better chance of making it than Edgar Martinez, and I think voting will show that.
   7. PreservedFish Posted: July 27, 2009 at 12:23 AM (#3267775)
There is, however, something I don't understand about the numbers.

Where does the 98.7% come from? It seems impossible when I look at it more carefully.

If there are 50 players score around 75%, and the model is perfect, it will miss on 12-13 of those cases. If there are 50 players that score 52%, then the model gets an A+ if only 26 of them are elected. But you missed on 48% of them, right? You don't know which of those guys would get the call and which wouldn't.

How do they massage these numbers into 98.7?

Also, allow me to be the first to point out the classic skeptical stathead point this hasn't been proven to have any actual predictive value because they are just running correlations on what has already happened.
   8. Swedish Chef Posted: July 27, 2009 at 12:28 AM (#3267778)
What does it take to get a 100% around here, Mr. Computer?

The Corky Miller effect tells us there's always a risk that too many the senile half of the BBWAA forget that a player exists.
   9. Jolly Old St. Neck Wound, Moral Idiot Posted: July 27, 2009 at 01:19 AM (#3267819)
What did Mr. Computer figure Mark McGwire's percentage to be? What about Clemens'?
   10. JMPH Posted: July 27, 2009 at 02:54 AM (#3267903)
Sadly, machine learning may have just taken much of the steam out of such arguments.

Because if there's one thing that settles baseball arguments, it's the use of advanced statistical methods.
   11. still hunting for a halo-red october (in Delphi) Posted: July 27, 2009 at 02:56 AM (#3267905)
Now, here's the question?

If/when Vlad gets into the Hall, which cap does he wear on his plaque?

Obviously, given the handle, you know where I stand.
   12. JMPH Posted: July 27, 2009 at 02:59 AM (#3267909)
All-Star Game appearances count for both, being especially valuable for hitters as they serve as a useful proxy for position.

If Jeff Kent has only a 14.2% chance of election, it mustn't be quite the useful proxy they think it is.
   13. The District Attorney Posted: July 27, 2009 at 03:41 AM (#3267930)
The Corky Miller effect tells us there's always a risk that too many the senile half of the BBWAA forget that a player exists.
I think the guy who didn't vote for Rickey was Corky Simpson, but I'm open to the possibility that the HOF voting may be controlled by an obscure journeyman backup catcher whose main skill was being hit by pitches.
   14. Shock Posted: July 27, 2009 at 03:45 AM (#3267935)
Ahh, thank you TDA. I was very confused by that comment. Random sidenote though, I'm surprised that Miller is still playing. He's actually gotten some PT in 9 consecutive seasons now; I wonder if he's the worst player to accomplish that feat.
   15. a bebop a rebop Posted: July 27, 2009 at 04:27 AM (#3267964)
I think Mussina has a much better chance of making it than Edgar Martinez, and I think voting will show that.

You don't happen to be a Yankees or O's fan, huh?
   16. jwb Posted: July 27, 2009 at 07:00 AM (#3268059)
Corky Miller: Nine seasons, a little over 3 years service time. He got his usual 60 days with the ChiSox this year.
   17. Crispix Attacks Posted: July 27, 2009 at 08:00 AM (#3268074)
If Jeff Kent has only a 14.2% chance of election, it mustn't be quite the useful proxy they think it is.


Well, it's also a proxy for playing on bad teams.

He's actually gotten some PT in 9 consecutive seasons now; I wonder if he's the worst player to accomplish that feat.


If there's a worse player, he's probably a catcher.

How about Mark Parent's chance of being the worst player to play in 13 consecutive seasons?

Or Gary Bennett, 11 consecutive seasons?
   18. Daunte Vicknabbit! Posted: July 27, 2009 at 08:10 AM (#3268075)
I have to assume people are not serious when they criticize the program for missing people, because it seems to me like there would be a pretty good answer to the mystery: there are guys in the Hall who make no sense even within the standard logic of the Hall. It would not surprise me in the slightest if their program dropped someone like Nellie Fox from the group (although I think he comes immediately before the cutoff of the tested period, this is just a general example). Either that or I'm the one missing something here.
   19. Infinite Yost (Voxter) Posted: July 27, 2009 at 08:18 AM (#3268077)
If/when Vlad gets into the Hall, which cap does he wear on his plaque?

All his best seasons were with the Expos, so that's where my vote goes.

He's actually gotten some PT in 9 consecutive seasons now; I wonder if he's the worst player to accomplish that feat.

I was going to suggest Paul Bako, but then I looked them both up, and Corky Miller makes Paul Bako look like Johnny Bench.

Also, if you search for "the Corky" at B-Ref, you get Corky Miller's page.
   20. GroversRocket Posted: July 27, 2009 at 01:10 PM (#3268172)
If there's a worse player, he's probably a catcher.

It wasn't consecutive but Matt Sinatro played parts of 10 ML seasons accumulating 276 PAs with a lifetime OPS+ of 34 (Miller is at 44).

Not the same thing but Dave Eiland, John Vukovich (career OPS+ of 20!) and Raul Chavez have the fewest career WinShares (5) of any player who played 10 or ML seasons.

You must be Registered and Logged In to post comments.

 

 

<< Back to main

Support BBTF

donate

Thanks to
Backlasher
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Hot Topics

NewsblogMaddon on Red Sox beaning Luke Scott: 'I think it's ridiculous, I think it's absurd, idiotic'
(4 - 7:58am, May 26)
Last: TomH

NewsblogYESNetwork: A look at five Yankees' cases for enshrinement in Monument Park
(1 - 7:55am, May 26)
Last: bjhanke

NewsblogHP: Baseball is leaving the human factor behind
(60 - 7:55am, May 26)
Last: Designated Sitter (GGC)

NewsblogWilmoth: Nate McLouth Designated For Assignment
(13 - 7:52am, May 26)
Last: Russ

NewsblogOT: NBA Monthly Thread, May 2012
(1835 - 7:45am, May 26)
Last: thok

NewsblogThe Hall of Very Good: Former Cards Slugger Critical of "LaRussa's Regime"
(6 - 7:16am, May 26)
Last: Shooty: Applying to be Fearless Leader

NewsblogMatschulat: Did I Miss The "Paul Konerko Is So Overrated OMG" Bandwagon?
(30 - 7:15am, May 26)
Last: baudib

NewsblogCSN to host ‘Phillies at the Beach’ on Memorial Day
(19 - 7:11am, May 26)
Last: God

NewsblogT.R. Sullivan: Of Frank Robinson, Milt Pappas and Jim Palmer
(10 - 7:09am, May 26)
Last: God

NewsblogBud Selig -- No need for more MLB replay for now - ESPN
(88 - 6:12am, May 26)
Last: Lassus

NewsblogHimrich’s Top Ten Target Field Foods
(8 - 2:43am, May 26)
Last: Long John McCaine Mutiny on the Bounty (scott)

NewsblogBoston.com: Curt Schilling’s 38 Studios lays off all staff
(119 - 1:28am, May 26)
Last: Swedish Chef

Hall of MeritMost Meritorious Player: 1973 Discussion
(15 - 12:13am, May 26)
Last: DanG

Hall of MeritMost Meritorious Player: 1972 Ballot
(28 - 11:25pm, May 25)
Last: lieiam

Sox TherapyA Winning Ballclub?
(20 - 11:24pm, May 25)
Last: Dan

Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets.

Demarini, Easton and TPX Baseball Bats

 

 

 

AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets.

Page rendered in 0.1905 seconds
54 querie(s) executed