Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Baseball Newsstand > Baseball Primer Newsblog > Discussion
Baseball Primer Newsblog
— The Best News Links from the Baseball Newsstand

Sunday, July 26, 2009

WSJ: Marchman: A Computer Cracks the Cooperstown Code

I guess this is good news for all you Andres Galarraga fan out there.

Now that Rickey Henderson and Jim Rice have been inducted into the National Baseball Hall of Fame, fans the country over can return to the only thing they like as much as watching games: Arguing over who should join them in Cooperstown. Sadly, machine learning may have just taken much of the steam out of such arguments.

According to research by Lloyd Smith, a professor of computer science at Missouri State University, and James Downey, an assistant professor of management information systems at the University of Central Arkansas, whether a player is elected to the Hall of Fame by veteran baseball writers is an entirely predictable outcome based on a few statistics.

...Of the 1,592 players considered by their study—anyone who retired between 1950 and 2002 and met several other criteria such as having played 10 years in the majors—the model was able to accurately identify whether they had been elected 98.7% of the time. Before the 2009 class was voted in, the model gave Mr. Henderson a 97.2% chance of being elected. On the flip side, Mo Vaughn had a 1.3% chance.

There is, though, good news for fans in the upcoming elections. Just as the public does, the model sees such borderline cases as Edgar Martinez and Mike Mussina as having essentially 50/50 chances of making Cooperstown. So let barroom debates commence.

Repoz Posted: July 26, 2009 at 10:04 PM | 22 comment(s) | Login to Bookmark
  Related News: GeneralHistoryHall of FameSabermetricsProjections

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

Page 1 of 1 pages
   1. Swedish Chef Posted: July 26, 2009 at 10:09 PM (#3267709)
I thought they had finally quantified fear.
   2. cardsfanboy Posted: July 26, 2009 at 10:27 PM (#3267725)
isn't this basically just a different version of the hof standards? Of course the article didn't list Rices chances(or I missed it)
   3. Athletic Supporter leads the nation in drifters Posted: July 26, 2009 at 10:38 PM (#3267736)
...Of the 1,592 players considered by their study—anyone who retired between 1950 and 2002 and met several other criteria such as having played 10 years in the majors—the model was able to accurately identify whether they had been elected 98.7% of the time. Before the 2009 class was voted in, the model gave Mr. Henderson a 97.2% chance of being elected. On the flip side, Mo Vaughn had a 1.3% chance.

This doesn't seem that impressive. The model missed 21 or so people, or one every couple of years. How many players of the recent HoF classes have been even questionable? (I'm assuming this is BBWAA-only.)
   4. PreservedFish Posted: July 26, 2009 at 10:56 PM (#3267754)
98.7 sounds pretty good to me.
   5. puck Posted: July 26, 2009 at 11:00 PM (#3267756)
Todd Helton, 22.3%. I think that makes sense as like Larry Walker, he's low in the counting stats given that he's associated with Coors Field. Assuming he puts in a few more decent years, he'll be an odd case in that voters will have to deal with the Coors field issue, and there's also kind of a glut of great 1st basemen to compare him to--McGwire, Bagwell and Thomas had their peaks before his, and then Pujols hit first base just as his power started to go.
   6. JRVJ (formerly Delta Socrates) Posted: July 26, 2009 at 11:21 PM (#3267772)
I think Mussina has a much better chance of making it than Edgar Martinez, and I think voting will show that.
   7. PreservedFish Posted: July 26, 2009 at 11:23 PM (#3267775)
There is, however, something I don't understand about the numbers.

Where does the 98.7% come from? It seems impossible when I look at it more carefully.

If there are 50 players score around 75%, and the model is perfect, it will miss on 12-13 of those cases. If there are 50 players that score 52%, then the model gets an A+ if only 26 of them are elected. But you missed on 48% of them, right? You don't know which of those guys would get the call and which wouldn't.

How do they massage these numbers into 98.7?

Also, allow me to be the first to point out the classic skeptical stathead point this hasn't been proven to have any actual predictive value because they are just running correlations on what has already happened.
   8. Swedish Chef Posted: July 26, 2009 at 11:28 PM (#3267778)
What does it take to get a 100% around here, Mr. Computer?

The Corky Miller effect tells us there's always a risk that too many the senile half of the BBWAA forget that a player exists.
   9. Jolly Old St. Neck Wound, Moral Idiot Posted: July 27, 2009 at 12:19 AM (#3267819)
What did Mr. Computer figure Mark McGwire's percentage to be? What about Clemens'?
   10. Joe Mauer Power Hour Posted: July 27, 2009 at 01:54 AM (#3267903)
Sadly, machine learning may have just taken much of the steam out of such arguments.

Because if there's one thing that settles baseball arguments, it's the use of advanced statistical methods.
   11. still hunting for a halo-red october (in Delphi) Posted: July 27, 2009 at 01:56 AM (#3267905)
Now, here's the question?

If/when Vlad gets into the Hall, which cap does he wear on his plaque?

Obviously, given the handle, you know where I stand.
   12. Joe Mauer Power Hour Posted: July 27, 2009 at 01:59 AM (#3267909)
All-Star Game appearances count for both, being especially valuable for hitters as they serve as a useful proxy for position.

If Jeff Kent has only a 14.2% chance of election, it mustn't be quite the useful proxy they think it is.
   13. The District Attorney Posted: July 27, 2009 at 02:41 AM (#3267930)
The Corky Miller effect tells us there's always a risk that too many the senile half of the BBWAA forget that a player exists.
I think the guy who didn't vote for Rickey was Corky Simpson, but I'm open to the possibility that the HOF voting may be controlled by an obscure journeyman backup catcher whose main skill was being hit by pitches.
   14. Morally Excellent Posted: July 27, 2009 at 02:45 AM (#3267935)
Ahh, thank you TDA. I was very confused by that comment. Random sidenote though, I'm surprised that Miller is still playing. He's actually gotten some PT in 9 consecutive seasons now; I wonder if he's the worst player to accomplish that feat.
   15. a bebop a rebop Posted: July 27, 2009 at 03:27 AM (#3267964)
I think Mussina has a much better chance of making it than Edgar Martinez, and I think voting will show that.

You don't happen to be a Yankees or O's fan, huh?
   16. jwb Posted: July 27, 2009 at 06:00 AM (#3268059)
Corky Miller: Nine seasons, a little over 3 years service time. He got his usual 60 days with the ChiSox this year.
   17. Crispix Attacks Posted: July 27, 2009 at 07:00 AM (#3268074)
If Jeff Kent has only a 14.2% chance of election, it mustn't be quite the useful proxy they think it is.


Well, it's also a proxy for playing on bad teams.

He's actually gotten some PT in 9 consecutive seasons now; I wonder if he's the worst player to accomplish that feat.


If there's a worse player, he's probably a catcher.

How about Mark Parent's chance of being the worst player to play in 13 consecutive seasons?

Or Gary Bennett, 11 consecutive seasons?
   18. Daunte Vicknabbit! Posted: July 27, 2009 at 07:10 AM (#3268075)
I have to assume people are not serious when they criticize the program for missing people, because it seems to me like there would be a pretty good answer to the mystery: there are guys in the Hall who make no sense even within the standard logic of the Hall. It would not surprise me in the slightest if their program dropped someone like Nellie Fox from the group (although I think he comes immediately before the cutoff of the tested period, this is just a general example). Either that or I'm the one missing something here.
   19. tl; dr (Voxter) Posted: July 27, 2009 at 07:18 AM (#3268077)
If/when Vlad gets into the Hall, which cap does he wear on his plaque?

All his best seasons were with the Expos, so that's where my vote goes.

He's actually gotten some PT in 9 consecutive seasons now; I wonder if he's the worst player to accomplish that feat.

I was going to suggest Paul Bako, but then I looked them both up, and Corky Miller makes Paul Bako look like Johnny Bench.

Also, if you search for "the Corky" at B-Ref, you get Corky Miller's page.
   20. Grovers Rocket Posted: July 27, 2009 at 12:10 PM (#3268172)
If there's a worse player, he's probably a catcher.

It wasn't consecutive but Matt Sinatro played parts of 10 ML seasons accumulating 276 PAs with a lifetime OPS+ of 34 (Miller is at 44).

Not the same thing but Dave Eiland, John Vukovich (career OPS+ of 20!) and Raul Chavez have the fewest career WinShares (5) of any player who played 10 or ML seasons.
Page 1 of 1 pages

You must be Registered and Logged In to post comments.

 

 

<< Back to main

Support BBTF

donate

Thanks to
Dingbat Charlie
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Hot Topics

NewsblogJeff Sullivan: The Worst Team Ever Projected?
(48 - 2:48am, Feb 10)
Last: Vaux, A.B.D.

NewsblogKnobler: Stay away from steroids -- but vote how you want
(18 - 2:10am, Feb 10)
Last: Something Other

NewsblogGrantland/Bill James: An Open Letter to the Hall of Fame About Dwight Evans
(26 - 2:04am, Feb 10)
Last: DanG

NewsblogL.A. Times: 11 bidders remain in running to buy Dodgers
(7 - 1:56am, Feb 10)
Last: Joe Kehoskie

NewsblogNYT: Alderson Remakes Needy Mets From Bottom Line Up
(41 - 1:37am, Feb 10)
Last: PreservedFish

NewsblogSources: Cubs’ Starlin Castro Accused Of Sexual Assault
(5843 - 1:12am, Feb 10)
Last: He's Bought a Bat Like Prince Fielder

NewsblogOrioles Scouts Banned from Korea
(3 - 1:11am, Feb 10)
Last: Tripon

NewsblogNY Daily News: Brian Cashman's accused stalker says Yankees GM misled feds on steroid probe
(48 - 1:05am, Feb 10)
Last: The Yankee Clapper

Newsblog'Duk: Tim Lincecum slims down with swim routine, loses appetite for McDonald’s
(282 - 12:44am, Feb 10)
Last: MM1f

NewsblogWhatever Happened to the Spitball?
(18 - 12:15am, Feb 10)
Last: Sleepy supports unauthorized rambling

Jim's Lab NotesPlease Excuse the Mess
(174 - 12:06am, Feb 10)
Last: SoSHially Unacceptable

NewsblogThe Book Blog: MGL: Today on Clubhouse Confidential
(78 - 11:50pm, Feb 09)
Last: RayDiPerna

NewsblogOT: NBA Monthly Thread, February 2012
(379 - 11:31pm, Feb 09)
Last: robinred

NewsblogYankees TV host Bob Lorenz charged with drunken driving; cops found YES anchor passed out in car
(15 - 10:56pm, Feb 09)
Last: boteman

NewsblogStrange Times in Baseball: 1891-1895
(13 - 10:48pm, Feb 09)
Last: boteman

Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets.

Demarini, Easton and TPX Baseball Bats

 

 

 

AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets.

Page rendered in 0.4171 seconds
40 querie(s) executed