|
|
|
|
Baseball Primer Newsblog— The Best News Links from the Baseball Newsstand
Tuesday, March 09, 2010
As most people who frequent this site know, I’m the guy responsible for ZiPS. Auditing the results is extremely important for projection systems as otherwise, one would only have our word that the results mean anything.
This year, the independent audits that I know about are Tango’s creative experiment for the year (while Tango’s the Marcel creator, he wants Marcel to lose) and the recent evaluations done by Dash Davidson, Peter Rosenbloom, and Jared Cross (the j cross that posts here?)
Davidson/Rosenbloom/Cross, 2009 Hitter Forecast Evaluations
Davidson/Rosenbloom/Cross, Overall Hitter Projection Standings, By Probability System is Better Than Marcel
Davidson/Rosenbloom/Cross, 2009 Pitcher Forecast Evaluations
Tango: Forecasters Challenge 2009
Dan Szymborski
Posted: March 09, 2010 at 02:58 PM | 22 comment(s)
| Login to Bookmark
Related News: Projections
|
Support BBTF
Thanks to Guts for his generous support.
Bookmarks
You must be logged in to view your Bookmarks.
Hot Topics
Newsblog: Grantland/Bill James: An Open Letter to the Hall of Fame About Dwight Evans (23 - 11:03pm, Feb 09)Last: SoSHially UnacceptableNewsblog: Whatever Happened to the Spitball? (17 - 11:00pm, Feb 09)Last: RichardNewsblog: Jeff Sullivan: The Worst Team Ever Projected? (35 - 10:59pm, Feb 09)Last: Gonfalon BubbleNewsblog: Yankees TV host Bob Lorenz charged with drunken driving; cops found YES anchor passed out in car (15 - 10:56pm, Feb 09)Last: botemanNewsblog: Sources: Cubs’ Starlin Castro Accused Of Sexual Assault (5814 - 10:52pm, Feb 09)Last:  ray jamesNewsblog: Strange Times in Baseball: 1891-1895 (13 - 10:48pm, Feb 09)Last: botemanNewsblog: OT: NBA Monthly Thread, February 2012 (378 - 10:47pm, Feb 09)Last:  Los Angeles ALBERT F. PUJOLS of AnaheimNewsblog: 'Duk: Tim Lincecum slims down with swim routine, loses appetite for McDonald’s (281 - 10:43pm, Feb 09)Last:  phredbirdNewsblog: The Book Blog: MGL: Today on Clubhouse Confidential (77 - 10:33pm, Feb 09)Last: villageidiomNewsblog: NYT: Alderson Remakes Needy Mets From Bottom Line Up (29 - 10:24pm, Feb 09)Last: PreservedFishNewsblog: NY Daily News: Brian Cashman's accused stalker says Yankees GM misled feds on steroid probe (45 - 10:22pm, Feb 09)Last: villageidiomNewsblog: Justice: 5 things that could make the 2012 season a successful one for the Astros (28 - 10:17pm, Feb 09)Last: bbc is prejudice bout menNewsblog: Fangraphs: Cameron: The 10 Worst Transactions Of The Winter (82 - 10:10pm, Feb 09)Last: Cooper NielsonNewsblog: MLB: Hall of Fame worthy? Furthest thing from Schilling's mind (11 - 10:00pm, Feb 09)Last: SquashNewsblog: Orioles Scouts Banned from Korea (2 - 9:31pm, Feb 09)Last: Voros McCracken, Human Shield
|
|
Reader Comments and Retorts
Go to end of page
Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.
OPS
CHONE: 93-69
ZiPS: 90-72
Oliver: 86-76
Marcel: 81-81
Fantistics: 79-83
PECOTA: 75-87
Sporting News: 73-89
Steamer: 70-92
2008: 43-119
ERA
ZiPS: 100-62
CAIRO: 95-67
CHONE: 91-71
Steamer: 90-72
PECOTA: 88-74
Oliver: 82-80
Marcel: 81-81
Fantistics: 80-82
Sporting News: 73-89
The pitching results are kinda interesting. ZIPS doesn't seem to have crushed the other systems in the component stats, but won hands down things in which hits allowed (WHIP) and defense (hit part of whip and ERA) are included.
Re: #1, I am interested in figuring out what ZiPS does beyond component stats. Seems like it must have an edge on the other systems there. It also manages to do well on each of BB/9, HR/9 and K/9 and seems to them combine them in the right way.
It's not exactly new to see a complex system perform better when operated by the developer.
Yeah - I don't know if it changed before he left, but I recall Nate Silver saying in an interview that it was more or less a bunch of delicately tied together spreadsheets and macros.
Back in the day (1999-2001) I did pretty well at the projection game and most of the time it was because I had a good system, knew where the holes were and when a player exploited one of the holes I made an adjustment (EG, the system didn't handle a succession of four or five 180 at bat seasons well). So it wasn't a case of changing projections you thought were wrong, it was a case of changing projections for guys who had a profile the system didn't handle very well. Early on there was a player like Darrin Fletcher that would cause significant adjustments to the system, but after a while I felt like I had the general idea down fairly well.
And Dan it warms my heart to see you using the old '$H' designation. I have a ton of projection sheets lying around with a bunch of '$BB' '$HR' '$SO' '$H'. I don't know why I chose the dollar sign to indicate the I was using one of my funky rate stats, but it's nice to see it thrown out there from time to time. :)
Heh...so in other words, he was training on his test data. Always THE big issue with signal modeling --- how to keep yourself from adjusting your algorithm so that you do a "better" job of fitting your test set. Just as bad as incorporating the test data into the training procedure. Would wager Nate had to make some changes to fit in his last year, and the input conditions are no longer true. I would speculate that unless BP plays with the algorithm, it will probably have some late career "renaissance" season when the planets and stars align and the input conditions match once again....
The fact that Dan hasn't felt a need to fiddle with ZiPS in a long time makes me happy.
For some reason, people on the internet always think I'm about 10 years older than I am. I preferred being confused for 30 better than actually being 30, though.
I just wish I actually sounded 30 - I have this nasally, highish, young-sounding voice. I should take up smoking or tuberculosis or something.
Did the Ira Glass comparisons put you off doing more audio Transaction Oracles? Because 1. Ira Glass is awesome, so I don't see the problem and 2. I thought it was pretty well-done.
I like your voice, Dan - it's a good fit for a podcast type format. (And, yes, a bit like Ira Glass.)
Right now there could, in theory, be a dozen things wrong with PECOTA. Maybe it was 100 problems at one time, and they were able to fix 99 of them, but in doing that they caused another 11 problems. I'm sure Nate would have been able to fix 100 of them without creating more problems, simply because he knew it better than anyone.
Maybe it's the slight Bal'mer accent.
To actually do something in audio, I can't really do it by myself. If it's just me talking, I have a tendency to ramble on. I'm an odd enough speaker that people that occasionally use the term Dan-Sequitur.
Maybe it's the slight Bal'mer accent.
Did you hear that? I try not to have any noticeable accent. I do have a problem with certain words - just still comes out "jest" (I had a bad one on the last PP bit) and the first syllable of horrible still rhymes with car.
You must be Registered and Logged In to post comments.
<< Back to main