Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Transaction Oracle > Discussion
Transaction Oracle
— A Timely Look at Transactions as They Happen

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

Page 1 of 1 pages
   1. Der Komminsk-sar Posted: September 10, 2006 at 01:55 AM (#2172460)
Very cool - thanks for this and the MLEs, Dan.
   2. Dan Szymborski Posted: September 10, 2006 at 02:02 AM (#2172462)
Just noticed - the "3 year" is the same as the "2006" - I'll fix that as soon as I'm back at that computer.
   3. bigseries Posted: September 10, 2006 at 03:01 AM (#2172489)
These factors are all self-contained within each league, right?
   4. Dan Szymborski Posted: September 10, 2006 at 03:29 PM (#2172606)
These factors are all self-contained within each league, right?

Yes.
   5. catomi01 Posted: September 10, 2006 at 09:19 PM (#2172932)
There may be very few people who care about this other than me....but is there anywhere to find independent league park factors? i'd specifically be interested in the atlantic league if they're available.
   6. AROM Posted: September 12, 2006 at 11:58 PM (#2174998)
The strikeout factor for Rancho Cucamonga is .51 for multiyear. Can that be right? Its 1.05 for 2006.
   7. Dan Szymborski Posted: September 13, 2006 at 02:54 AM (#2175224)
that doesn't seem right!
   8. Dan Szymborski Posted: September 13, 2006 at 11:50 AM (#2175445)
Rancho fixed.
   9. Der Komminsk-sar Posted: September 14, 2006 at 04:34 PM (#2176892)
catomi01, I care about indy PFs. And, no, I don't know where they are.
***
Has anyone done work to try to account for imbalanced scheduling with PFs? It could be a fair amount of work and often wouldn't lead to huge gains but it's worth pursuing. For example, I suspect that PGE Park's (Portland OR) always low PF are in part because a disproportionate number of their road games are in places like Colorado Springs and Salt Lake.
***
Why no triples, Dan? I'd guess because the data is so noisy and levels are low enough that they don't hugely impact the game, but parks do influence them more than other types of hits. (The follow up would then be do you integrate them with work on doubles, but first things first.)
   10. AROM Posted: September 14, 2006 at 04:36 PM (#2176893)
I use the doubles park factor for triples as well. Triples are too rare an even and theres a lot of noise in the data.
   11. JPWF13 Posted: September 14, 2006 at 04:38 PM (#2176897)
Dumb question time- are these raw park factors or are they adjustment factors?
   12. Dan Szymborski Posted: September 14, 2006 at 05:32 PM (#2176947)
Raw.
   13. Mike Emeigh Posted: September 14, 2006 at 05:37 PM (#2176952)
I use the doubles park factor for triples as well.


I think it would be better to combine doubles and triples to create an EBH factor, then apply it to doubles and triples. It probably doesn't make a whole lot of difference except in the odd case.

-- MWE
   14. Mike Emeigh Posted: September 14, 2006 at 06:08 PM (#2176987)
Has anyone done work to try to account for imbalanced scheduling with PFs? It could be a fair amount of work and often wouldn't lead to huge gains but it's worth pursuing.


The problem you run into here is that, in addition to imbalanced schedules, you also have to deal with imbalances in talent due to when teams play. If a team plays all of its games against an opponent at home in the first half, and on the road in the second half, the quality of competition on both sides of the ball is likely to be markedly different due to personnel shuffles.

-- MWE
   15. Der Komminsk-sar Posted: September 14, 2006 at 06:28 PM (#2177014)
I think it would be better to combine doubles and triples to create an EBH factor, then apply it to doubles and triples. It probably doesn't make a whole lot of difference except in the odd case.
I used to do separate EBH, 3b, and 2b factors - estimating the # of XBH, then triples, then calculating doubles as XBH-3B. This, in retrospect, may have been overkill.

The problem you run into here is that, in addition to imbalanced schedules, you also have to
deal with imbalances in talent due to when teams play.

True - I'm just not sure that that's enough reason for someone not to do it. (Not meaning Dan - Lord knows he's working on enough other things.) Multiyear factors, for example, help with roster churn - they don't help with imbalanced scheduling.
If I've time today, I'll crank out a quick-n'-dirty estimate of how this issue affects Portland-PCL stats, since I used them as an example before.
   16. Mike Emeigh Posted: September 14, 2006 at 06:54 PM (#2177052)
Multiyear factors, for example, help with roster churn - they don't help with imbalanced scheduling.


Depends on how much the scheduling turns over year-to-year, I guess - when you play your opponents. If the intradivisional games and interdivisional games are always played in roughly the same rotation, it might not help all that much.

I'm not sure that roster churn is all that big an issue any more, though. Except for AAA, teams are showing more of a tendency to leave players at the same level virtually all season; where roster churn is happening, it's often with organizational soldiers who are moved around the organization to whichever team needs a fill-in for a couple of weeks.

-- MWE
   17. AROM Posted: September 14, 2006 at 07:00 PM (#2177057)
I think it would be better to combine doubles and triples to create an EBH factor, then apply it to doubles and triples. It probably doesn't make a whole lot of difference except in the odd case.

That would be ideal, but triples data are available only for offense on the minorleaguesplits site.
   18. Dan Szymborski Posted: September 14, 2006 at 07:17 PM (#2177085)
Out of the 3253 hitters that got an at-bat in the minors this year, only 15 were in the double digits in triples.

The park factors are going to be so inaccurate with so few triples that we're not going to have enough confidence in them to take anything more than a small park factor seriously. And a small park factor for triples is going to have zero effect on the translates triples for almost every player.

Translated, I only have 2 players total that get 10 triples in the majors (Brooks Conrad and Eugenio Velez). Just to add 2 triples after rounding to these guys, you'd have to be confident of a real triple park factor of 1.30 or greater.

I shouldn't even bother applying the doubles park factor either. Of the 142 parks I've done, only 2 applications of double park factor would increase or decrease a triples total by more than 1 triple - if Conrad or Velez had played at Batavia.

Just for fun, I decided to examine the MLE difference between my MLE triples totals for minor leaguers and my MLE triples totals if I had used doubles park factor for triples. It would have changed 9 players by 1 triple.
   19. Dan Szymborski Posted: September 14, 2006 at 07:25 PM (#2177096)

Just for fun, I decided to examine the MLE difference between my MLE triples totals for minor leaguers and my MLE triples totals if I had used doubles park factor for triples.


That shouldn't read, my MLE triples totals if I didn't use doubles factor for triples.
   20. AROM Posted: September 14, 2006 at 07:35 PM (#2177110)
Brooks Conrad has an interesting stat line - bad OBP but good power and a ton of XBH. Can he do anything with the glove?
   21. Der Komminsk-sar Posted: September 14, 2006 at 08:23 PM (#2177160)
Cliffs notes version: add 2% to PFs from the West, subtract it from the East.

The following has a lot of flaws/shortcuts (I said quick and dirty) and I don't trust my math today (under the weather) - so pick it apart more than usual.

***

The PCL plays a 144 game schedule, with 112 of those games coming against conference opponents (7 other teams, played 16 times - 8 home, 8 away). Of the remaining 32, half at home versus one division in the other conference (4*4), the other half on the road versus the other division.

In the following, I'm going to ignore the impact of alternating divisions (as well as time of year when they are played, as well as rainouts, etc...) and reduce the analysis to intra- versus inter-conference games. Also, I'm going to use reindexed three year run factors (reindexed: average of all team PF = 100 - needed primarily due to franchise movement - impact is about a 0.3 reduction per team).

Before the discourse, here's some numbers:

<u>Reindexed, but Schedule Unadjusted PCL Park Factors</u>
American Conference 0.984

North Division 0.983
Iowa 1.008
Nashville 0.989
Omaha 0.982
Memphis 0.953

South Division 0.986
Albuquerque 1.151
Round Rock 0.961
Oklahoma 0.921
New Orleans 0.909

Pacific Conference 1.016

North Division 1.013
Salt Lake 1.114
Colorado Springs 1.058
Portland 0.951
Tacoma 0.928

South Division 1.019
Tucson 1.084
Las Vegas 1.078
Fresno 0.976
Sacramento 0.937

No shock that more runs are scored in the west (Pacific) than in the east (American) - after all, many of the eastern teams were part of the now-defunct American Association (hence the goofy conference names) which featured scoring levels more in line with the then-IL than the then-PCL.

Luckily, the division thing doesn't probably mean too much on a year-to-year basis (actual difference between division is likely more than it appears here because of shortcuts, but ... whatever). So, the average reindexed PF in the west (Pacific) is 3 points higher than in the east (American) - what does this mean? Only two of every nine (well, 32 of 144) games is played outside the conference - with a balanced schedule (I'm going to say half of the 144 games, even though it would really mean 8/15 of the schedule, since you can't play yourself), this confers a "true" conference difference of about 7 points (1+(72/32*(1.016-1))=1.036 for the Pacific, 1+(72/32*(.984-1))=.964 for the American).

Here, then, would be the reconfigured park factors, where the west is adjusted upwards about 2% (1.036/1.016) and the east downgraded accordingly:

<u>Schedule Adjusted PCL Park Factors</u>
American Conference 0.964

North Division 0.963
Iowa 0.987
Nashville 0.969
Omaha 0.962
Memphis 0.934

South Division 0.966
Albuquerque 1.128
Round Rock 0.941
Oklahoma 0.902
New Orleans 0.891

Pacific Conference 1.036

North Division 1.033
Salt Lake 1.136
Colorado Springs 1.079
Portland 0.970
Tacoma 0.947

South Division 1.039
Tucson 1.105
Las Vegas 1.099
Fresno 0.995
Sacramento 0.956

***

Left unexamined is the DH factor (DH is used unless both teams are from the NL, thus NL affiliates have deflated hitter PFs and vice versa at upper levels).
   22. Der Komminsk-sar Posted: September 14, 2006 at 08:34 PM (#2177170)
First off, triples are way less important (I think) than other issues - they're just interesting. That said:

Translated, I only have 2 players total that get 10 triples in the majors (Brooks Conrad and Eugenio Velez). Just to add 2 triples after rounding to these guys, you'd have to be confident of a real triple park factor of 1.30 or greater.
Well, the things about triples is that we care about the big league factors (which we have more information about) and we care about identifying minor league parks that produce a ton of them (of course, you then run into a chicken and egg question as you only care about this category when looking at a player who triples a lot), so we know to downgrade those players, but we don't care so much finding minor league parks that depress triples (most MLE systems deflate the heck out of these totals anyway - plus, a triple PF of zero would do no more than double our estimate).

I shouldn't even bother applying the doubles park factor either. Of the 142 parks I've done, only 2 applications of double park factor would increase or decrease a triples total by more than 1 triple - if Conrad or Velez had played at Batavia.
Sure - doubles factors are rarely huge.

The most important thing is recognizing that a park surpressed triple is likely a double and so forth - as was discussed above.
   23. Der Komminsk-sar Posted: September 14, 2006 at 08:42 PM (#2177174)
In Diamond Mind parlance, DMB and ZiPS gave Brooks an Average (while other sources I've seen suggest something closer to Fair). He's relatively surehanded - don't know his rep on the pivot.

***

Depends on how much the scheduling turns over year-to-year, I guess - when you play your opponents. If the intradivisional games and interdivisional games are always played in roughly the same rotation, it might not help all that much.
Ah, that is what you meant. Yeah - I'm a little worried about that as I suspect there's an effect there. That said, you can't control for everything, no matter how hard I might argue that people should.
If I still had my own SAS license and more free time, I'd love to run some regressions to control for all this stuff, but alas...
   24. Dan Szymborski Posted: September 14, 2006 at 09:11 PM (#2177194)
Well, the things about triples is that we care about the big league factors (which we have more information about) and we care about identifying minor league parks that produce a ton of them

Actually, I don't do triples factors for the majors, either! :)
   25. Der Komminsk-sar Posted: September 14, 2006 at 09:28 PM (#2177210)
Actually, I don't do triples factors for the majors, either! :)
Well, I meant if you're gonna do 'em... :)

***

So I thought a little more about the sceduling and point in time issue and I'm going a different way with it. If schedules were perfectly static - then this wouldn't be something we'd want to control for at all, season would be an "attribute" of the city/park. It's only when the rotation changes that additional error is introduced to our estimate. Also, this is a concern with individual team PFs as well - for instance, I imagine that teams in the south are at home more often during cold months than more northerly squads.
   26. William K. Posted: September 15, 2006 at 11:21 AM (#2177527)
Tri-City (NW) 0.85 0.97 1.06 0.38 1.05 1.30
Tucson 1.17 0.97 1.06 0.43 1.04 1.28
Tulsa 0.95 0.97 1.05 0.48 1.04 1.25
Vancouver 0.89 0.98 1.18 0.46 1.01 0.78


The homer factor for these parks don't look right....
   27. Dan Szymborski Posted: September 15, 2006 at 12:29 PM (#2177571)
William, good catch - the formulae for 2 of those cells are wrong (Tri-City and Vancouver are correct). The raw HR factor for Tucson is 0.90 and 1.03 for Tulsa. I'll have that updated in the files next couple of hours.
   28. o_dawg Posted: September 15, 2006 at 04:30 PM (#2177765)
Vancouver is a large park with high walls. Now that it's short-season A instead of AAA, I'm surprised anybody ever hits one out there. When I used to go to AAA Canadians games, it was almost always a shock to see a HR hit.
   29. AROM Posted: September 16, 2006 at 06:02 PM (#2178912)
Colorado Springs SO factor of .39 can't be right.
   30. Dan Szymborski Posted: September 16, 2006 at 10:27 PM (#2179107)
Another good catch.

One thing I have to do is get a beta group for these spreadsheets.
   31. AROM Posted: September 17, 2006 at 01:56 PM (#2179382)
I'd just sort it and double check the extremes. That's how I found Colorado Springs. You might miss a 1.04 somewhere that's incorrect, but noone will ever notice or care.
   32. So Taguchi is My Sensei Posted: January 16, 2007 at 04:41 AM (#2280987)
where is johnson city?
   33. Orange & Blue Velvet Posted: January 16, 2007 at 04:57 AM (#2280994)
So Norfolk played as the second toughest park for homers and the third toughest park for doubles in all the minors? I wonder what that does to John Maine's numbers.
   34. Der Komminsk-sar Posted: January 21, 2007 at 01:59 PM (#2283823)
Johnson City is in eastern Tenessee, in the Appalachian League.
   35. 2ndedition Posted: January 21, 2007 at 02:33 PM (#2283829)
Sorry to be so late with this post - aooears that I missed this the first time around,but I'm not sure what to think about BB/SO park factors. Is it really a funxtion of park or just a random variance that depends on the talent a given team decides to sign? While the numbers may vary, I fail to see how a park could actually be a determining factor. Has this been regressed against other more likely causes?
   36. Dan Szymborski Posted: January 21, 2007 at 02:43 PM (#2283833)
I fail to see how a park could actually be a determining factor.

Vision. Configuration of field, lighting, and color scheme all have a very large effect on what the hitter sees.

Has this been regressed against other more likely causes?

From what I've seen, it's real, though it's a smaller effect than on other types of events.
   37. Chris C. Posted: February 05, 2007 at 05:30 PM (#2292169)
Weather factors are also important - parks vary in typical altitude, temperature and humidity. Those variables can affect pitch movement, which subsequently affects strikeout and walk rates.
   38. puck Posted: July 18, 2009 at 10:02 PM (#3258607)
I guess the 2003-2005 park factors for Corpus Christi and the pf's above for Round Rock help explain a little about Jason Hirsh.
Page 1 of 1 pages

You must be Registered and Logged In to post comments.

 

 

<< Back to main

Support BBTF

donate

Thanks to
The Piehole of David Wells, Red Sox Colostomy Bag
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Syndicate

Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets.

Demarini, Easton and TPX Baseball Bats

 

 

 

AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets.

Page rendered in 0.7332 seconds
38 querie(s) executed