Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Primate Studies > Discussion
Primate Studies
— Where BTF's Members Investigate the Grand Old Game

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

Page 1 of 1 pages
   1. Robinson Cano Plate Like Home Posted: February 29, 2004 at 03:10 AM (#614737)
Fantastic. Do you have a breakdown for just the knuckleballers (sample size notwithstanding)?
   2. studes Posted: February 29, 2004 at 03:10 AM (#614741)
Awesome, MGL. This is the best work yet on DIPS. I love how the research keeps moving forward. Great job.

Ted, I think this is a big next step over Tippett's work.

I may come back with questions after I study it a bit more.
   3. John Posted: February 29, 2004 at 03:10 AM (#614742)
Fantastic. MGL definitely has the early lead in the race for the 2004 Ocsar...er, Primey...nominations.
   4. Human Papelbon Virus Posted: February 29, 2004 at 03:10 AM (#614743)
D from D raises a good point. More info is needed about the two groups before we can say the only difference between the two are y-t-y team
   5. Mike Emeigh Posted: February 29, 2004 at 03:10 AM (#614745)
Ted, I think this is a big next step over Tippett's work.

You still have the problem of selective sampling and correlation across a narrow range of performance variation. Pitchers who pitch enough to get 300 BIP in successive seasons may very well be the ones who have the most control over the results from balls in play, and pitchers who have the least control over the results probably won't hang around enough to get to 300 BIP in successive seasons. (300 BIP is usually around 70-80 IP).

The safest conclusion to be drawn is not that pitchers have *no* effect over the results from BIP - but that among the set of major league pitchers the differences are small enough so that we can treat the pitcher's impact as constant.

-- MWE
   6. studes Posted: February 29, 2004 at 03:10 AM (#614746)
A couple of reactions:

Mike, I'm not sure what you mean. I think MGL is uncovering the correlations that do exist between pitchers. He's finding the real trends under the noise, despite the sample size issues.

MGL, the more I think about it, the more I'm disturbed by the correlations in your last table. Don't large correlations like that undermine the validity of the data?
   7. David Concepcion de la Desviacion Estandar (Dan R) Posted: March 01, 2004 at 03:10 AM (#614749)
A good portion of Zito's consistently low BABIPs allowed can be traced to Oakland's superior defense and his pitchers' park, no?
   8. MGL Posted: March 01, 2004 at 03:10 AM (#614751)
Thanks for the great feedback so far. I'm off to Florida (Spring Training) for a few days, so I'll be out of touch until I get back. AED, the people who score these games tend to call flares, duck snorts, Texas Leaguers, etc., "pop flies to the OF," so yes, they are not caught as often as fly balls. It is just semantics...
   9. ChuckO Posted: March 01, 2004 at 03:10 AM (#614752)
I live in Atlanta and watched Greg Maddux pitch for quite a while. Maddux always said that he tries to get the batter to hit the ball on the ground back through the middle, between the shortstop and the second baseman. That way he figured he or the shortstop or the second baseman would get to a lot of them. It would be interesting to compare his success in this respect with similar pitchers, i.e., groundball control pitchers, especially mediocre pitchers. Is data available without charge to make such an analysis?

Anyway, the conclusions in this article certainly make intuitive sense. As many pitchers have pointed out, pitching is all about disrupting the batter's timing. If you're successful in doing that, it would seem that you'd be more likely to get weakly hit balls, and fewer homeruns as well.
   10. Chris Dial Posted: March 01, 2004 at 03:10 AM (#614754)
Problem with assigning scorer bias is that groundballs also jump. Regardless of how well scorers (and each data point MGL has is scored by more than one person) assess FB/LDs, there's almost no way to screw up a GB.

In fact, good pitchers probably tend to give up fewer and softer line drives and easier pop flies than do poorer pitchers.

That sounds really familiar...pretty much my adjustment for pitching staff quality on ZR about 6 years ago - and one of the big reasons I don't subscribe to "Andruw Jones is a god."

It's some nice work, MGL.
   11. bunyon Posted: March 01, 2004 at 03:10 AM (#614755)
I'm probably missing something, too. Why is it there would be little correlation, y-t-y, if everyone had the same skill.

Let's say I'm a pitcher with a true skill for $H of .333. Let's further say that that is a true skill and it is very repeatable. Why would it matter if other pitchers also had a true, repeatable skill for $H of .333? Wouldn't the correlation then be very strong for everyone?

   12. studes Posted: March 01, 2004 at 03:10 AM (#614756)
Well, good grief, guys. There's no reason to act like snobs. You want every thought and article to be completely original? Great ideas and analyses bear repeating, over and over. That's how they become accepted and used.

For instance, did you see that UZR was quoted in the New York Times today in a Mike Cameron article? No reference to primer, alas.

http://www.nytimes.com/2004/03/01/sports/baseball/01METS.html
   13. Dan 'The Boy' Werr Posted: March 01, 2004 at 03:10 AM (#614757)
I guess I'm missing something big. If the pitchers have more control over various components of BABIP, where does it go? I.e., what causes this control to drop out of the overall y-t-y correlation?

Ben, I'm not sure what you mean. The pitchers have better correlations on some components than overall, but they also have worse correlations on some components than overall. For the pitchers who switched teams, there are even two slightly negative correlations. In both sets, ground balls, the second most common type of BIP, have lower correlations than the overall correlations.

I assume the correlations balance out at around the overall BIP correlations. Right?

MGL, great work.
   14. Dan 'The Boy' Werr Posted: March 01, 2004 at 03:10 AM (#614759)
Let's say I'm a pitcher with a true skill for $H of .333. Let's further say that that is a true skill and it is very repeatable. Why would it matter if other pitchers also had a true, repeatable skill for $H of .333? Wouldn't the correlation then be very strong for everyone?

Bunyon,

Over a sample as short as a year, even if every pitcher has a true skill of .333, there's going to be some variation around that, and it will occur more or less randomly for everybody. So you might see results like this:
              Year 1  Year 2  Year 3
PITCHER A      .338    .333    .341
PITCHER B      .329    .334    .337
PITCHER C      .327    .330    .324
PITCHER D      .330    .339    .325

The overall correlation from year n to year n+1 in that example is -.115. With a real sample and with more pitchers and years, you'd get a better result.
   15. Chris Dial Posted: March 01, 2004 at 03:10 AM (#614760)
Re: pop flies
Pop flies are defined as "fly balls that don't travel 220 feet" in STATS scoring speak.

Duck snorts are problematic, as they will be interchangeably defined as popf flies and line drives. There generally aren't very many, as a percentage of any one pitcher's BIP.

Studes, do you think the article you cite *should* mention Primer?
   16. studes Posted: March 01, 2004 at 03:10 AM (#614761)
David, sorry if I misinterpreted your comment.

Chris, I don't think the Times should reference primer. After all, MGL has basically made his work public, with no copyright protection that I've seen. But it would be nice, particularly for readers who might want to learn more.
   17. Chris Dial Posted: March 01, 2004 at 03:10 AM (#614763)
Studes,
*everything* written down is copyrighted. It's implied (or actually, it's law). Everything that you write is copyrighted by you nowadays.

The answer is, yes, the writer should reference where he saw it. He doesn't have to say "MGL", he can link or saw "at Baseball Primer" or whatever - it helps his reader for starters.
   18. Mike Emeigh Posted: March 01, 2004 at 03:10 AM (#614765)
Mike, I'm not sure what you mean. I think MGL is uncovering the correlations that do exist between pitchers. He's finding the real trends under the noise, despite the sample size issues.

Suppose that pitchers really do have an ability to control the percentage of hits on balls in play - IOW, there is a level of "true talent" $H. Assume for the moment that, in order to pitch at all in the majors, you have to have a "true talent" $H of no more than .340. Over 300 BIP (which as I indicated is usually between 70-80 IP), this would mean that you'd allow 102 hits in 70-80 IP, exclusive of HR - somewhere around 11-13 non-HR hits per nine innings, which is about as many as a team can likely tolerate. To pitch well enough to make it through two consecutive seasons of 300+ BIP, you'd have to do better than that, in most if not all cases - a .300 level would be 90 non-HR hits over those 70-80 innings, which your team might be able to live with. The best pitchers usually average somewhere around .270 $H.

Now, if you select pitchers with 300+ BIP in back-to-back seasons, what you are effectively doing is selecting pitchers from the range .270-.300, rather than the .270-.340 range that is typical of all major league pitchers - and whether you realize it or not, you're removing a large group of pitchers from the study who operate at the margins of major league performance. Those pitchers might very well exhibit different effects from the ones that you are studying, because they aren't as good as the ones that you are studying, and it's conceivable that the reason they aren't as good is that they don't control H/BIP as well, for reasons which won't show up in a study limited to just good pitchers.

I know MGL understands all of this, so this comment wasn't really directed at him. The point I wanted to make is that correlation analysis needs to be taken with a grain of salt when applied across a restricted range of performance, especially when attempting to measure an aspect of skill that could - if it existed - directly impact that performance.

-- MWE
   19. studes Posted: March 01, 2004 at 03:10 AM (#614767)
Mike, I understand that. It's been discussed thoroughly.

But what about MGL's conclusions? Specifically:

To summarize the implications of the above chart, even though the y-t-y $H correlation of a 500 BIP pitcher is very small, a pitcher may have a fair amount of control over certain components of those BIP.

Seems to me that MGL has uncovered some interesting findings even within his "restricted range of performance". Do you see it differently?

Chris, you were testing me, right? Thanks for the info. I just might send the guy an e-mail.
   20. tangotiger Posted: March 01, 2004 at 03:10 AM (#614769)
If we remember the results of the Allen/Hsu "Solving DIPS" paper, which was based on pitchers with at least 200 BIP (I think), the implication there is that the true talent level of pitching preventing hits on BIP is 1 SD = .009 hits / BIP. That works out to +/- 10 runs for 95% of the pitchers.

The question is: how can we detect this, and how much of this can we detect.

Alot of this we just won't be able to detect.

It's apparent that it's alot easier to detect this based on the rate of Line Drives given up.

   21. Mike Emeigh Posted: March 01, 2004 at 03:10 AM (#614770)
Seems to me that MGL has uncovered some interesting findings even within his "restricted range of performance". Do you see it differently?

Is the correlation effect on hits on OF LDs for good pitchers applicable across the range of *all* pitchers? A correlation in a subgroup is not necessarily indicative of a correlation across the entire group, especially if the distribution of the performance being measured is heteroskedastic; there may easily be a subgroup in which a significant correlation is detected that can't be detected in the population at large, because the variance of the statistic is conditioned on the performance level.

That doesn't mean that the detected correlation is not still valuable; as Chris Dial suggests, the ability to generate *catchable* line drives might be a discriminant between good pitchers and bad pitchers. But there's a lot more investigation to do across the entire range of pitching performance before one can be assured that's the correct conclusion to draw.

-- MWE
   22. Human Papelbon Virus Posted: March 01, 2004 at 03:10 AM (#614771)
What if you took the methods behind UZR and applied it to pitchers?

For example: say Greg Maddux allowed 100 ground balls into zone 5 last yr, with 30 going for hits and 70 being turned into outs. Suppose the average GB-out rate for this zone was 60%. Maddux would have been expected to give up 40 hits on ground balls to zone 5. Then from ZR, we can see that between Castilla and Furcal, they saved Maddux 8 hits on ground balls to zone 5. Subtracting out the effects of the defense, we might see that Maddux allowed 2 less hits than expected on ground balls to zone 5. Do this over a number of years, and maybe we can observe a trend. Maybe maddux is able to prevent an average of 5 hits per year on ground balls to zone 5, independent of defense. If he can, then maybe we have quantified his skill at getting right handed batters to pull the ball weakly to the third baseman.

I'm just throwing this out there off the top of my head. Does this seem justifiable?
   23. Michael Humphreys Posted: March 01, 2004 at 03:10 AM (#614773)
MGL,

Thanks for the great article. Although I'm undoubtedly biased, it seems to support DRA:

"The number of IF pop flies and to some extent OF pop flies, as a percentage of all non-GB BIP, are somewhat a unique function of the pitcher as well. In other words, good pitchers may tend to get more pop files than bad pitchers, as a percentage of their total non-ground ball balls in play."

As you may recall, DRA allocates estimated infield pop-outs to fielders, and I speculated in the article that pop-ups might be the key variable (when zone data is unavailable) for determining a pitcher's effect on BABIP.

If I'm understanding your article correctly:

1) Infield pop-ups have the highest out-conversion rate of any category of BIP: 96%. As mentioned in the DRA article, it a pitcher generates a BIP that is *never* fielded (a HR), we charge the pitcher. If a pitcher generates a BIP that is almost *always* fielded (in infield pop-up), shouldn't the pitcher get credit?

2) With the exception of the overall tendency to give up either ground balls or outfield fly balls, the tendency to give up infield pop-ups is more persistent than the tendency to give up other forms of BIP. So generating infield pop-ups seems to be, to some degree at least, a skill. Query whether, given the relative rarity of infield pop-ups (at least compared with the category of all ground balls and all outfield fly balls), using larger samples of data (two-year v. two-year?) would show a higher "r" for infield pop-up generation.

The most surprising result seems to be that the rate of giving up line-drives is not that persistent (.14 "r", at least for pitchers who change teams, which I believe is the best data set to use), but *given* the allowance a line-drive, differing out-conversion rates *are* persistent. Perhaps AED's is right:

"If there were systematic differences between what is classified as a line drive, you would expect to see significantly lower r's for the players who changed teams. So it seems more likely that you're getting hammered by sample size or selective sampling . . . ."

On the other hand, it might be interesting to determine whether there is a *cross*-correlation between generating infield pop-ups and allowing line drives that are less likely to be hits. Throughout the development of DRA, I kept finding that the run-weight for infield fly outs was a little higher than it "should" be (compared to other BIP outs), but that was probably because infield fly outs correlate with *outfield* fly outs (which have higher out conversion than ground balls) and may correlate with "easier to field" line drives.

Thanks again for a very helpful study.
   24. bunyon Posted: March 01, 2004 at 03:10 AM (#614774)
Thanks, Dan. That seems clear no that you've said it.
   25. Michael Humphreys Posted: March 01, 2004 at 03:10 AM (#614775)
Sorry, meant to say:

"As you may recall, DRA allocates estimated infield pop-outs to *pitchers*."
   26. Human Papelbon Virus Posted: March 02, 2004 at 03:10 AM (#614782)
If that's right then the two intervals don't look that different to me, there's lots of overlap. So I would argue that taking park and defense out of the equation just gets us fewer data points.

But I'm a complete novice when it comes to using these types of statistics, so sorry if this point is completely wrong.


No such thing as standard deviation or confidence intervals for correlations.
   27. Too Much Coffee Man Posted: March 02, 2004 at 03:10 AM (#614784)
Sure, you can calculate a confidence interval for a correlation. The r must be converted to a Fisher's Z', with the standard error of Z' equal to 1/(square-root of N-3).
   28. Human Papelbon Virus Posted: March 02, 2004 at 03:11 AM (#614785)
No, this is not regression. r is just the correlation. we don't have the regression equation or the SEE. only then could we compute confidence intervals.
   29. Too Much Coffee Man Posted: March 02, 2004 at 03:11 AM (#614786)
I have a couple of questions, but I'll start with the less important ones. First, an observation:
In thinking about this:
"Even if something is indeed a skill, if there is little or no spread in true talent with respect to that skill, in the population from which we are sampling (major leagues pitchers), then the y-t-y correlation for any sample size will be close to zero."
This is a pretty important point. My first reaction was that you'd have to argue that the variance of this skill is very close to 0 for this to matter. Then, I realized that this is actualy very close to the current thinking about major league pitching (that most have no influence on batting average on balls in play). So, if you believe this, then the expected y-t-y correlation is zero. If you don't believe it, the expected correlation would be some value other than zero.

There's a few points in here I didn't understand. One is this:
"In fact, the size of the correlation coefficient is a direct function of the spread of talent in the population and the size of the samples in each data element. If there is any skill whatsoever related to the performance measure we are sampling and there is any spread of true talent in the population with regard to that skill, then as the sample size get larger, the correlation always approaches 1. The converse of that is also true. Regardless of how much skill and what the spread of talent is, as the sample size gets smaller, the correlation always approaches zero."
Why should the correlations be expected to approach 0 or 1.00 as the sample sizes get larger? Shouldn't the approach the "true" population value? This statement seems to assume that there are either perfectly linear or oblique relationships and nothing else.
   30. Too Much Coffee Man Posted: March 02, 2004 at 03:11 AM (#614787)
I don't want to get into a debate about this because it's off-track.
But correlation and (simple) regression are the same thing. In simple linear regression, the beta weight for standardized scores is also the correlation.
See the home page for a site that calculates confidence intervals.
   31. Human Papelbon Virus Posted: March 02, 2004 at 03:11 AM (#614791)
Thanks for correcting me TMCM. Sorry for the misinformation.
   32. Michael Humphreys Posted: March 02, 2004 at 03:11 AM (#614792)
Tom N,

"Even if the pitcher has a specific skill at generating various types of near certain outs that doesn't mean we necessarily would credit the pitcher. Imagine if we had even more granular data to work with that included not just the zone, but the precise speed of all BIP. Imagine further that Carlos Zambrano has a skill at inducing slow hit groundballs to the left side of the infield. His sinker is fast enough so that those pitchs usually aren't pulled directly down the line. So if 99% balls hit in the SS's zone hit between 55-80 MPH are converted into outs should the pitcher get credit?"

That is precisely what would happen under a UZR system that also provided a P[itcher] Zone Rating. David Pinto's Probabilistic Model for Range does something similar.

"If inducing pop flies of either variety is a skill, we should take that ability into account when evaluating pitchers, much the same way we take GB/FB ratio into account. That doesn't mean pitchers are contributing to pop flies being converted into outs. The H% r itself seems to suggest that it's the infielders doing the work. Just because OF fly balls are almost always converted into outs (86.5%) doesn't mean that outfielders shouldn't get credit for their defensive contribution."

Simplifying somewhat, under a PZR/UZR system, the pitcher would get almost all of the credit for the out; the fielder only a tiny amount. DRA is a non-zone system that allocates responsibility for BIP to fielders, except that it allocates responsibility for infield fly outs to pitchers. Doing so results in an allocation of pitcher and fielder credit for BIP that comes close to matching the ratio found under the Allen/Hsu "Solving DIPS" paper, as well as fielder ratings that match up well with UZR.
   33. Human Papelbon Virus Posted: March 02, 2004 at 03:11 AM (#614794)
(.050, .222) for the full data set, and (-.154, .224) for the pitchers that moved.

So I think taking park and defense out of the equation doesn't give you anything new. It just gives you wider confidence intervals.


Not to dwell on this point, but I was under the impression that the standard deviation reported was for the pitcher's $H, not for the correlation coefficient. If this is true, then this SD can't be used to create confidence intervals for the correlation coefficient, can it?
   34. Too Much Coffee Man Posted: March 02, 2004 at 03:11 AM (#614795)
This is, as I like to say, at the limits of my knowledge.
The sampling distribution of r is not normal, so you could not use the standard deviation to build a confidence interval.
To build the confidence intervals, you convert r to Z' and add/subtract a weight based on alpha times a multiplier (normally a standard error). I THINK it must be the case that Z' has a distribution that is determined by the sample size, hence the need to only know N to calculate the confidence intervals. But, I could be wrong.
   35. Jim (jimmuscomp) Posted: March 02, 2004 at 03:11 AM (#614798)
"I regressed all pitchers who had at least 300 BIP in back-to-back years from 1992 to 2003. There were 312 data pairs. Each data pair was independent (I regressed 1992 on 1993, 1994 on 1995, etc.)"


Can anyone explain why you wouldn't use overlapping data pairs (i.e. 1992 on 1993, 1993 on 1994, etc.)? I think this is what Tippett did. You would nearly double your sample size, and perhaps pick up on some weird every-other-year patterns (e.g. Saberhagen) that you would otherwise miss. Would this introduce some other selection bias?
   36. MGL Posted: March 04, 2004 at 03:11 AM (#614842)
Dick,

I would be happy to send you the data when I get back home. Anything I can do to help a fellow researcher...
   37. Mike Emeigh Posted: March 05, 2004 at 03:12 AM (#614856)
So cell C3 says that a pitcher has some control over whether a line drive becomes a hit or not, but then B3 says that he has little control over whether his BIPs become OF line drives in the first place. How can you control the second without first controlling the first?

Cell C3 indicates that there is a good y-t-y correlation for these pitchers in the rate at which outfield line drives become hits; it says very little about reasons why that might be true. It can be true, for example, if the rate at which outfield line drives are caught for outs varies little from team to team or pitcher to pitcher (IOW, if a high percentage of outfield line drives are inherently uncatchable). We need to keep open the possibility that there may be effects other than that of the pitcher which could lead to the correlations that we see. That's why range of performance - or, alternately, performance variance, which essentially tells you the same thing - across the data set is so important to know. If the entire range of performance within the data set varies from, say 26% of OF LD caught for outs to 28% of OF LD caught for outs, I'd have no problem concluding that pitchers have little influence over whether or not OF LD became outs, even if there were a strong y-t-y correlation within that group of selected pitchers.

-- MWE
   38. Noffs Posted: March 06, 2004 at 03:12 AM (#614860)
Not to make you do this again, MGL, but what happens if you change "Hit percentage" to "Percent reaching base"? I ask because the error rates associated with groundballs could make a grounder a lot more valuable than it appears.
   39. MGL Posted: March 07, 2004 at 03:12 AM (#614878)
Hoiles,

If we only look at pitchers who play for the same team in the regression "pairs," even if pitchers had no control over any of their BIP components, clearly we might (probably will) see some y-t-y correlations, due to good or bad defense, park effects, etc. Those are the correlations I am trying to "factor out."

Now, if we only look at pitchers who changed teams, we have no "systematic bias" because the assumption is that for a large sample, the average park effects and defense of the new team is league average. Therefore if pitchers had no control over any of the BIP components, the correlations for pitchers who changed teams would have to be zero, as there would be nothing "connecting" their BIP rates from one year to the next, which is essentially the definition of correlation (a "connection," loosely speaking).

If pitchers do have some control over some of their BIP components, then that control, as measured by the correlation coefficient, should survive the "changing of teams," since the variation in y-t-y rates as a result of the new defense and park is random (no bias).

As far as how introducing another variable (even if it isn't biased) effects the magnitude of the correlation coefficients, I don't know. I would have to defer to the statisticians for that answer. I could simulate the effect and come up with an answer I suppose. IOW, if the true y-t-y correlation for, say OF line drive hit percentage, were .200, and we then introduced some noise (changing teams and therefore changing defense and parks, which presumably affects the true OF line drive hit percentage independent of the pitcher, would the correlation change? I don't think so, but I am not sure.

In any case, changing of teams cannot create a correlation where none existed or even increase a correlation where one did exist, since where any given pitcher goes is presumably random, at least as far as his first year BIP rates. If there is no "connection" between his BIP rates in his first year, and where he goes, then there is by definition no bias and there can be no increase in the correlations.

As far as the ROE's on ground balls, there is probably not an extra bias associated with ROE's, so adding it into the mix shouldn't change anything, although since many more ROE's are on GB's to the left side of the infield, we might see some "extra" correlation (higher coefficient) based on a pitcher' handedness. Since we aren't really interested in that kind of "control," it is probably a good thing NOT to include ROE's in the ground balls...
   40. MGL Posted: March 08, 2004 at 03:12 AM (#614902)
Hoiles,

With all respect, I think that you are completely wrong, but not being a statistician, I would have to defer to someone like AED if he is still lurking.

If I get a chance, I'll do some "correlation sims" which should clear up the issue...
   41. Bobby Bonilla's Annuity (Matt) Posted: March 08, 2004 at 03:13 AM (#614909)
Each data pair was independent (I regressed 1992 on 1993, 1994 on 1995, etc.)

These data pairs are not independent, since it's still the same person pitching in 1992 and 1994. Let's say I'm taking your blood pressure once a year -- you would not say that your blood pressure in 1994 is independent of 1992. There are ways to adjust for this data structure, where you have repeated measurements on the same person, in linear regression. I'd have to pull out my school notes to give you the details, but basically you come up with some assumed correlation matrix that describes where correlations are in the data, then the regression procedure uses that information in its calculations. Probably only affects the standard errors of your calculations, though.
   42. MGL Posted: March 08, 2004 at 03:13 AM (#614921)
Now, how do you know that, say, I'm not a professional statistician myself, with a post-graduate degree in that field?

Come on, that's a silly question.

Could you point out what exactly was wrong in my arguments (other than a couple of typos)? I'll give one more analogy, and I would appreciate it if you would think about it instead of blindly saying it's wrong.

Honestly, I haven't had the time to go through your premise. On the other hand, I did take lots of time to think through my thesis when I wrote the article (several months ago).


As I said, I think you are wrong, but I'm not sure. I did say that I think you are wrong, not that I was sure you are wrong, didn't I? There is a big difference (from my perspective).

I am no statistician either but I do know a fair amount about regressions and correlations, and have done a fair amount of work in baseball analysis using those "tools."

However, I'm not sure I am capable of addressing your specific concerns, so I apologuice for just saying that "I think you are wrong," as that is not particualry helpful. As I said, I can probably write a simple sim or two that would tell us how changing parks (park effects and defense) affects a correlation, whether or not there is one in the first place.

I'll try and do that tonight....
   43. MGL Posted: March 09, 2004 at 03:13 AM (#614925)
Hoiles, you may be right, but I just don't have the time to pursue this right now....
Page 1 of 1 pages

You must be Registered and Logged In to post comments.

 

 

<< Back to main

Support BBTF

donate

Thanks to
Sheer Tim Foli
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Syndicate

Buy MLB playoff tickets, plus 2011 World Series, 2011 ALCS tickets and NLCS game tickets. We also have Texas Rangers playoff schedule, tickets to Red Sox games and Yankees game tickets. Plus, buy Phillies baseball tickets, Tigers playoff tickets and the biggies like ALDS baseball tickets and 2011 NLDS tickets.

Demarini, Easton and TPX Baseball Bats

 

 

 

AllianceTickets.com has cheap MLB Tickets. Get all your Colorado Rockies Tickets, Seattle Mariners Tickets, San Francisco Giants Tickets and all your favorite baseball tickets here. We also carry cheap Denver Broncos Tickets, Seattle Seahawks Tickets and Denver Nuggets Tickets.

Page rendered in 0.9054 seconds
38 querie(s) executed