Baseball for the Thinking Fan

Login | Register | Feedback

btf_logo
You are here > Home > Baseball Newsstand > Baseball Primer Newsblog > Discussion
Baseball Primer Newsblog
— The Best News Links from the Baseball Newsstand

Saturday, November 17, 2012

TechCrunch: The Most Important Offseason Acquisition For The S.F. Giants Could Be Hadoop

Latest Hadoop yarn…

Baseball, more so than other sports, is known for its massive data collection, complex statistics and informed managerial decisions. So it should be no surprise that, just as corporate enterprises are going through a big data revolution, so will baseball. While the technology that enables big data is quite technical and designed to operate behind the scenes, the direct impact of big data on the average consumer will be quite visible over time. Hadoop, with its ability to manage massive data sets, is about to change the game of baseball.

...So why would a baseball organization need a Hadoop cluster? Because unstructured data may unlock insights that are not apparent from the structured event data that is available to every team. Baseball managers, like CEOs, believe that the past is a great predictor of the future. By having his data scientist run a Hadoop job before every game, Bruce Bochy can not only make an informed decision about where to locate a 3-1 Matt Cain pitch to Prince Fielder, but he can also predict how and where the ball might be hit, how much ground his infielders and outfielders can cover on such a hit, and thus determine where to shift his defense.
Taken one step further, it’s not hard to imagine a day where managers like Bochy have their locker room data scientist run real-time, in-game analytics using technologies like Cassandra, Hbase, Drill, and Impala.

Will Big Data Ruin Baseball?

This raises the question, will big data ruin baseball? Will tracking and analyzing this mountain of data take the enjoyment out of the game? I don’t think so. Our national pastime has survived the Black Sox scandal, the designated hitter, pull over uniforms, free agency, night games, multiple players’ strikes, the dead ball era, the live ball era, and of course steroids. Big Data is not nearly as threatening.

In fact, big data might be the great neutralizer between large market and small market teams. Teams with the most advanced predictive algorithms would have an advantage. Bay Area teams should have an even larger advantage since it is the epicenter of big data. If you are an avid Giants fan and a data scientist, your dream job may soon be available. But move quickly, because the team across the Bay may already have a head start – they do, after all, have a Hadoop-like elephant for a mascot.

Repoz Posted: November 17, 2012 at 06:17 AM | 1 comment(s) Login to Bookmark
  Tags: sabermetrics

Reader Comments and Retorts

Go to end of page

Statements posted here are those of our readers and do not represent the BaseballThinkFactory. Names are provided by the poster and are not verified. We ask that posters follow our submission policy. Please report any inappropriate comments.

   1. depletion Posted: November 17, 2012 at 09:53 AM (#4304875)
Barry is a Managing Director of Lightspeed and focuses primarily on information technology infrastructure, with a specific interest in cloud computing

What possible motivation would he have in writing this article?

You must be Registered and Logged In to post comments.

 

 

<< Back to main

News

All News | Prime News

Old-School Newsstand


BBTF Partner

Support BBTF

donate

Thanks to
tshipman
for his generous support.

Bookmarks

You must be logged in to view your Bookmarks.

Hot Topics

NewsblogOTP 22 May 2017: George W. Bush photobombs a sports reporter
(1698 - 11:54pm, May 28)
Last: Ray (RDP)

NewsblogMike Trout sprains thumb in Angels’ loss to Marlins
(9 - 11:52pm, May 28)
Last: The Yankee Clapper

NewsblogOT - March 2017 NBA thread
(4415 - 11:48pm, May 28)
Last: Oriole Tragic

NewsblogHomer Simpson inducted into Baseball Hall of Fame
(25 - 11:36pm, May 28)
Last: ERROR---Jolly Old St. Nick

NewsblogBrian Kenny on Twitter: "Regarding the Andrew McCutchen comeback: An absence of good news.
(21 - 10:50pm, May 28)
Last: Rennie's Tenet

NewsblogTerry Collins deserves more time with Mets, but will he get it? - NY Daily News
(7 - 10:26pm, May 28)
Last: Cargo Cultist

NewsblogFormer Senator, baseball Hall of Famer Jim Bunning has died
(69 - 9:57pm, May 28)
Last: GGC for Sale

NewsblogOT: March-April 2017 Soccer Thread
(557 - 9:50pm, May 28)
Last: Pirate Joe

NewsblogPhillies' Bowa, Hernandez form bond that crosses generational and cultural lines
(4 - 9:35pm, May 28)
Last: SoSHially Unacceptable

NewsblogYankees ticket sales plunge; New York has lost $166 million since 2009 | SI.com
(21 - 9:04pm, May 28)
Last: RMc's Aggravating as Hell, Arrogant, Disrespectful

NewsblogA lie told often enough becomes OMNICHATTER, for May 28, 2017
(54 - 8:49pm, May 28)
Last: AT-AT at bat@AT&T

NewsblogScott Boras says Jake Arrieta still elite pitcher even though velocity has dropped
(17 - 8:33pm, May 28)
Last: cardsfanboy

NewsblogA look at what all 30 teams can do with MLB's free agent mega-class of 2018-19
(66 - 4:59pm, May 28)
Last: cercopithecus aethiops

NewsblogWell said: Orioles are 'swinging at a lot of pitches we probably can’t do much with' - Baltimore Sun
(1 - 3:10pm, May 28)
Last: ERROR---Jolly Old St. Nick

NewsblogStephen Strasburg sets career high with 15 K's | MLB.com
(5 - 3:10pm, May 28)
Last: cercopithecus aethiops

Page rendered in 0.1142 seconds
47 querie(s) executed