Purchase Our Products
You will find below our three current dataset products. All datasets are based on source data* that indicate the regular-season game-day starting player roster for each Major League Baseball game from 1914 through 2014. Each of our dataset products is a ".csv" formatted data file containing 2,923,407 records, sorted first by PlayerID and then by GameDate, in the format:
(PlayerID, GameDate, RunningTotalOfOtherStartingPlayers)
PlayerID is the unique identifier assigned to each MLB player, as created and defined by
Retrosheet. These PlayerID values can be directly associated with the player identifiers in other datasets, such as Lahman's PlayerID, if you wish to do deeper analysis across other MLB
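As a rough illustration of working with the three-field record format described above, the following Python sketch reads rows and extracts each player's final running total. It assumes the fields are comma-separated in the stated order, keeps GameDate as plain text because the date format is not specified here, and skips a header row in case one is present. The names Record, read_records, and final_totals, and the sample rows, are made up for this example and are not part of the products.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class Record(NamedTuple):
    player_id: str
    game_date: str  # kept as text; the date format is an unknown here
    running_total: int


def read_records(lines: Iterable[str]) -> Iterator[Record]:
    """Yield one Record per CSV row of (PlayerID, GameDate, RunningTotal)."""
    for row in csv.reader(lines):
        if len(row) != 3:
            continue  # skip blank or malformed rows
        player_id, game_date, total = (field.strip() for field in row)
        if not total.isdigit():
            continue  # skip a header row, if the file has one
        yield Record(player_id, game_date, int(total))


def final_totals(records: Iterable[Record]) -> dict[str, int]:
    """Return each player's last running total.

    Relies on the rows being sorted by PlayerID, then GameDate,
    so the last row seen for a player is the most recent one.
    """
    totals: dict[str, int] = {}
    for rec in records:
        totals[rec.player_id] = rec.running_total  # later rows overwrite earlier
    return totals


# Made-up sample rows, for illustration only.
sample = io.StringIO(
    "playerA01,1914-04-14,8\n"
    "playerA01,1914-04-15,9\n"
    "playerB01,1914-04-14,8\n"
)
career = final_totals(read_records(sample))
```

To run this against a purchased file, the in-memory sample could be swapped for something like open("dataset1.csv", newline=""), where the file name is a placeholder.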
MLB Dataset #1 contains, in the third field, the running total of UNIQUE other MLB starting players a player has PLAYED WITH up through that specific GameDate.
MLB Dataset #2 is identical in format and content to MLB Dataset #1, except the third field of Dataset #2 contains a running total of UNIQUE other MLB starting players a player has PLAYED AGAINST up through that specific GameDate.
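To make the two running totals concrete, here is a minimal Python sketch of how such counts could be derived from starting lineups. It is not the method used to build the products. It assumes a hypothetical input of (game_date, team_a_starters, team_b_starters) tuples in chronological order, with dates written so that they sort correctly as text. The function name and the tiny two-player lineups are invented for illustration.

```python
from collections import defaultdict


def running_unique_totals(games):
    """Compute running counts of unique teammates and unique opponents.

    games: iterable of (game_date, team_a_starters, team_b_starters),
           assumed to be in chronological order.
    Returns (with_rows, against_rows), each a list of
    (player_id, game_date, running_total) sorted by player, then date.
    """
    teammates = defaultdict(set)  # player -> starters played WITH so far
    opponents = defaultdict(set)  # player -> starters played AGAINST so far
    with_rows, against_rows = [], []

    for game_date, team_a, team_b in games:
        # Visit each side once as "own" and once as "other".
        for own, other in ((team_a, team_b), (team_b, team_a)):
            for player in own:
                teammates[player].update(p for p in own if p != player)
                opponents[player].update(other)
                with_rows.append((player, game_date, len(teammates[player])))
                against_rows.append((player, game_date, len(opponents[player])))

    def by_player_then_date(row):
        return (row[0], row[1])

    with_rows.sort(key=by_player_then_date)
    against_rows.sort(key=by_player_then_date)
    return with_rows, against_rows


# Invented lineups: player "b" changes sides between the two games.
games = [
    ("1914-04-14", ["a", "b"], ["c", "d"]),
    ("1914-04-15", ["a", "e"], ["c", "b"]),
]
with_rows, against_rows = running_unique_totals(games)
```

Because each player's teammates and opponents are held in sets, a starter who appears in several games with or against the same player is counted only once, which is what distinguishes a UNIQUE running total from a simple tally of appearances.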
We provide both MLB Dataset #1 and MLB Dataset #2 because one can learn tactics, techniques, and procedures both from those one performs a task WITH and from those one performs a task AGAINST.
MLB Dataset #3 is a discounted bundle of MLB Dataset #1 and MLB Dataset #2 that also includes two additional player connectivity datasets. That is, for $10.00 less than the combined separate prices of MLB Dataset #1 and MLB Dataset #2, you will receive BOTH of those datasets PLUS two additional datasets in the same format: one containing a running total of ALL starting players a player has played WITH up through that GameDate, and one containing a running total of ALL starting players a player has played AGAINST up through that GameDate.
Individual licensing information: by purchasing any of our data file products, you agree not to post them online in any form and not to give them to or share them with any person or organization. You may make one copy for personal back-up purposes.
Corporate licensing information: please contact us for corporate licensing terms that allow for multi-seat use of our datasets.
* The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at "www.retrosheet.org".
Copyright 2015 Connectivity Analysis LLC (CA)