Statistical Analysis of Network Data | |||||||||||||||||||
Methods and Models | |||||||||||||||||||
|
Please note that for the majority of these datasets it is only due to the generosity of various of my colleagues that they are being made available for general use. In using any of these datasets, please acknowledge the sources appropriately.
Aggregate flow volume data based on measurements of origin-destination flows on the Abilene network, taken continuously over a seven-day period, starting December 22, 2003. Network of citations among blogs related to AIDS, patients, and their support networks, collected by Gopal, over a three-day period in August 2005. ECoG time series data corresponding to two periods (so-called 'pre-ictal' and 'ictal') of a seizure in an epilepsy patient, for eight separate seizures. Measurements are taken at each of 76 electrodes in the brain of the patient, allowing for the construction of association-based networks in studying functional connectivity. Zachary's well-known 'Karate Club' social network. Lazega's data on the collaborative working relationships among lawyers in a New England law firm. I am unable to make these data available, due to privacy constraints associated with the original study. A subset of the microarray data for E. coli available from the Many Microbe Microarrays Database, as well as a subset of the known regulatory interactions for E. coli listed in the RegulonDB database. Packet delay data from Coates et al. resulting from an Internet packet probing experiment designed for conducting network topology inference. A network of interactions among 5151 proteins in S. cerevisiae (i.e., baker's yeast), culled from the January 2007 BioGRID database. A sub-network of the above protein interaction network, induced by those proteins annotated with the function `Cellular Communication' in the January 2007 version of the Gene Ontology (GO) database, as well as labels indicating which of those proteins are further annotated with `Intracellular Signaling Cascade' (i.e., a more specific form of cellular communication). A network representation of a portion of the router-level Internet, based on topology discovery measurements collected between April 21 and May 8, 2003 by the skitter measurement system at CAIDA. These data were collected as part of a project at Sandia Labs and are proprietary and unavailable. Figures 3.5 and 3.6 corresponding to these data were furnished directly to me by Kevin Boyack. |
|
|||||||||||||||||