Social Networking: Publications and Datasets
current

Research on Facebook (and other social networks)

Welcome. My group has been studying online social networks with a primary focus on Facebook.  Facebook is currently the largest online social network, with more than 200 million active users worldwide (April 2009).  It is also an interesting and vibrant application platform for everything from shared Netflix reviews to Hugg and social marketplaces.

We have also worked on a number of other social networking topics, including a measurement of the socially-enhanced Overstock.com auction site, and a system for using social networks to protect anonymous communication users from passive traffic analysis attacks.

Related Publications:

Datasets:

Following our EuroSys Facebook measurement study, we are making some datasets of social graphs and interaction graphs available.

These graphs only contain simple edges connecting anonymized nodeIDs. The social graph file is simply a list of all edges in the graph, each bidirectional edge represented by a two-tuple of anonymized nodeIDs. Our user connectivity graphs reflect measurements performed in early 2008, and are not reflective of current Facebook topologies.

For the anonymized interaction graphs, we filter interactions based on their relative age to the time of the crawl (April 2008). Each edge in the interaction graph is listed in the file as a two-tuple of anonymized nodeIDs. The interaction graph is an undirected graph, so an edge from A to B represents a bidirectional edge connecting them. We include multiple interactions within the same period as duplicate edges across the same endpoints to account for user pairs that interact more than once during the time period. This frequency can be used to assign "weights" to edges on the interaction graph. If you want an undirected, unweighted interaction graph, then remove those duplicate edges.

Note: If you would like access to this data, please send email to ravenben at cs dot ucsb dot edu. When you get access to the data files, please do not distribute them beyond your immediate research group. Thank you.


Locations of visitors to this page