Our data is entirely sourced from WikiLeaks, and was last updated on Fri Nov 4, 2016 (Podesta 30). In particular, we used the Hillary Clinton Email Archive, The Podesta Emails, and the DNC Email Archive.
For all three networks, nodes (a.k.a. circles) are people and links (a.k.a. lines) between nodes indicate that two people were on a common e-mail thread. The size of a node shows the number of e-mail threads in which a person participated. The size of a link denotes the number of shared e-mail threads between two people. Each node is algorithmically assigned to a community (a group with a disproportionately large number of links among them), which is shown by node color.
At the MIT Media Lab Macro Connections group, we build tools to transform data into stories. ClintonCircle was not created to visualize these emails, but it is a fork of a tool called Immersion that we launched in 2013 and that helps people visualize and understand the networks they weave through email interactions.
Some of the other tools we have built include the Observatory of Economic Complexity, which is the world’s leading site visualizing international trade data, Pantheon, a project exploring our species collective memory, and DataUSA, the most comprehensive visualization of US Public Data. We build all of these tools to improve people’s ability to understand and explore data, and to democratize the benefits of data exploration and visualization.
Kevin Hu is a doctoral candidate working on tools for accessible data visualization and analysis. Jingxian Zhang is a masters student creating e-mail network visualization tools for organizations. Professor César Hidalgo, PhD, is the Principal Investigator and Director of the Macro Connections group at the MIT Media Lab.