Blog | Uncharted®

Tweets about Donald Trump during February 2016

Everyone's tweeting Trump leading up to Super Tuesday

Richard Brath
March 1, 2016

In our ongoing analysis of tweets about Donald Trump we pulled together 27 million tweets in the month of February from 3.7 million unique users. It seems like everyone is joining the conversation in different ways.

Two Days of Trump in Iowa

2 million tweets, 700 thousand voices

Rob Harper
February 5, 2016

Last summer we began tracking tweets that mention Donald Trump in preparation for our talk at Strata NY. At the time it looked like things were only going to get more interesting so we kept watching and, as of Feb 1, we’d processed over 59 million tweets. With the Iowa caucus last week it seemed like a good time to revisit some of our earlier analysis.

Connections between tor relay nodes in North America and Europe

TorFlow

Data Flow in the Tor Network

Chris Dickson and Kevin Birk
January 17, 2016

The Tor project is an open network for anonymous communication over the internet. Tor routes users’ internet traffic through a series of volunteer-run relay nodes to conceal its origin and destination from potential surveillance or censorship. While Tor is built for anonymity, the structure of the network and locations of many of the relay nodes is open.

T-Drive trajectory dataset of 15 million data points visualzed over Beijing

Introducing Uncharted Salt

Open source, multi-scale big data visualization

Sean McIntyre
December 22, 2015

One of the questions we’ve spent a lot of time pondering over the years is a deceptively simple one - How do we visualize billions of data points? To understand why this is a difficult question to answer, let’s look at two popular approaches.

Continuous Integration with Apache Spark

Sean McIntyre
November 6, 2015

We’ve been writing libraries for Spark at Uncharted for several years, and continuous integration involving the Spark runtime has always been a difficult thing to accomplish. Common approaches, such as creating a Spark context within a standard Scala runtime, can fail to accurately emulate nuances of the distributed Spark environment. I’d like to share a solution we’ve developed for creating a native Spark test environment within TravisCI.