Field

Big Data

2023

Why AI Can’t Save the World (Yet), and How Humans Can Help

Brath, R., E. Wyatt, and W. Brooke, AI for Good, January 2023

Analysts tasked with solving social problems like human trafficking rely on long-established processes and field experience. Broad artificial intelligence solutions to these problems have good intentions, but they often omit nuance and context that analysts bring. In this talk, we present Uncharted’s approach to designing “augmented intelligence” tools for global resilience that let both humans and machines do what they do best.

Watch Video

2021

Unmasking Human Trafficking Risk in Commercial Sex Supply Chains with Machine Learning

IBM Best Student Paper

Ramchandani, P., H. Bastani, E. Wyatt, Working Paper, INFORMS Annual Meeting 2021 (Service Science Section), June 2021

The covert nature of sex trafficking provides a significant barrier to generating large-scale, data-driven insights to inform law enforcement, policy and social work. We leverage massive deep web data (collected globally from leading commercial sex websites) in tandem with a novel machine learning framework to unmask suspicious recruitment-to-sales pathways, thereby providing the first global network view of trafficking risk in commercial sex supply chains.

View Paper

2017

Text Analytics and New Visualization Techniques

Brath, R. and S. Langevin, Strata, New York, September 2017

Text analytics have significantly advanced with techniques such as entity extraction and characterization, topic and opinion analysis, and sentiment and emotion extraction. But the visualization of text has advanced much more slowly. Recent visualization techniques for text, however, are providing new capabilities. In this talk we offer an overview of these new ways to organize massive volumes of text, characterize subjects, score synopses, and skim through a lot of documents. Together, these techniques can improve workflows for users focused on documents.

View Slides
Watch Video

2016

Graph Mapping: Multi-Scale Community Visualization of Massive Graph Data

Jonker, D., S. Langevin, D. Giesbrecht, M. Crouch, and N. Kronenfeld, Information Visualization Journal, August 2016

Graph visualizations increase the perception of entity relationships in a network. However, as graph size and density increases, readability rapidly diminishes. In this cover article for the July 2017 edition of the Information Visualization journal, we present an end-to-end, tile-based visual analytic approach called graph mapping that utilizes cluster computing to turn large-scale graph (node-link) data into interactive visualizations in modern web browsers.

View Paper

2015

Multi-Scale Community Visualization of Massive Graph Data

Langevin, S., D. Jonker, D. Giesbrecht and M. Crouch, IEEE EGAS Exploring Graphs at Scale 2015

Graph visualizations increase the perception of entity relationships in a network. However, as graph size and density increases, readability rapidly diminishes. In this paper, we present a tile-based visual analytic approach that leverages cluster computing to create large-scale, interactive graph visualizations in modern web browsers. Our approach is optimized for analyzing community structure and relationships.

View Paper
Global to Local Pattern of Life Analysis with Tile-Based Visual Analytics

Langevin, S., D. Jonker, K. Birk, C. Bethune and N. Kronenfeld, IEEE Vis 2015, Practitioner Session

In this paper, we present in-progress work on applications of tile-based visual analytics (TBVA) to population pattern of life analysis and geo-temporal event detection. TBVA uses multiresolution data tiling, analytics and layered high-fidelity visualization to enable interactive multi-scale analysis of billions of records in modern web browsers through techniques made familiar by online map services.

View Paper
View Poster
Value in the Details: Understanding Data Through Visual Exploration

Brath, R. and R. Harper, Strata, New York, October 2015

Analysts and data scientists work with large amounts of data, but the common approach, which dates back 20 years, is to roll up all the data into summary tables or charts, resulting in loss of detail. In contrast, direct visual exploratory analysis of massive amounts of raw data can yield insights that are otherwise overlooked. Here we highlight high-density visualizations that directly plot hundreds of millions of data points for applications such as market opportunity, periodicity analysis, and anomaly identification.

View Slides
Watch Video
Untangling Influence and Desire: Visual Analysis of Massive Graph Data

Jonker, D. and S. Langevin, Strata, London, May 2015

This session demonstrates using open source tools and techniques for visually exploring massive node-link graphs in a web browser by visualizing all the data. Seeing all the data reveals informative patterns and provides important context to understanding insights. Examples will highlight large-scale graph analysis of social networks, customer purchase history, and healthcare industry data.

View Slides
Influent
Watch Video

2014

Highly Scalable Tile-Based Visualization for Exploratory Data Analysis

Jonker, D. and R. Harper, Strata, New York, October 2014

The open source Aperture Tiles provides interactive data exploration with continuous zooming on large scale datasets. By leveraging big data software stacks Hadoop, Apache Spark and HBase for distributed computation and storage, the technique can scale to data sets with billions of data points or more. In this talk we review the Tile-based visual analytic methodology, challenges and highlight some example applications.

View Slides
Watch Video
Influent: Scalable Transaction Flow Analysis with Entity-Relationship Graphs

Jonker D., S. Langevin, D. Gauldie and W. Wright, Poster, IEEE EuroVis 2014

In this paper, we present in-progress work on “Influent,” a graph analysis tool that enables an intelligence analyst to visually and interactively “follow the money” or other transaction flow. Summary visualizations of transactional patterns and entity characteristics, a left-to-right semantic flow layout, interactive link expansion and hierarchical entity clustering enable Influent to operate effectively at scale with millions of entities and hundreds of millions of transactions, with larger data sets in progress.

View Paper

2013

Visualizing Big Graphs and Social Networks

Brath, R. and D. Jonker, Strata New York, October 2013

Big graphs are difficult to visualize because the scale of the data quickly results in complex network diagrams that can be difficult to decipher and have complicated interfaces with hundreds of options. Instead, we start by understanding what valuable nuggets we are trying to learn from this data, such as “Who’s connected to both me and my competition?” and “Did these people meet?” Starting from the question, we can then simplify the requirements and design effective visualizations that fit the problem. We will look at some mini-case studies and demos (e.g. Twitter correlation network, charity network).

View Slides
Tile Based Visual Analytics for Twitter Big Data Exploratory Analysis

Cheng, D., P. Schretlen, N. Kronenfeld, N. Bozowsky and W. Wright, IEEE Big Data Conference, October 2013

The widespread use and adoption of web-based geo maps have provided a familiar set of interactions for exploring extremely large geo data spaces and can be applied to similarly large abstract data spaces. Building on these techniques, a tile based visual analytics system (TBVA) was developed that demonstrates interactive visualization for a one billion point Twitter dataset.

View Paper
Louvain Clustering for Big Data Graph Visual Analytics

Gauldie, D., S. Langevin, P. Schretlen, D. Jonker, N. Bozowsky and W. Wright, IEEE VisWeek, October 2013

Visual aggregation techniques, tools and easily tailorable components are needed that will support answering analytical questions with data description, characterization and interaction without loss of information. We present two case studies of prototype implementations of JavaScript browser-based visualization tools leveraging the Louvain clustering algorithm.

View Paper
Interactive Data Exploration with “Big Data Tukey Plots”

Schretlen, P., N. Kronenfeld, D. Gray, J. McGeachie, E. Hall, D. Cheng, N. Covello and W. Wright, IEEE VisWeek, October 2013

Our goal is to develop new automated tools to produce effective raw data characterization on extremely large datasets. This paper reports on the development of a web based interactive scatter plot prototype that uses tile-based rendering similar to geographic maps and interaction paradigms.

View Paper

2023

Why AI Can’t Save the World (Yet), and How Humans Can Help

2021

Unmasking Human Trafficking Risk in Commercial Sex Supply Chains with Machine Learning

2017

Text Analytics and New Visualization Techniques

2016

Graph Mapping: Multi-Scale Community Visualization of Massive Graph Data

2015

Multi-Scale Community Visualization of Massive Graph Data

Global to Local Pattern of Life Analysis with Tile-Based Visual Analytics

Value in the Details: Understanding Data Through Visual Exploration

Untangling Influence and Desire: Visual Analysis of Massive Graph Data

2014

Highly Scalable Tile-Based Visualization for Exploratory Data Analysis

Influent: Scalable Transaction Flow Analysis with Entity-Relationship Graphs

2013

Visualizing Big Graphs and Social Networks

Tile Based Visual Analytics for Twitter Big Data Exploratory Analysis

Louvain Clustering for Big Data Graph Visual Analytics

Interactive Data Exploration with “Big Data Tukey Plots”