About

The internet today offers an overwhelming and still growing number of resources such as websites, images, texts, and videos. The resulting Big Data problem is not only a matter of handling this immense volume of data: the data also needs to be processed, cleaned, and presented in a user-friendly, graphical way.

The VANDA project addresses the challenges summarized in the four V’s: Volume (huge amounts of data in the range of terabytes and petabytes), Velocity (the speed at which data is created, processed, and analysed), Variety (the heterogeneous data types, sources, and formats), and Veracity (the authenticity and validity of data).

Big Data-driven interfaces in the VANDA project combine suitable backend and frontend technologies with automatic and semi-automatic approaches in order to analyse data in various business contexts. An important aspect is human intervention in developing and training machine learning algorithms (human in the loop).

Glyphs

Glyphs are small, independent visual objects that map each data attribute to a graphical attribute such as size, shape, color, or orientation. Their major strength is that patterns involving more than three dimensions can be perceived more readily, and subsets of dimensions can form composite visual features that are easy to recognize.
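The mapping from data attributes to graphical attributes can be illustrated with a minimal sketch. The `Glyph` class, the `to_glyph` function, the value ranges, and the choice of which attribute drives which visual channel are all assumptions made for this example; they are not taken from the Glyphboard implementation.

```python
from dataclasses import dataclass

# Hypothetical set of shapes a glyph can take
SHAPES = ("circle", "triangle", "square", "star")


@dataclass
class Glyph:
    size: float         # radius in pixels
    shape: str          # one of SHAPES
    hue: float          # color hue in degrees (0 = red, 240 = blue)
    orientation: float  # rotation in degrees


def normalize(value, low, high):
    """Scale a raw value into [0, 1], clamping values outside the range."""
    if high == low:
        return 0.0
    return min(1.0, max(0.0, (value - low) / (high - low)))


def to_glyph(item, ranges):
    """Map exactly four numeric attributes of one data item to one glyph."""
    a, b, c, d = (normalize(v, lo, hi) for v, (lo, hi) in zip(item, ranges))
    return Glyph(
        size=4.0 + 16.0 * a,                                      # attribute 1 -> size
        shape=SHAPES[min(int(b * len(SHAPES)), len(SHAPES) - 1)],  # attribute 2 -> shape
        hue=240.0 * c,                                            # attribute 3 -> color
        orientation=360.0 * d,                                    # attribute 4 -> orientation
    )


# One data item with four attributes, each assumed to lie in the range 0..10
glyph = to_glyph((5, 0, 10, 2.5), [(0, 10)] * 4)
```

Because every attribute drives its own visual channel, two items that differ in a single dimension produce glyphs that differ in exactly one visible property.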

We apply this visualization technique, with different levels of detail, to the problem of analyzing the features involved in clustering algorithms. Our concept relies on mapping each data item to a color-coded pixel in a scatterplot that is computed using Multidimensional Scaling. The resulting clusters are first explored by a data analyst, who then selects subsets or small clusters for detailed inspection. Once the number of data items is reduced, another level of detail in the form of glyphs is presented for an in-depth analysis.
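The scatterplot positions described above can be sketched as follows. The text does not say which variant of Multidimensional Scaling is used; this example assumes classical (Torgerson) MDS on Euclidean distances and uses a simple power iteration so that it runs without external libraries. The function names and the sample data are hypothetical.

```python
import math


def pairwise_distances(items):
    """Euclidean distance matrix between feature vectors."""
    return [[math.dist(a, b) for b in items] for a in items]


def power_iteration(matrix, iterations=500):
    """Approximate the dominant eigenpair of a symmetric matrix."""
    n = len(matrix)
    vec = [1.0 + i / n for i in range(n)]  # deterministic, non-constant start
    for _ in range(iterations):
        nxt = [sum(row[j] * vec[j] for j in range(n)) for row in matrix]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:
            break
        vec = [x / norm for x in nxt]
    # Rayleigh quotient of the (unit-length) vector gives the eigenvalue
    value = sum(
        vec[i] * sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)
    )
    return value, vec


def classical_mds(distances, dims=2):
    """Embed items in `dims` dimensions from their distance matrix."""
    n = len(distances)
    sq = [[d * d for d in row] for row in distances]
    row_mean = [sum(row) / n for row in sq]
    grand_mean = sum(row_mean) / n
    # Double centering turns squared distances into inner products
    b = [
        [-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean) for j in range(n)]
        for i in range(n)
    ]
    coords = [[0.0] * dims for _ in range(n)]
    for d in range(dims):
        value, vec = power_iteration(b)
        scale = math.sqrt(max(value, 0.0))
        for i in range(n):
            coords[i][d] = scale * vec[i]
        # Deflate: remove the component just found before the next pass
        b = [[b[i][j] - value * vec[i] * vec[j] for j in range(n)] for i in range(n)]
    return coords


# Four hypothetical data items with three features each
items = [(0, 0, 5), (4, 0, 5), (0, 1, 5), (4, 1, 5)]
distances = pairwise_distances(items)
positions = classical_mds(distances)  # one (x, y) pair per data item
```

Each resulting position would then be drawn as one pixel and colored, for example, by cluster label. With real high-dimensional data the 2D distances only approximate the original ones, and a numerical library would normally replace the hand-written eigenvector search.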

News

Best honorable mention for Glyphboard

This week, from 20 to 25 October, Mandy Keck and Thomas Gründer are representing the VANDA team in Vancouver and participating in IEEE VIS, the world's largest and most important conference on Information Visualization, Visual Analytics and Scientific Visualization. The conference started on Sunday with many interesting and inspiring keynotes, talks and tutorials, …

VANDA project team @ OUTPUT.DD

This June we presented our research results at OUTPUT.DD and at the celebration of 50 years of university computer science education in Dresden. OUTPUT.DD is the project showcase of the Faculty of Computer Science, where research results are presented to the general public once a year. During the two-day event on 19 and 20 June 2019, we presented our Glyphboard demonstrator to …

Posters at IEEE InfoVis 2018

We are proud to be presenting two posters at IEEE InfoVis in Berlin this year. Mandy Keck, Dietrich Kammer, and Rainer Groh present Visual Version Comparison of Multidimensional Data Sets using Glyphs. When working with a given high-dimensional data set, data analysts often use different machine-learning algorithms to calculate clusters and classifications. However, it is difficult to …