The internet today offers an overwhelming amount of still growing resources such as websites, images, texts, and videos. The resulting Big Data Problem does not only consist of the handling of this immense volume of data. Moreover, data needs to be processed, cleaned, and presented in a user-friendly, graphical way.

The VANDA project addresses the challenges summarized in the four V’s: Volume (huge data amounts in the range of tera and peta bytes), Velocity (the speed in which data is created, processed, and analysed), Variety (the different heterogeneous data types, sources, and formats), Veracity (authenticity and validity of data).

Big Data driven interfaces in the VANDA project combine suitable backend and frontend technologies as well as automatic and semi-automatic approaches in order to analyse data in various business contexts. An important aspect is human intervention in developing and training machine learning algorithms (human in the loop).


Glyphs are small independent visual objects that map each data attribute to graphical attribute, such as size, shape, color and orientation. Its major strength is that patterns involving more than three dimensions can be more readily perceived and subsets of dimensions can form composite visual features that are easy to recognize.

We propose this visualization technique with different levels of detail to the problem of analyzing the features involved in clustering algorithms. Our concept relies on a mapping of each data item to a color-coded pixel in a scatterplot that is computed using Multidimensional Scaling. The resulting clusters are first explored by a data analyst, who then selects subsets or small clusters for detailed inspection. Once the amount of data items is reduced, another level of detail in the form of glyphs is presented for an in-depth analysis.


Workshop on Visual Interfaces for Big Data Environments in Industrial Applications at AVI 2018

Dietrich Kammer and Mandy Keck from the VANDA project team were involved in conducting the Workshop on Visual Interfaces for Big Data Environments in Industrial Applications at AVI 2018 on 29 May 2018. The workshop was part of the International Conference on Advanced Visual Interfaces (AVI 2018) in Castiglione della Pescaia (Italy). The target of …

Conversation Training Paper and Presentation at VisBIA 2018

Alexander Maasch from project partner chemmedia AG also presented research concerning the visualization of conversation training data. Conversation training is an e-learning method designed for the transfer of communication-related skills. With increasing conversation length and number of learners, creating and evaluating conversations becomes a complex task that needs appropriate visual guidance and data visualization. This …

Poster Presentation of Big Data Landscapes at AVI 2018

A poster with research from the VANDA project about Big Data Landscapes was discussed with the international audience at the International Conference on Advanced Visual Interfaces (AVI 2018) at Castiglione della Pescaia, Grosseto (Italy) on 30 May 2018 during the poster session.