Graph Visualization (Diploma Thesis)

  • ELCA
  • Bern, Bern, Switzerland
  • 09/10/2020
Full time | Data Science, Data Analytics, Big Data, Data Management, Statistics

Job Description

Finding and understanding datasets and managing data flows in larger organizations can be quite challenging. Our open-source product SmartDataLakeBuilder (smartdatalake.io) holds valuable metadata about the data sources and data pipelines of data lakes. We want to visualize this metadata so that data scientists and analysts can find and understand data more easily and operators can manage data pipelines more efficiently.

Challenge: create state-of-the-art visualizations for data flows (Sankey chart) and datasets (force-directed graph chart) using the latest version of the D3 visualization library. Use graph theory to automatically optimize the layout of complex graph visualizations and maximize their usability.

In this role

The goal of the master thesis is to

  • implement a customized “Sankey chart” with D3 to represent data flows, optimizing the layout to reduce link crossings. Add functionality to show, expand & collapse groups, highlight paths and display arbitrary metadata.
  • implement a customized “force-directed graph chart” with D3 to represent datasets and their dependencies. Add functionality to show, expand & collapse groups and display arbitrary metadata.
  • build a small Web UI around the visualizations to filter the metadata and select the attributes to show.
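
To give a flavour of the layout optimization mentioned above, here is a minimal sketch of one well-known approach to reducing crossings between two adjacent columns of a Sankey chart: the barycenter heuristic. It is an illustration only, not the method prescribed for the thesis. The `Link` type and the functions `countCrossings` and `barycenterOrder` are hypothetical names chosen for this example and are not part of D3 or SmartDataLakeBuilder.

```typescript
// Hypothetical link type for this sketch (not a D3 or SmartDataLakeBuilder type).
type Link = { source: string; target: string };

// Count the pairs of links that cross, given the top-to-bottom node order
// of two adjacent columns. Two links cross when their endpoints appear in
// opposite relative order in the left and the right column.
function countCrossings(left: string[], right: string[], links: Link[]): number {
  const posL = new Map(left.map((id, i) => [id, i] as [string, number]));
  const posR = new Map(right.map((id, i) => [id, i] as [string, number]));
  let crossings = 0;
  for (let i = 0; i < links.length; i++) {
    for (let j = i + 1; j < links.length; j++) {
      const dl = posL.get(links[i].source)! - posL.get(links[j].source)!;
      const dr = posR.get(links[i].target)! - posR.get(links[j].target)!;
      if (dl * dr < 0) crossings++;
    }
  }
  return crossings;
}

// Barycenter heuristic: reorder the right column by the mean position of
// each node's neighbours in the left column. Nodes without incoming links
// keep their current index as their score (a simplification for this sketch).
function barycenterOrder(left: string[], right: string[], links: Link[]): string[] {
  const posL = new Map(left.map((id, i) => [id, i] as [string, number]));
  const score = (id: string): number => {
    const ps = links.filter(l => l.target === id).map(l => posL.get(l.source)!);
    return ps.length === 0
      ? right.indexOf(id)
      : ps.reduce((a, b) => a + b, 0) / ps.length;
  };
  return [...right].sort((a, b) => score(a) - score(b));
}

// Made-up example data: three sources whose links are fully "twisted".
const exampleLeft = ["a", "b", "c"];
const exampleRight = ["x", "y", "z"];
const exampleLinks: Link[] = [
  { source: "a", target: "z" },
  { source: "b", target: "y" },
  { source: "c", target: "x" },
];
const reordered = barycenterOrder(exampleLeft, exampleRight, exampleLinks);
console.log(countCrossings(exampleLeft, exampleRight, exampleLinks)); // crossings before
console.log(countCrossings(exampleLeft, reordered, exampleLinks));    // crossings after
```

In a real chart with more than two columns, such a pass would typically be repeated column by column in both directions, and the resulting order would then be handed to the D3 layout. How far this or other graph-theoretic techniques can be pushed is part of the thesis.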

What we offer

Join our team as an intern and you will find a young, dynamic and culturally diverse working environment.

About your profile

Knowledge / skills required

  • Interest in and strong knowledge of JavaScript, visualizations and graph theory.
  • Experience with big data tools (Hadoop, Spark) and general data processing is a plus.