Finding and understanding datasets and managing data flows can be challenging in larger organizations. Our open-source product SmartDataLakeBuilder (smartdatalake.io) holds valuable metadata about the data sources and data pipelines of data lakes. We want to visualize this metadata so that data scientists and analysts can find and understand data more easily and operators can manage data pipelines more efficiently.
Challenge: create state-of-the-art visualizations for data flows (Sankey chart) and datasets (force-directed graph chart) using the latest version of the D3 visualization library. Use graph theory to automatically optimize the layout of complex graph visualizations and maximize their usability.
The goal of the master thesis is to
Join our team as an intern and you will find a young, dynamic and culturally diverse working environment.
Knowledge / skills required