Project D03

D03 | Visual Exploration and Analysis of Provenance Data

Prof. Melanie Herschel, University of Stuttgart
Email | Website

Prof. Sabine Storandt, University of Konstanz
Email | Website

[completed]

Houssem Ben Lahmar, University of Stuttgart – Email | Website

About this project
Results

To analyze or debug complex data processing applications, or to ensure their understandability and repeatability, provenance techniques are increasingly being deployed, resulting in large volumes and a wide variety of provenance data. The long-term goal of this project is to leverage visualization techniques to efficiently and effectively explore provenance data. In the first funding period, we will focus on properly visualizing the full provenance data generated for one run of a data-processing pipeline. This involves both quantifiably identifying suited visualizations for various provenance types and ensuring user-friendly provenance data generation and visualization in existing data processing pipelines.

Research Questions

What are suitable visualization techniques for different settings defined by varying types of provenance and applications?

Which metrics can quantitatively assess provenance data visualization quality?

How can such metrics support tuning processes generating and managing provenance data?

Which types of provenance are best suited to achieve the goals of reproducibility and predictability for selected visual computing processes?

Fig. 1:Visualizing and Interacting with Provenance Data

Publications

H. Ben Lahmar and M. Herschel, “Collaborative filtering over evolution provenance data for interactive visual data exploration,” Information Systems, vol. 95, p. 101620, 2021, doi: 10.1016/j.is.2020.101620.
- BibTeX
- Link
BibTeX
@article{benlahmar2021collaborative, author = {Ben Lahmar, Houssem and Herschel, Melanie}, doi = {10.1016/j.is.2020.101620}, journal = {Information Systems}, pages = 101620, title = {Collaborative filtering over evolution provenance data for interactive visual data exploration}, url = {https://doi.org/10.1016/j.is.2020.101620}, volume = 95, year = 2021 }
Link
https://doi.org/10.1016/j.is.2020.101620
V. Bruder et al., “Volume-Based Large Dynamic Graph Analysis Supported by Evolution Provenance,” Multimedia Tools and Applications, vol. 78, Art. no. 23, 2019, doi: 10.1007/s11042-019-07878-6.
- BibTeX
- Link
BibTeX
@article{journals/mta/BruderLHFBWHE19, affiliation = {Bruder, Valentin, Visualisierungsinstitut der Universität Stuttgart. Frey, Steffen, Visualisierungsinstitut der Universität Stuttgart. Burch, Michael, Visualisierungsinstitut der Universität Stuttgart. Weiskopf, Daniel, Visualisierungsinstitut der Universität Stuttgart. Ertl, Thomas, Visualisierungsinstitut der Universität Stuttgart}, author = {Bruder, Valentin and Lahmar, Houssem Ben and Hlawatsch, Marcel and Frey, Steffen and Burch, Michael and Weiskopf, Daniel and Herschel, Melanie and Ertl, Thomas}, doi = {10.1007/s11042-019-07878-6}, journal = {Multimedia Tools and Applications}, number = 23, orcid-numbers = {Bruder, Valentin/0000-0001-5063-4894, Frey, Steffen/0000-0002-1872-6905, Weiskopf, Daniel/0000-0003-1174-1026, Ertl, Thomas/0000-0003-4019-2505}, pages = {32939-32965}, title = {Volume-Based Large Dynamic Graph Analysis Supported by Evolution Provenance}, url = {https://doi.org/10.1007/s11042-019-07878-6}, volume = 78, year = 2019 }
Link
https://doi.org/10.1007/s11042-019-07878-6
C. Schulz, A. Zeyfang, M. van Garderen, H. Ben Lahmar, M. Herschel, and D. Weiskopf, “Simultaneous Visual Analysis of Multiple Software Hierarchies,” in Proceedings of the IEEE Working Conference on Software Visualization (VISSOFT), IEEE, 2018, pp. 87–95. [Online]. Available: https://ieeexplore.ieee.org/document/8530134/
- BibTeX
- Link
BibTeX
@inproceedings{Schulz2018Simultaneous, affiliation = {Schulz, Christoph, Visualisierungsinstitut der Universität Stuttgart. van Garderen, Mereke, Visualisierungsinstitut der Universität Stuttgart. Weiskopf, Daniel, Visualisierungsinstitut der Universität Stuttgart}, author = {Schulz, Christoph and Zeyfang, Adrian and van Garderen, Mereke and Ben Lahmar, Houssem and Herschel, Melanie and Weiskopf, Daniel}, booktitle = {Proceedings of the IEEE Working Conference on Software Visualization (VISSOFT)}, orcid-numbers = {Schulz, Christoph/0000-0001-5771-3966, Weiskopf, Daniel/0000-0003-1174-1026}, pages = {87-95}, publisher = {IEEE}, title = {Simultaneous Visual Analysis of Multiple Software Hierarchies}, url = {https://ieeexplore.ieee.org/document/8530134/}, year = 2018 }
Link
https://ieeexplore.ieee.org/document/8530134/
H. Ben Lahmar, M. Herschel, M. Blumenschein, and D. A. Keim, “Provenance-based Visual Data Exploration with EVLIN,” in Proceedings of the Conference on Extending Database Technology (EDBT), 2018, pp. 686–689. doi: 10.5441/002/edbt.2018.85.
- BibTeX
- Link
BibTeX
@inproceedings{benlahmar2018provenancebased, affiliation = {Keim, Daniel A., Universität Konstanz}, author = {Ben Lahmar, Houssem and Herschel, Melanie and Blumenschein, Michael and Keim, Daniel A.}, booktitle = {Proceedings of the Conference on Extending Database Technology (EDBT)}, doi = {10.5441/002/edbt.2018.85}, pages = {686-689}, title = {Provenance-based Visual Data Exploration with EVLIN}, url = {https://dx.doi.org/10.5441/002/edbt.2018.85}, year = 2018 }
Link
https://dx.doi.org/10.5441/002/edbt.2018.85
S. Oppold and M. Herschel, “Provenance for Entity Resolution,” in Provenance and Annotation of Data and Processes. IPAW 2018. Lecture Notes in Computer Science, vol. 11017, K. Belhajjame, A. Gehani, and P. Alper, Eds., Springer International Publishing, 2018, pp. 226–230. doi: 10.1007/978-3-319-98379-0_25.
- BibTeX
- Link
BibTeX
@inbook{conf/ipaw/OppoldH18, author = {Oppold, Sarah and Herschel, Melanie}, booktitle = { Provenance and Annotation of Data and Processes. IPAW 2018. Lecture Notes in Computer Science}, doi = {10.1007/978-3-319-98379-0_25}, editor = {Belhajjame, Khalid and Gehani, Ashish and Alper, Pinar}, pages = {226-230}, publisher = {Springer International Publishing}, title = {Provenance for Entity Resolution}, url = {https://doi.org/10.1007/978-3-319-98379-0_25}, volume = 11017, year = 2018 }
Link
https://doi.org/10.1007/978-3-319-98379-0_25
H. Ben Lahmar and M. Herschel, “Provenance-based Recommendations for Visual Data Exploration,” in Proceedings of the USENIX Conference on Theory and Practice of Provenance (TAPP), 2017, pp. 1–7.
- BibTeX
BibTeX
@inproceedings{lahmar2017, author = {Ben Lahmar, Houssem and Herschel, Melanie}, booktitle = {Proceedings of the USENIX Conference on Theory and Practice of Provenance (TAPP)}, pages = {1-7}, title = {Provenance-based Recommendations for Visual Data Exploration}, year = 2017 }
M. A. Baazizi, H. Ben Lahmar, D. Colazzo, G. Ghelli, and C. Sartiani, “Schema Inference for Massive JSON Datasets,” in Proceedings of the Conference on Extending Database Technology (EDBT), 2017, pp. 222–233. doi: 10.5441/002/edbt.2017.21.
- BibTeX
- Link
BibTeX
@inproceedings{baazizi2017schema, author = {Baazizi, Mohamed Amine and Ben Lahmar, Houssem and Colazzo, Dario and Ghelli, Giorgio and Sartiani, Carlo}, booktitle = {Proceedings of the Conference on Extending Database Technology (EDBT)}, doi = {10.5441/002/edbt.2017.21}, pages = {222-233}, title = {Schema Inference for Massive JSON Datasets}, url = {http://dx.doi.org/10.5441/002/edbt.2017.21}, year = 2017 }
Link
http://dx.doi.org/10.5441/002/edbt.2017.21
R. Diestelkämper, M. Herschel, and P. Jadhav, “Provenance in DISC Systems: Reducing Space Overhead at Runtime,” in Proceedings of the USENIX Conference on Theory and Practice of Provenance (TAPP), 2017, pp. 1–13. doi: 10.5555/3183865.3183883.
- BibTeX
- Link
BibTeX
@inproceedings{diestelkamper2017provenance, author = {Diestelkämper, Ralf and Herschel, Melanie and Jadhav, Priyanka}, booktitle = {Proceedings of the USENIX Conference on Theory and Practice of Provenance (TAPP)}, doi = {10.5555/3183865.3183883}, pages = {1-13}, title = {Provenance in DISC Systems: Reducing Space Overhead at Runtime}, url = {https://dl.acm.org/doi/abs/10.5555/3183865.3183883}, year = 2017 }
Link
https://dl.acm.org/doi/abs/10.5555/3183865.3183883
M. Herschel, R. Diestelkämper, and H. Ben Lahmar, “A Survey on Provenance - What for? What form? What from?,” The VLDB Journal, vol. 26, pp. 881–906, 2017, doi: 10.1007/s00778-017-0486-1.
- BibTeX
BibTeX
@article{herschel:2017:survey, author = {Herschel, Melanie and Diestelkämper, Ralf and Ben Lahmar, Houssem}, doi = {10.1007/s00778-017-0486-1}, journal = {The VLDB Journal}, pages = {881-906}, title = {A Survey on Provenance - What for? What form? What from?}, volume = 26, year = 2017 }
M. Herschel and M. Hlawatsch, “Provenance: On and Behind the Screens,” in Proceedings of the ACM International Conference on the Management of Data (SIGMOD), F. Özcan, G. Koutrika, and S. Madden, Eds., ACM, 2016, pp. 2213–2217. doi: 10.1145/2882903.2912568.
- BibTeX
- Link
BibTeX
@inproceedings{conf/sigmod/HerschelH16, author = {Herschel, Melanie and Hlawatsch, Marcel}, booktitle = {Proceedings of the ACM International Conference on the Management of Data (SIGMOD)}, doi = {10.1145/2882903.2912568}, editor = {Özcan, Fatma and Koutrika, Georgia and Madden, Sam}, pages = {2213-2217}, publisher = {ACM}, title = {Provenance: On and Behind the Screens}, url = {https://doi.org/10.1145/2882903.2912568}, year = 2016 }
Link
https://doi.org/10.1145/2882903.2912568