Navigation auf uzh.ch

Suche

Department of Informatics Interactive Visual Data Analysis Group

Lecture: Interactive-Visual Data Analysis (L&E)

IVDA galery
Lecturer Prof. Dr. Jürgen Bernard
TAs & Tutors

Gabriela Morgenshtern, Ann-Kathrin Kübler

Teaching Language English
Level, ECTS

MSc (6 ECTS)

PhD (DSI) (2 ECTS)

Notes If you have already taken the course „Introduction to Interactive-Visual Data Analysis (MINF4570)" in Fall 2021, you are not allowed to book this course.
Academic Semester Fall 2024
Time and Location

Tuesday 16:15 - 18:00, Room 2.A.01

Thursday 14:00 - 15:45, Room BIN 2.A.01

Digital Backups

Slides will be on OLAT, additional course material will be as described below.

There is no recording of the lecture and exercise, as this is an interactive course requiring live participation.

OLAT To access all the course material OLAT
Start Date 17.09.2024
End Date 19.12.2024
Course Material

Coursebook (Visualization Analysis and Design, Tamara Munzner)

Research papers (as announced)

IVDA Programming Tutorial

Available on GitLab

Prerequisites Willingness to participate actively in class is recommended. Willingness to work in groups to face data analysis challenges together.
Prior Knowledge Programming expertise is mandatory. Student groups should be comfortable with frontend and backend development. Basic knowledge in data science, machine learning, and data analysis are useful but not mandatory.
Grading Regular per-person exercises and individual homework assignments (50%), programming project in the group including iterative presentation and report submission (50%). Students must pass both the individual grade and the overall grade for successful completion of the course.
General Inquires For any general inquires about the course send an email

 

Course Pitch

As of 2022, but also applies to 2024.

Course Pitch

General Description 

This course introduces fundamental concepts and techniques of interactive-visual data analysis (IVDA). The main focus is on the combination of automatic data analysis methods with interactive visual interfaces, as well as on their interplay to facilitate data analysis goals. As such, IVDA is particularly suited to leverage the strengths of both humans and machines in a human-in-the-loop data analysis process. Associated research fields are Information Visualization, Visual Analytics, Interactive Data Science, Interactive Machine Learning, and Human-Centered Artificial Intelligence.

Learning Outcome

Students will learn basic characteristics of data types and data attributes (WHAT), as well as data analysis tasks (WHY). Further, students will learn basic design skills about HOW data can be transformed into visual structures and which types of visualization techniques are meaningful design choices for given data types and analysis at hand. Students will also learn fundamental interaction techniques, as well as concepts for the composition of views in data analysis systems for different data types and their individual complexities.

Further, students will gain a deep understanding about how data analysis can benefit from both having a human and a (machine learning) model in the loop, following the goal to gain knowledge from data.  Along these lines, students will learn about the strengths (and weaknesses) of human and machines, as well as about combining these complementary strengths effectively, as described in Visual Analytics methodology. In detail, students will learn examples for interactive data preprocessing, human-centered unsupervised machine learning, as well as for human-centered semi-supervised and supervised machine learning. Finally, the course introduces approaches that allow training personalized machine learning models and conduct personal and human-centered data analytics and artificial intelligence.

Target Groups

This module is designed for MA students (POC, DS, AI). It would be possible for students in other disciplines to take this course with a programming background. There are no enforced prerequisites, but the willingness to participate actively and to work in groups is recommended.  

Prior Knowledge

The course requires some degree of  programming expertise. Student groups should be comfortable with frontend and backend development. Basic knowledge in data science, machine learning, and data analysis are useful but not mandatory.

Recommended Reading

Visualization Analysis and Design, Tamara Munzner (A K Peters Visualization Series, CRC Press, 2014) is the course textbook. Recommended reading also includes selected papers, as outlined below.

Topic Overview

L01
Introduction to IVDA
L02 What: dataset types and data attributes
L03
Why: analysis tasks and abstractions
L04 How: visual variables and visualization guidelines
L05 How: interaction techniques and view composition
L06 How: advanced visualization techniques
L07 Data wrangling and visual preprocessing
L08 Introduction to Visual Analytics

L09

Unsupervised machine learning and data exploration
L10 Supervised machine learning and model explanation
L11
Human-centered artificial intelligence
L12 Human knowledge externalization
L13

Preference-based and personalized analytics

2024-09-17 - L01: Introduction to IVDA

Recommended reading (pre-class)

  • VAD Book Chapter 1. What's Vis, and Why Do It?

In-class Agenda

  • Welcome to the class!
  • Introduction to Interactive Visual Data Analysis
  • Interactive Demo: Exploration of Funds Data

  • Course logistics - organizational stuff

Further Reading

  • [VIS Design Principles]: Semiology of Graphics, Jacques Bertin, Gauthier-Villars 1967, EHESS 1998
  • [VIS Design Principles]: The Visual Display of Quantitative Information. Edward R. Tufte. Graphics Press, 1983.

2024-09-19 - L02: What? Data Types and Attributes

Recommended reading (pre-class)

  • VAD Book Chapter 2: What: Data Abstraction

In-class Agenda

  • What to analyze? Introduction to the data perspective
  • What to analyze? Dataset Types, the between-objects perspective

  • What to analyze? Data Attributes: nominal, ordinal, and numerical

Further Reading

2024-09-24 - L03: Why? Analysis Tasks

Recommended reading (pre-class)

  • VAD Book Chapter 3: Why: Task Abstraction

In-class Agenda

  • Why analyze? Analysis Tasks - actions and targets

  • Why analyze? Data and Task Abstraction using a four-level analysis framework for design and validation

Further Reading

2024-10-01 - L04: How? Marks, Channels, and Visualization Guidelines

Recommended reading (pre-class)

  • VAD Book Chapter 5: Marks and Channels
  • VAD Book Chapter 6: Rules of Thumb

In-class Agenda

  • How analyze? Marks - Basic Graphical Elements

  • How analyze? Channels - Visual Variables

  • How analyze? Visualization Guidelines - Perception, Color, and Rules of Thumb

  • How analyze? Decoding of Visualizations - Chart Decomposition

  • How analyze? Exercise

Further Reading

  • [VIS Design Principles]: Semiology of Graphics, Jacques Bertin, Gauthier-Villars 1967, EHESS 1998
  • [VIS Design Principles]: The Visual Display of Quantitative Information. Edward R. Tufte. Graphics Press, 1983.

2024-10-03 - L05: How? Interaction Techniques and View Composition

Recommended reading (pre-class)

  • VAD Book Chapter 11. Manipulate View
  • VAD Book Chapter 12. Facet into Multiple Views
  • VAD Book Chapter 14. Embed: Focus+Context

In-class Agenda

  • How analyze? Interaction Design - Engaging in a dialog with the data

  • How analyze? Interaction Techniques - Overview of atomic Interactions

  • How analyze? View Composition - Leveraging interaction techniques

Further Reading

  • Norman, D., The design of everyday things: Revised and expanded edition. Basic books. 2013
  • Interactive Visual Data Analysis. Christian Tominski and Heidrun Schumann. AK Peters Visualization Series. CRC Press. 2020

2024-10-10 - L06: How? Advanced Visualization Techniques

Recommended reading (pre-class)

  • VAD Book Chapter 7: Arrange Tables
  • VAD Book Chapter 8: Arrange Spatial Data
  • VAD Book Chapter 9: Arrange Networks and Trees
  • Polaris: A System for Query, Analysis and Visualization of Multidimensional Relational Databases. Chris Stolte and Pat Hanrahan. Proceedings of IEEE InfoVis 2000. [research paper, intellectual foundation of the Tableau software]

In-class Agenda

Advanced visualization techniques for...

  • Multivariate Data
  • Networks & Graphs
  • Trees & Hierarchies
  • Time Series
  • Geographical Data
  • Other Data Types

Further Reading

Web-based overviews of techniques for...

2024-10-15 - L07: Data Wrangling and Interactive Preprocessing

Required reading (pre-class)

In-class Agenda

  • Aspect of Dirty Data - Identification and Curation
  • Data Transformations - Making data usable and useful
  • Visual Preprocessing - Examples of Vis tool usage in applications

2023-10-24 - L08: Introduction to Visual Analytics

Recommended reading (pre-class)

  • no entry

In-class Agenda

  • Introduction - About Patterns and Models
  • Knowledge Generation - Patterns, Models, and Analytical Reasoning
  • Data Transform. Processes - The InfoVis and the KDD Process
  • Humans and Machines - Strengths and Weaknesses
  • Visual Analytics - Synthesis

Further Reading

  • Book: Illuminating the Path: The Research and Development Agenda for Visual Analytics. Thomas, J. and Cook, K. National Visualization and Analytics Center, 2005
  • Paper:Visual Analytics: Definition, Process, and Challenges. D Keim, G Andrienko, JD Fekete, C Gorg, J Kohlhammer, G Melancon, 2008

2023-10-29 - L09: Unsupervised ML and Data Exploration

Recommended reading (pre-class)

  • VAD Book Chapter 13. Reduce Items and Attributes

In-class Agenda

  • Unsupervised ML - The two "Work Horses"
  • Clustering - Finding Groups in Datasets
  • Dimensionality Reduction - Reducing the Number of Attributes
  • Visual Data Exploration - Using Self-Organizing Maps (SOM)

Further Reading

  • Wang Q, Chen Z, Wang Y, Qu H. A Survey on ML4VIS: Applying Machine Learning Advances to Data Vis. TVCG. 2021
  • Sedlmair, Aupetit: Data-driven Evaluation of Visual Quality Measures. EuroVis. 2015
  • Bernard, Hutter, Zeppelzauer, Sedlmair, Munzner: ProSeCo: Visual Analysis of Class Separation Measures and Dataset Characteristics. G&C (2021)

2023-10-31 - L10: Supervised ML and Model Explanation

Recommended reading (pre-class)

  • Hohman F, Kahng M, Pienta R, Chau DH. Visual analytics in deep learning: An interrogative survey for the next frontiers. IEEE Transactions on Visualization and Computer Graphics (TVCG). 2018.

In-class Agenda

  • Supervised ML - The two "Work Horses"
  • Explainable AI Special - Why something happens in ML Models
  • Interactive ML Application - VIAL: Visual Interactive Data Labeling

Further Reading

  • Bernard J, Zeppelzauer M, Lehmann M, Müller M, Sedlmair M. Towards User‐Centered Active Learning Algorithms. Computer Graphics Forum. 2018.
  • Sacha D, Kraus M, Keim DA, Chen M. Vis4ml: An ontology for visual analytics assisted machine learning. IEEE transactions on visualization and computer graphics. 2018
  • Amershi S, Cakmak M, Knox WB, Kulesza T. Power to the people: The role of humans in interactive machine learning. AI Magazine. 2014

2023-11-12 - L11: Human-Centered Artificial Intelligence

In-class Agenda

  • Introduction to HC-AI
  • Ensuring human control while increasing automation
  • Human-AI collaboration challenges
  • "IVDA Supertools": Data Exploration, Model Explanation, and Knowledge Externalization
  • Live Demo: Real-world application on sustainability research: SDG Research Scout

Further Reading

  • B. Shneiderman: Human-centered artificial intelligence: Reliable, safe & trustworthy. International Journal of Human–Computer Interaction. 2020
  • J. Bernard, M. Zeppelzauer, M. Sedlmair, and W. Aigner: VIAL – A Unified Process for Visual-Interactive Labeling. The Visual Computer Journal (TVCJ), 2018

2023-11-19 - L12: Human Knowledge Externalization

In-class Agenda

  • Basics of human knowledge externalization
  • Formalization of (explicit) human feedback types
  • Human-centered data labeling
  • Live Application: Interactive Visual Labeling of Handwritten Digits
  • LLM-based "Knowledge Externalization

Further Reading

  • J Bernard, M Hutter, M Zeppelzauer, D Fellner, M Sedlmair: Comparing Visual-Interactive Labeling with Active Learning: An Experimental Study. IEEE Transactions on Visualization and Computer Graphics, 2018

  • J Bernard, M Zeppelzauer, M Lehmann, M Müller, M Sedlmair: Towards User-Centered Active Learning Algorithms. Computer Graphics Forum (CGF), 2018

  • A Visual Active Learning System for the Assessment of Patient Well-Being in Prostate Cancer Research. Bernard, J., Sessler, D., Bannach, A., May, T., Kohlhammer, J. IEEE VIS Workshop on Visual Analytics in Healthcare (VAHC), 2015

2023-11-26 - L13: Preference and Personalized Analytics

In-class Agenda

  • Introduction to Preference-Based and Personalized Analytics
  • Application & Example: Creation of a personalized music classifier
  • Application & Example: Creation of a preference-based item ranking

  • Application & Example: Creation of a similarity metric for countries

  • Application & Example: Personalized analytics of Type-1-diabetes

Further Reading

  • The human is the loop: new directions for visual analytics. Endert, A., Hossain, M. S., Ramakrishnan, N., North, C., Fiaux, P., Andrews, C. Journal of Intelligent Information Systems, 2014.

  • Brown ET, Liu J, Brodley CE, Chang R. Dis-function: Learning distance functions interactively. IEEE conference on visual analytics science and technology (VAST). 2012

  • User-Based Visual-Interactive Similarity Definition for Mixed Data Objects-Concept and First Implementation. Bernard, J., Sessler, D., Ruppert, T., Davey, J., Kuijper, A., Kohlhammer, J. WSCG. 2014

  • Personalized Visual-Interactive Music Classification. Christian Ritter, Christian Altenhofen, Matthias Zeppelzauer, Arjan Kuijper, Tobias Schreck, and Jürgen Bernard, EuroVA @ EuroVis (EuroGraphics), 2018