Analyzing the Training Processes of Deep Generative Models

Liu, Mengchen<sup>*</sup>; Shi, Jiaxin; Cao, Kelei; Zhu, Jun; Liu, Shixia

doi:10.1109/TVCG.2017.2744938

Abstract

Among the many types of deep models, deep generative models (DGMs) provide a solution to the important problem of unsupervised and semi-supervised learning. However, training DGMs requires more skill, experience, and know-how because their training is more complex than that of other types of deep models such as convolutional neural networks (CNNs). We develop a visual analytics approach for better understanding and diagnosing the training process of a DGM. To help experts understand the overall training process, we first extract a large amount of time series data that represents training dynamics (e.g., activation changes over time). A blue-noise polyline sampling scheme is then introduced to select time series samples, which can both preserve outliers and reduce visual clutter. To further investigate the root cause of a failed training process, we propose a credit assignment algorithm that indicates how other neurons contribute to the output of the neuron causing the training failure. Two case studies are conducted with machine learning experts to demonstrate how our approach helps understand and diagnose the training processes of DGMs. We also show how our approach can be directly used to analyze other types of deep models such as CNNs.

Publication date: 2018-01
Affiliation: Tsinghua University


Cited by: 103

Last updated: 2024-04-18 20:00
