DeepEye: An Automatic Big Data Visualization Framework

Data visualization transforms data into images to aid the understanding of data; therefore, it is an invaluable tool for explaining the significance of data to visually inclined people. Given a (big) dataset, the essential task of visualization is to visualize the data to tell compelling stories by...

Full description

Saved in:
Bibliographic Details
Main Authors: Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li
Format: Article
Language:English
Published: Tsinghua University Press 2018-03-01
Series:Big Data Mining and Analytics
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/BDMA.2018.9020007
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832572800335872000
author Xuedi Qin
Yuyu Luo
Nan Tang
Guoliang Li
author_facet Xuedi Qin
Yuyu Luo
Nan Tang
Guoliang Li
author_sort Xuedi Qin
collection DOAJ
description Data visualization transforms data into images to aid the understanding of data; therefore, it is an invaluable tool for explaining the significance of data to visually inclined people. Given a (big) dataset, the essential task of visualization is to visualize the data to tell compelling stories by selecting, filtering, and transforming the data, and picking the right visualization type such as bar charts or line charts. Our ultimate goal is to automate this task that currently requires heavy user intervention in the existing visualization systems. An evolutionized system in the field faces the following three main challenges: (1) Visualization verification: to determine whether a visualization for a given dataset is interesting, from the viewpoint of human understanding; (2) Visualization search space: a "boring" dataset may become interesting after an arbitrary combination of operations such as selections, joins, and aggregations, among others; (3) On-time responses: do not deplete the user’s patience. In this paper, we present the DeepEye system to address these challenges. This system solves the first challenge by training a binary classifier to decide whether a particular visualization is good for a given dataset, and by using a supervised learning to rank model to rank the above good visualizations. It also considers popular visualization operations, such as grouping and binning, which can manipulate the data, and this will determine the search space. Our proposed system tackles the third challenge by incorporating database optimization techniques for sharing computations and pruning.
format Article
id doaj-art-42defbc30c1b4d6cac474fdfce448647
institution Kabale University
issn 2096-0654
language English
publishDate 2018-03-01
publisher Tsinghua University Press
record_format Article
series Big Data Mining and Analytics
spelling doaj-art-42defbc30c1b4d6cac474fdfce4486472025-02-02T06:49:44ZengTsinghua University PressBig Data Mining and Analytics2096-06542018-03-0111758210.26599/BDMA.2018.9020007DeepEye: An Automatic Big Data Visualization FrameworkXuedi Qin0Yuyu Luo1Nan Tang2Guoliang Li3<institution content-type="dept">Department of Computer Science</institution>, <institution>Tsinghua University</institution>, <city>Beijing</city> <postal-code>100084</postal-code>, <country>China</country>.<institution content-type="dept">Department of Computer Science</institution>, <institution>Tsinghua University</institution>, <city>Beijing</city> <postal-code>100084</postal-code>, <country>China</country>.<institution content-type="dept">Qatar Computing Research Institute</institution>, <institution>HBKU</institution>, <country>Qatar</country>.<institution content-type="dept">Department of Computer Science</institution>, <institution>Tsinghua University</institution>, <city>Beijing</city> <postal-code>100084</postal-code>, <country>China</country>.Data visualization transforms data into images to aid the understanding of data; therefore, it is an invaluable tool for explaining the significance of data to visually inclined people. Given a (big) dataset, the essential task of visualization is to visualize the data to tell compelling stories by selecting, filtering, and transforming the data, and picking the right visualization type such as bar charts or line charts. Our ultimate goal is to automate this task that currently requires heavy user intervention in the existing visualization systems. An evolutionized system in the field faces the following three main challenges: (1) Visualization verification: to determine whether a visualization for a given dataset is interesting, from the viewpoint of human understanding; (2) Visualization search space: a "boring" dataset may become interesting after an arbitrary combination of operations such as selections, joins, and aggregations, among others; (3) On-time responses: do not deplete the user’s patience. In this paper, we present the DeepEye system to address these challenges. This system solves the first challenge by training a binary classifier to decide whether a particular visualization is good for a given dataset, and by using a supervised learning to rank model to rank the above good visualizations. It also considers popular visualization operations, such as grouping and binning, which can manipulate the data, and this will determine the search space. Our proposed system tackles the third challenge by incorporating database optimization techniques for sharing computations and pruning.https://www.sciopen.com/article/10.26599/BDMA.2018.9020007big dataautomatic data visualizationvisualization verificationvisualization rankingvisualization search space
spellingShingle Xuedi Qin
Yuyu Luo
Nan Tang
Guoliang Li
DeepEye: An Automatic Big Data Visualization Framework
Big Data Mining and Analytics
big data
automatic data visualization
visualization verification
visualization ranking
visualization search space
title DeepEye: An Automatic Big Data Visualization Framework
title_full DeepEye: An Automatic Big Data Visualization Framework
title_fullStr DeepEye: An Automatic Big Data Visualization Framework
title_full_unstemmed DeepEye: An Automatic Big Data Visualization Framework
title_short DeepEye: An Automatic Big Data Visualization Framework
title_sort deepeye an automatic big data visualization framework
topic big data
automatic data visualization
visualization verification
visualization ranking
visualization search space
url https://www.sciopen.com/article/10.26599/BDMA.2018.9020007
work_keys_str_mv AT xuediqin deepeyeanautomaticbigdatavisualizationframework
AT yuyuluo deepeyeanautomaticbigdatavisualizationframework
AT nantang deepeyeanautomaticbigdatavisualizationframework
AT guoliangli deepeyeanautomaticbigdatavisualizationframework