Center for Anytime Anywhere Analytics

Journal Paper

#64

@article{Choi2019,
  title={Visualizing for the Non‐Visual: Enabling the Visually Impaired to Use Visualization},
  author={Jinho Choi and Sanghun Jung and Deok Gun Park and Jaegul Choo and Niklas Elmqvist},
  url={http://users.umiacs.umd.edu/~elm/projects/vis4nonvisual/vis4nonvisual.pdf},
  year={2019},
  date={2019-06-01},
  journal={Computer Graphics Forum},
  volume={38},
  number={3},
  pages={249--260},
  abstract={The majority of visualizations on the web are still stored as raster images, making them inaccessible to visually impaired users. We propose a deep‐neural‐network‐based approach that automatically recognizes key elements in a visualization, including a visualization type, graphical elements, labels, legends, and most importantly, the original data conveyed in the visualization. We leverage such extracted information to provide visually impaired people with the reading of the extracted information. Based on interviews with visually impaired users, we built a Google Chrome extension designed to work with screen reader software to automatically decode charts on a webpage using our pipeline. We compared the performance of the back‐end algorithm with existing methods and evaluated the utility using qualitative feedback from visually impaired users.},
  keywords={},
}

Computer Graphics Forum • 2019

Visualizing for the Non‐Visual: Enabling the Visually Impaired to Use Visualization

Jinho Choi

Sanghun Jung

Deok Gun Park

Jaegul Choo

Niklas Elmqvist

The majority of visualizations on the web are still stored as raster images, making them inaccessible to visually impaired users. We propose a deep‐neural‐network‐based approach that automatically recognizes key elements in a visualization, including a visualization type, graphical elements, labels, legends, and most importantly, the original data conveyed in the visualization. We leverage such extracted information to provide visually impaired people with the reading of the extracted information. Based on interviews with visually impaired users, we built a Google Chrome extension designed to work with screen reader software to automatically decode charts on a webpage using our pipeline. We compared the performance of the back‐end algorithm with existing methods and evaluated the utility using qualitative feedback from visually impaired users.
Journal Paper

#52

@article{Park2017,
  title={ConceptVector: Text Visual Analytics via Interactive Lexicon Building using Word Embedding},
  author={Deok Gun Park and Seungyeon Kim and Jurim Lee and Jaegul Choo and Nicholas Diakopoulos and Niklas Elmqvist},
  url={http://www.umiacs.umd.edu/~elm/projects/conceptvector/conceptvector.pdf},
  year={2017},
  date={2017-10-01},
  journal={IEEE Transactions on Visualization and Computer Graphics},
  abstract={Central to many text analysis methods is the notion of a concept: a set of semantically related keywords characterizing a specific object, phenomenon, or theme. Advances in word embedding allow building such concepts from a small set of seed terms. However, naive application of such techniques may result in false positive errors because of the polysemy of human language. To mitigate this problem, we present a visual analytics system called ConceptVector that guides the user in building such concepts and then using them to analyze documents. Document-analysis case studies with real-world datasets demonstrate the fine-grained analysis provided by ConceptVector. To support the elaborate modeling of concepts using user seed terms, we introduce a bipolar concept model and support for irrelevant words. We validate the interactive lexicon building interface via a user study and expert reviews. The quantitative evaluation shows that the bipolar lexicon generated with our methods is comparable to human-generated ones.},
  keywords={},
}

IEEE Transactions on Visualization and Computer Graphics • 2017

ConceptVector: Text Visual Analytics via Interactive Lexicon Building using Word Embedding

Deok Gun Park

Seungyeon Kim

Jurim Lee

Jaegul Choo

Nicholas Diakopoulos

Niklas Elmqvist

Central to many text analysis methods is the notion of a concept: a set of semantically related keywords characterizing a specific object, phenomenon, or theme. Advances in word embedding allow building such concepts from a small set of seed terms. However, naive application of such techniques may result in false positive errors because of the polysemy of human language. To mitigate this problem, we present a visual analytics system called ConceptVector that guides the user in building such concepts and then using them to analyze documents. Document-analysis case studies with real-world datasets demonstrate the fine-grained analysis provided by ConceptVector. To support the elaborate modeling of concepts using user seed terms, we introduce a bipolar concept model and support for irrelevant words. We validate the interactive lexicon building interface via a user study and expert reviews. The quantitative evaluation shows that the bipolar lexicon generated with our methods is comparable to human-generated ones.
Journal Paper

#46

@article{Kim2017,
  title={TopicLens: Efficient Multi-Level Visual Topic Exploration of Large-Scale Document Collections},
  author={Minjeong Kim and Kyeongpil Kang and Deokgun Park and Jaegul Choo and Niklas Elmqvist},
  url={http://www.umiacs.umd.edu/~elm/projects/topiclens/topiclens.pdf},
  video={https://www.youtube.com/watch?v=RKC5w9dZmXQ},
  year={2016},
  date={2016-08-10},
  journal={IEEE Transactions on Visualization and Computer Graphics},
  volume={23},
  number={1},
  pages={151--160},
  abstract={Topic modeling, which reveals underlying topics of a document corpus, has been actively adopted in visual analytics for large-scale document collections. However, due to its significant processing time and non-interactive nature, topic modeling has so far not been tightly integrated into a visual analytics workflow. Instead, most such systems are limited to utilizing a fixed, initial set of topics. Motivated by this gap in the literature, we propose a novel interaction technique called TopicLens that allows a user to dynamically explore data through a lens interface where topic modeling and the corresponding 2D embedding are efficiently computed on the fly. To support this interaction in real time while maintaining view consistency, we propose a novel efficient topic modeling method and a semi-supervised 2D embedding algorithm. Our work is based on improving state-of-the-art methods such as nonnegative matrix factorization and t-distributed stochastic neighbor embedding. Furthermore, we have built a web-based visual analytics system integrated with TopicLens. We use this system to measure the performance and the visualization quality of our proposed methods. We provide several scenarios showcasing the capability of TopicLens using real-world datasets.},
  keywords={},
}

IEEE Transactions on Visualization and Computer Graphics • 2016

TopicLens: Efficient Multi-Level Visual Topic Exploration of Large-Scale Document Collections

Minjeong Kim

Kyeongpil Kang

Deokgun Park

Jaegul Choo

Niklas Elmqvist

Topic modeling, which reveals underlying topics of a document corpus, has been actively adopted in visual analytics for large-scale document collections. However, due to its significant processing time and non-interactive nature, topic modeling has so far not been tightly integrated into a visual analytics workflow. Instead, most such systems are limited to utilizing a fixed, initial set of topics. Motivated by this gap in the literature, we propose a novel interaction technique called TopicLens that allows a user to dynamically explore data through a lens interface where topic modeling and the corresponding 2D embedding are efficiently computed on the fly. To support this interaction in real time while maintaining view consistency, we propose a novel efficient topic modeling method and a semi-supervised 2D embedding algorithm. Our work is based on improving state-of-the-art methods such as nonnegative matrix factorization and t-distributed stochastic neighbor embedding. Furthermore, we have built a web-based visual analytics system integrated with TopicLens. We use this system to measure the performance and the visualization quality of our proposed methods. We provide several scenarios showcasing the capability of TopicLens using real-world datasets.

Publications coauthored by

Jaegul Choo

Visualizing for the Non‐Visual: Enabling the Visually Impaired to Use Visualization

ConceptVector: Text Visual Analytics via Interactive Lexicon Building using Word Embedding

TopicLens: Efficient Multi-Level Visual Topic Exploration of Large-Scale Document Collections