IEEE Trans Vis Comput Graph. 2023 Jan;29(1):570-580.
doi: 10.1109/TVCG.2022.3209407. Epub 2022 Dec 21.

GenoREC: A Recommendation System for Interactive Genomics Data Visualization

Aditeya Pandey et al. IEEE Trans Vis Comput Graph. 2023 Jan.

Abstract

Interpretation of genomics data is critically reliant on the application of a wide range of visualization tools. The large number of visualization techniques for genomics data and the variety of analysis tasks pose a significant challenge for analysts: which visualization technique is most likely to help them generate insights into their data? Since genomics analysts typically have limited training in data visualization, their choices are often based on trial and error or guided by technical details, such as the data formats that a specific tool can load. This approach prevents them from making effective visualization choices for the many combinations of data types and analysis questions they encounter in their work. Visualization recommendation systems assist non-experts in creating data visualizations by recommending appropriate visualizations based on the data and task characteristics. However, existing visualization recommendation systems are not designed to handle domain-specific problems. To address these challenges, we designed GenoREC, a novel visualization recommendation system for genomics. GenoREC enables genomics analysts to select effective visualizations based on a description of their data and analysis tasks. Here, we present the recommendation model, which uses a knowledge-based method for choosing appropriate visualizations, and a web application that enables analysts to input their requirements, explore recommended visualizations, and export them for their own use. Furthermore, we present the results of two user studies demonstrating that GenoREC recommends visualizations that are both accepted by domain experts and suited to address the given genomics analysis problem. All supplemental materials are available at https://osf.io/y73pt/.

Figures

Fig. 1.
Top: GenoREC maps data and task specifications (A) to appropriate visualizations. In this figure, the knowledge-based recommendation (B) shows the component-wise model of GenoREC and the subsequent decisions made at each step. Based on the recommendation model, GenoREC generates and recommends an appropriate visualization to the user (C). Through the recommendation, GenoREC allows the user to avoid a wide range of similar but sub-optimal visualization options (D) given the data and task. Bottom: An overview of GenoREC’s system components and their interactions to generate output visualizations.
Fig. 2.
Visual overview of a genome and genomic features: point, segment, sparse, and contiguous.
Fig. 3.
Progression of GenoREC’s recommendation. In A, the user has specified a BED file and a VCF file with categorical data, and GenoREC recommends linear and circular layouts. In B, the user adds a BIGWIG file with quantitative data, and GenoREC switches to a linear recommendation.
Fig. 4.
User Interface of GenoREC. The interface consists of two main panels. Left: a domain-centric data and task elicitation interface. The data and task specification panel contains two parts: data description and tasks. Right: a visualization gallery that displays recommended visualizations.
Fig. 5.
Decision Matrix for the Encoding component (C1). Rows represent the visual encodings, and columns represent the factors that affect the encoding selection. A marked cell in the matrix represents a value of 1, which means the encoding supports the factor, and an empty cell represents a value of −1, which means the factor is not supported.
Fig. 6.
GenoREC’s recommendation creation process. Each component in the model identifies the most suitable output option based on the previous stage and the underlying recommendation model. Outputs for components (A–F) are explained in Sect. 6.1.3 (Recommendation walk through).
Fig. 7.
Utility ratings for GenoREC and the alternate stimulus across all participants and scenarios. Scenarios where GenoREC’s ratings were significantly higher than those of the alternate stimulus are marked with *.
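
The decision matrix described in the Fig. 5 caption (1 where an encoding supports a factor, −1 where it does not) can be illustrated with a small sketch. The encoding names, factor names, and matrix values below are hypothetical, and summing the entries over the factors present in the input is an assumed aggregation rule chosen for illustration; it is not confirmed as GenoREC's actual scoring method.

```python
# Hypothetical factors (columns); not taken from the paper.
FACTORS = ["quantitative", "categorical", "sparse", "contiguous"]

# Hypothetical encodings (rows).
# 1 = the encoding supports the factor, -1 = it does not.
DECISION_MATRIX = {
    "bar":     [1, -1, 1, 1],
    "heatmap": [1, 1, -1, 1],
    "point":   [1, 1, 1, -1],
}


def score_encodings(active_factors):
    """Sum each encoding's matrix entries over the factors present in the input.

    Assumed aggregation rule, used only to illustrate the matrix lookup.
    """
    columns = [FACTORS.index(f) for f in active_factors]
    return {
        name: sum(row[c] for c in columns)
        for name, row in DECISION_MATRIX.items()
    }


def recommend(active_factors):
    """Return every encoding that ties for the highest score, sorted by name."""
    scores = score_encodings(active_factors)
    best = max(scores.values())
    return sorted(name for name, s in scores.items() if s == best)


if __name__ == "__main__":
    print(score_encodings(["quantitative", "sparse"]))
    print(recommend(["categorical", "contiguous"]))
```

Returning all tied encodings rather than a single winner mirrors the Fig. 3 caption, where more than one layout can be recommended for the same input.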
