Mastering data visualization with Python: practical tips for researchers

Il-Youp Kwak Soyul Han

Abstrak

Big data have revolutionized the way data are processed and used across all fields. In the past, research was primarily conducted with a focus on hypothesis confirmation using sample data. However, in the era of big data, this has shifted to gaining insights from the collected data. Visualizing vast amounts of data to derive insights is crucial. For instance, leveraging big data for visualization can help identify and predict characteristics and patterns related to various infectious diseases. When data are presented in a visual format, patterns within the data become clear, making it easier to comprehend and provide deeper insights. This study aimed to comprehensively discuss data visualization and the various techniques used in the process. It also sought to enable researchers to directly use Python programs for data visualization. By providing practical visualization exercises on GitHub, this study aimed to facilitate their application in research endeavors.

Artikel Ilmiah Terkait

Research on Python Data Visualization Technology

Yunhan Zeng Shangru Yang Shengjia Cao + 1 lainnya

1 Januari 2021

In recent years, researchers at home and abroad have accumulated a lot of experience in the research of data visualization technology, and they have played animportant role in scientific discovery, medical diagnosis, business decision-making, and engineering applications. As a library developed using Python language, Matplotlib has a concise language, high drawing accuracy, and simple and easy-to-understand code. This article first introduces data visualization and related technologies used and then uses Python’s Matplotlib library and pyecharts library to realize data visualization. Through representative examples, combined with the use of correct charts, visual processing of data in different fields, so as to further analyze the effect of visualization.

Big Data Visualization and Visual Analytics of COVID-19 Data

Yan Wen C. Leung Calvin S. H. Hoi + 3 lainnya

1 September 2020

In the current era of big data, a huge amount of data has been generated and collected from a wide variety of rich data sources. Embedded in these big data are useful information and valuable knowledge. An example is healthcare and epidemiological data such as data related to patients who suffered from epidemic diseases like the coronavirus disease 2019 (COVID-19). Knowledge discovered from these epidemiological data helps researchers, epidemiologists and policy makers to get a better understanding of the disease, which may inspire them to come up ways to detect, control and combat the disease. As “a picture is worth a thousand words”, having methods to visualize and visually analyze these big data makes it easily to comprehend the data and the discovered knowledge. In this paper, we present a big data visualization and visual analytics tool for visualizing and analyzing COVID-19 epidemiological data. The tool helps users to get a better understanding of information about the confirmed cases of COVID-19. Although this tool is designed for visualization and visual analytics of epidemiological data, it is applicable to visualization and visual analytics of big data from many other real-life applications and services.

Seaborn: Statistical Data Visualization

Michael L. Waskom

2021

seaborn is a library for making statistical graphics in Python. It provides a high-level interface to matplotlib and integrates closely with pandas data structures. Functions in the seaborn library expose a declarative, dataset-oriented API that makes it easy to translate questions about data into graphics that can answer them. When given a dataset and a specification of the plot to make, seaborn automatically maps the data values to visual attributes such as color, size, or style, internally computes statistical transformations, and decorates the plot with informative axis labels and a legend. Many seaborn functions can generate figures with multiple panels that elicit comparisons between conditional subsets of data or across different pairings of variables in a dataset. seaborn is designed to be useful throughout the lifecycle of a scientific project. By producing complete graphics from a single function call with minimal arguments, seaborn facilitates rapid prototyping and exploratory data analysis. And by offering extensive options for customization, along with exposing the underlying matplotlib objects, it can be used to create polished, publication-quality figures.

Need for Interactive Data Visualization in Public Health Practice: Examples from India

MSiva Durga Prasad Nayak KA Narayan

1 Januari 2021

The world is full of data which is increasing by leaps and bounds. In health care, big data is becoming common with increased electronic health data accumulation and/or accessibility to public data previously held under lock and key. At the same time, health data visualization applications have become popular over recent years. Against this background, a review was done to summarize the application of data visualization in public health & the challenges faced. Peer-reviewed original research articles and review articles searched in Google Scholar and Pubmed databases that were indexed in the last ten years period, using the keywords “Big data” or “data visualization” or “Interactive visualization techniques.” Other related information in books, blogs, and published documents were searched in Google search engine using the same keywords. Contents from the downloaded documents were presented and discussed under three headings viz. (a) the visualizations that are still current and how they have evolved further, (b) tools or methods that can be used by end-users to make their own modifications, (c) the platforms to disseminate them. Usage of different plots in public health is explained with suitable examples using the data from public health datasets. From the discussion it can be understood that when big data is visualized well, it can identify implementation gaps and disparities and accelerate implementation strategies to reach the population groups in most need for interventions. As health administrator may come from diverse specialties, robust training and career development for big data in public health is the need of the hour.

Cheat Sheets for Data Visualization Techniques

Zezhong Wang Lovisa Sundin Dave Murray-Rust + 1 lainnya

18 Januari 2020

This paper introduces the concept of 'cheat sheets' for data visualization techniques, a set of concise graphical explanations and textual annotations inspired by infographics, data comics, and cheat sheets in other domains. Cheat sheets aim to address the increasing need for accessible material that supports a wide audience in understanding data visualization techniques, their use, their fallacies and so forth. We have carried out an iterative design process with practitioners, teachers and students of data science and visualization, resulting six types of cheat sheet (anatomy, construction, visual patterns, pitfalls, false-friends and well-known relatives) for six types of visualization, and formats for presentation. We assess these with a qualitative user study using 11 participants that demonstrates the readability and usefulness of our cheat sheets.

Daftar Referensi

0 referensi

Tidak ada referensi ditemukan.

Artikel yang Mensitasi

0 sitasi

Tidak ada artikel yang mensitasi.