COnVIDa - COVID19 data monitoring in Spain

Introduction

COnVIDa is a tool developed by the Cybersecurity and Data Science Laboratory at the University of Murcia (Spain) that makes it easy to gather data related to the COVID19 pandemic in Spain from different data sources and visualize them in a graph. Contact us at convida@listas.um.es.

COnVIDa - Overview

How to use COnVIDa

To use this tool, we first select the date range for which we want to collect data.

Date selection

Then we will select the Autonomous Communities and/or Provinces of Spain that interest us.

Region selection

Finally, we simply select the data items we are interested in within each data source (COVID19, INE, Mobility, MoMo and AEMET), and all selected data will automatically be displayed in the main temporal and regional graphs, as well as in their respective summary tables. Note that when you move the mouse over a data item, a description of that item is displayed.

Data sources selection

COnVIDa offers two types of data visualisation: temporal and regional. In the temporal display (make sure that the panel is activated), the graph shows the daily values of those temporal data items for which information is available (the statistical data from the INE do not make sense here). For example, suppose we select COVID19 cases, smoking rates, mobility in parks, observed deaths and insolation; in Murcia, Madrid, Cuenca, Granada and Spain as a whole; from 21/02/2020 to 21/01/2021. The X-axis will then be divided into the days between these two dates, while the Y-axis will show the selected types of data for these geographic locations. Because the data may have different scales, some variables may dwarf others in the overlay, but the graph can be explored interactively in detail from the upper right-hand corner.
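As a rough illustration of how the X-axis of the temporal display is laid out, the following sketch builds one entry per day between the two dates of the example. The function name `daily_axis` and the date format are assumptions made for this example; this is not COnVIDa's own code.

```python
from datetime import datetime, timedelta

def daily_axis(start, end, fmt="%d/%m/%Y"):
    """Return every calendar day from start to end, both included."""
    first = datetime.strptime(start, fmt).date()
    last = datetime.strptime(end, fmt).date()
    # One point per day, so the number of points is the day difference plus one
    return [first + timedelta(days=i) for i in range((last - first).days + 1)]

# Dates taken from the example above (day/month/year)
days = daily_axis("21/02/2020", "21/01/2021")
print(len(days), days[0].isoformat(), days[-1].isoformat())
```

Each selected data item would then contribute one value per day on this axis, wherever the data source has a value for that day.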

Line graph with linear scale

It is also possible to change the type of graph, choosing between a line graph and a bar graph.

Bar chart with linear scale

On the other hand, the regional display is subdivided into two panels. On the left, the data are grouped by selected regions and aggregated into boxplots (taking into account the data series for the selected time range). Once the data is plotted, it is possible to easily change the scale of the graph, either linear or logarithmic. The logarithmic scale is useful for simultaneously displaying data series with different orders of magnitude. On the right, a national map is displayed showing the selected regions whose statistical data can be directly compared. Only one type of geographical granularity (the whole country, autonomous communities, or provinces), one measure (the mean, maximum, minimum, or principal percentiles), and one variable can be represented on the map at a time.
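To see why the logarithmic scale helps, here is a minimal sketch with made-up numbers (the values and the name `log10_series` are assumptions for illustration only). It compares the range that the axis has to cover on a linear scale and on a base-10 logarithmic scale when two series differ by several orders of magnitude.

```python
import math

# Hypothetical values: one series in the thousands, another below 1
series = {"cases": [1200, 3400, 5600], "rate": [0.2, 0.5, 0.9]}

def log10_series(values):
    """Map positive values to log10; zero or negative values have no logarithm."""
    return [math.log10(v) if v > 0 else None for v in values]

log_series = {name: log10_series(vals) for name, vals in series.items()}

# Range the Y-axis must cover to show both series at once
linear_span = max(series["cases"]) - min(series["rate"])
log_span = max(log_series["cases"]) - min(log_series["rate"])
print(linear_span, log_span)
```

On the linear axis the small series is squeezed into a tiny fraction of the range, while on the logarithmic axis both series occupy a comparable share of it.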

Region map

Finally, each summary table shows, as the name suggests, a statistical summary of each of the selected data items, including: a count of the data, the arithmetic mean of the data, the standard deviation, the minimum, the 25th percentile, the median, the 75th percentile and the maximum value of the series.
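The eight statistics listed above can be reproduced with Python's standard library. This is only a sketch under stated assumptions: the helper `summarize`, the sample values and the choice of linear interpolation for the percentiles are ours, and the tool itself may compute them differently.

```python
import statistics

def summarize(series):
    """Compute the summary statistics of a series, ignoring missing values (None).

    Needs at least two values, because the sample standard deviation
    is undefined for a single point.
    """
    values = sorted(v for v in series if v is not None)
    # "inclusive" interpolates linearly between the observed data points
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "std": statistics.stdev(values),
        "min": values[0],
        "25%": q1,
        "50%": median,
        "75%": q3,
        "max": values[-1],
    }

# Hypothetical daily values with one missing day
daily_cases = [10, 20, None, 30, 40, 50]
summary = summarize(daily_cases)
print(summary)
```

Note that the count only includes the days for which a value exists, which is why it can be lower than the number of days in the selected range.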

Summary table

As can be seen, two buttons are offered to download either all the data collected according to the criteria specified by the user or the summary table. COnVIDa offers the possibility to download either of these two data tables in CSV, XLS, JSON and HTML formats.
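For readers who want to post-process a download, the sketch below shows what the same table looks like in two of the offered formats, CSV and JSON, using only the standard library. The rows and column names are hypothetical and do not reflect the actual layout of COnVIDa's files.

```python
import csv
import io
import json

# Hypothetical rows; the column names are illustrative only
rows = [
    {"date": "2020-02-21", "region": "Murcia", "cases": 0},
    {"date": "2020-02-22", "region": "Murcia", "cases": 1},
]

def to_csv(records):
    """Serialize a list of dictionaries as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(records[0].keys()))
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()

def to_json(records):
    """Serialize the same records as a JSON array of objects."""
    return json.dumps(records, indent=2)

csv_text = to_csv(rows)
json_text = to_json(rows)
print(csv_text)
print(json_text)
```

CSV is convenient for spreadsheets, while JSON keeps the value types (numbers stay numbers) when the data is loaded back into a program.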

Download COnVIDa data

Data sources

The current version of COnVIDa includes five data sources related to the COVID19 pandemic in Spain: COVID19, INE, Mobility, MoMo and AEMET.

As noted above, moving the mouse over a data item automatically displays a description of that item.

Source code

COnVIDa has been developed from its very conception as an Open Science project, with the aim and spirit of serving and assisting anyone who might need it in the context of the COVID19 pandemic in Spain. Accordingly, all of the project's source code is publicly accessible in the following repository, which also includes a developer manual:

https://github.com/CyberDataLab/COnVIDa

Limitations

COnVIDa was born in the Cybersecurity and Data Science Laboratory of the University of Murcia (CyberDataLab) as a selfless response to the critical situation generated by the pandemic. Despite the effort and technical capabilities invested, the project has limitations, such as its dependence on external sources to collect data (which may fail or return invalid values), small bugs in the web page, and certain imperfections in the visualisation of the data.

References

Enrique Tomás Martínez Beltrán, Mario Quiles Pérez, Javier Pastor-Galindo, Pantaleone Nespoli, Félix Jesús García Clemente and Félix Gómez Mármol. COnVIDa: COVID-19 multidisciplinary data collection and dashboard. Journal of Biomedical Informatics, March 2021.


I will be updating the blog post as improvements are made to the tool. Thank you for your time and attention. Feel free to contact me with any questions or suggestions.

Enrique Tomás