Water quality data from the Cesar River, Colombia
The water quality data of the Cesar River for this study are constituted by historical information recorded between 2004 and 2020. The first group of data comes from official sources (stations 1-6) and corresponds to physicochemical parameters measured by the Institute of Hydrology, Meteorology, and Environmental Studies (IDEAM). The second group (PM 7-30) corresponds to grey literature from documents of water resource management plans, watershed management, public services, and scientific research that describe the water quality conditions of the river (Nuñez & Reyes, 2012; Gutierrez et al., 2014; Madera et al., 2016). The structure of the database was designed under a hierarchical metadata system with a relational schema (RDBMS) which is normalized to the 3NF form, that is, in addition to being of the 2NF form, there is no transitive dependence between the attributes that are not key. The database obeys nine relational tables describing the type, primary attribute, and derived or secondary information. Raw data are included with information on physicochemical parameters, location, municipalities, surface current, region and hydro region, years reported and the source of the information, organized hierarchically. The metadata was created on the PostgreSQL platform and visualized in the QGIS environment. Also, for the queries of prop data, an open-source code was developed that encodes the meta-of the parameters or stations of interest.
Steps to reproduce
Cesar_river.tar: Comprises a backup of the database developed in the PostgreSQL open-source object-oriented relational database management system. ADG_1.txt;ADG_2.txt;ADG_3.txt;Data.txt;DocID.txt;HDG.txt;RepID.txt;RepSHP.txt;RepTY.txt;SDG.txt: Raw data from the tables relationships comprising the database in PostgreSQL. Database_Diagram.png: Database diagram on water quality of the Cesar River. The primary attributes are the unique elements per table, in this case, they are represented by the black key; secondary information is identified by green keys; and the rest of the information is derived