Sociodemographic data on live births children, Brazil, 2018-2022

Published: 2 July 2024| Version 3 | DOI: 10.17632/z3ychcthm2.3
Contributors:
,
,
,
,
,
,

Description

The dataset is an open data from the Sistema de Informação de Nascidos Vivos (SINASC), which is a system implemented by the Brazilian federal government in the 1990s, with the purpose of collecting data on all live births in the national territory. The system makes it possible to provide information on birth rates for all levels of the Brazilian health system, as well as the development of relevant indicators in the strategic planning of management to support the planning of actions, activities, public policies and programs aimed at health. The dataset is related to three years (2018, 2019, 2020, 2021 and 2022) of SINASC referring only to the state of Pernambuco, and it is composed of routine prenatal data, gestational history, sociodemographic data and data of newborns. born, including their weight.

Files

Steps to reproduce

Data were extracted only from the state of Pernambuco, from 2018 to 2022, resulting in a dataset with 526,368 records and 61 attributes from the SINAN (Sistema de Informação de Agravos de Notificação). To prepare the dataset, the records where the target attribute 'CLASSE' resulted in Term or Preterm and empty values. Additionally, all attributes that contained more than 70% empty data, attributes related to postpartum, duplicated attributes, attributes that represented geographic environment codes, and date-type attributes were discarded. The target attribute that was in grams was also modified to become binary.

Institutions

Universidade Federal de Pernambuco, Secretaria de Saude do Estado de Pernambuco, Universidade de Pernambuco

Categories

Brazil, Premature Birth, Prematurity, Database

Licence