Brazilian Research Output in Education from 2007 to 2016

Published: 06-07-2020| Version 1 | DOI: 10.17632/f3j5dptdrv.1
Eliseo Reategui,
Alause Pires


This dataset has been used to analyze the publication practices of Brazilian researchers in Education over three periods: [2007, 2009], [2010, 2012], and [2013, 2016]. It comprises 4249 records, each describing a scientific journal in which faculty and students of Brazilian graduate programs in Education have published their work. Each record also stores the number of articles published by the Brazilian Education community in that journal and the journal's citation impact in a given period. The indicator chosen to represent the journal's citation impact is Elsevier's SNIP. The records also store information from Qualis, a Brazilian journal ranking system maintained by the Coordination for the Improvement of Higher Education Personnel (CAPES), a governmental institution whose purpose is to coordinate efforts to improve the quality of faculty and staff in higher education. The dataset covers publications by Brazilian researchers in Education in journals that were indexed by SCOPUS as well as in journals that were not.

The research questions answered with the analysis of the data were:
1) What was the distribution of publications in indexed and non-indexed journals in the three periods [2007, 2009], [2010, 2012], [2013, 2016]?
2) What was the distribution of publications according to the country of publication of the journals?
3) What was the citation impact of the publications in indexed journals in the three different periods?
4) Were there discrepancies between the classification of journals in the CAPES Qualis system and the citation impact given by the SNIP indicator?

It is important to highlight that the three periods considered in the dataset do not have the same number of years. This is because CAPES previously collected information for its evaluation of graduate programs every three years ([2007, 2009], [2010, 2012]), but changed to a four-year cycle in the last period ([2013, 2016]).

The final structure of the records of the dataset is detailed below.
- Publication period: year interval in which the articles were published: 2007-2009; 2010-2012; 2013-2016
- ISSN_PRINT: unique identifier of the printed version of the journal in which the articles were published
- ISSN_ONLINE: unique identifier of the online version of the journal in which the articles were published
- Title: journal title
- Number of articles: total number of articles published in the journal in the given period
- SNIP: Elsevier SNIP citation impact computed for the journal in the given period
- Qualis: classification rank of the journal in the given period, with values ranging (from highest to lowest) over: A1, A2, B1, B2, B3, B4, B5
- Country: country of publication of the journal
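
As an illustration of how records with this structure could be used for the first research question, the following is a minimal Python sketch, not the authors' analysis code. The class name `JournalRecord`, its attribute names, and the function `articles_by_indexing` are hypothetical. The sketch also assumes that a journal without a SNIP value is stored with an empty SNIP field and can be treated as not indexed, which the description above does not state explicitly.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class JournalRecord:
    """One dataset record; attribute names are illustrative, not official."""
    period: str                  # "2007-2009", "2010-2012" or "2013-2016"
    issn_print: Optional[str]
    issn_online: Optional[str]
    title: str
    n_articles: int
    snip: Optional[float]        # ASSUMPTION: None means no SNIP value is available
    qualis: str                  # A1, A2, B1, B2, B3, B4 or B5
    country: str


def articles_by_indexing(records):
    """Sum article counts per period, split by whether a SNIP value is present."""
    totals = defaultdict(lambda: {"with_snip": 0, "without_snip": 0})
    for r in records:
        key = "with_snip" if r.snip is not None else "without_snip"
        totals[r.period][key] += r.n_articles
    return dict(totals)


# Made-up records, for illustration only (not taken from the dataset).
sample = [
    JournalRecord("2007-2009", "1111-1111", None, "Journal A", 10, 0.8, "A1", "Brazil"),
    JournalRecord("2007-2009", None, "2222-2222", "Journal B", 5, None, "B2", "Brazil"),
    JournalRecord("2013-2016", "1111-1111", None, "Journal A", 7, 1.1, "A1", "Brazil"),
]
summary = articles_by_indexing(sample)
```

Because the periods span three or four years, totals such as these are not directly comparable across periods without normalizing by the number of years.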


Steps to reproduce

The dataset was assembled in the following steps:

1) CAPES provided an initial spreadsheet with the following data (the details of each field have been given in the description of the dataset above):
- Publication period
- ISSN
- Title
- Number of articles
- Qualis
- Country
Although CAPES provided us with this data on demand, the data collected by the institution is public and can be accessed from its website. At the time of the publication of this dataset, the data about the publications of Brazilian researchers in all fields of knowledge could be found in the following link:

2) The ISSNs of the journals appeared in a single column, which meant that the ONLINE and PRINTED versions of the same journal could appear in different entries when they in fact represented the same journal. Thus, we started by deduplicating the dataset to eliminate journals that had two entries for the same publication period. All the information from the journals was kept the same, apart from the Number of articles published, whose values were added together. We identified the origin of each ISSN (PRINTED or ONLINE) and replaced the ISSN field with two new fields: ISSN_PRINTED and ISSN_ONLINE.

3) As the C classification in the Qualis ranking system implies that the publication is not a scientific journal, all the records classified as C were eliminated from the dataset.

4) SNIP data was downloaded from Elsevier's website and matched with the ISSN information we had from the CAPES dataset. For the first period [2007, 2009] we used the 2010 SNIP values; for the second period [2010, 2012] we used the 2013 SNIP values; and for the third period [2013, 2016] we used the 2016 SNIP values. The name of the file downloaded from Elsevier's website was SNIP_IPP_SJR_complete_1999_2016 retrieved from:

5) In the original CAPES dataset, the Country field was not provided for some of the journals. In these cases, we manually searched for the country information of these journals in the ISSN portal (
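
Steps 2 to 4 can be sketched in Python as follows. This is an illustration under stated assumptions, not the procedure actually used: all function names, dictionary keys, and the shapes of `issn_kind` and `snip_table` are hypothetical, and the sketch assumes that duplicate entries are matched on publication period and journal title, which the text does not specify.

```python
# SNIP year used for each publication period, as stated in step 4.
SNIP_YEAR = {"2007-2009": 2010, "2010-2012": 2013, "2013-2016": 2016}


def merge_duplicates(rows, issn_kind):
    """Step 2: merge rows describing the same journal in the same period.

    ASSUMPTION: rows are matched on (period, title); the matching key is not
    given in the text. `issn_kind` maps each ISSN to "print" or "online".
    """
    merged = {}
    for row in rows:
        key = (row["period"], row["title"])
        rec = merged.setdefault(key, {
            "period": row["period"], "title": row["title"],
            "issn_print": None, "issn_online": None,
            "n_articles": 0, "qualis": row["qualis"], "country": row["country"],
        })
        rec["n_articles"] += row["n_articles"]  # article counts are added
        rec["issn_" + issn_kind[row["issn"]]] = row["issn"]
    return list(merged.values())


def drop_non_journals(records):
    """Step 3: remove records classified as C in Qualis."""
    return [r for r in records if r["qualis"] != "C"]


def attach_snip(records, snip_table):
    """Step 4: look up SNIP by either ISSN for the year tied to each period.

    `snip_table` maps (issn, year) to a SNIP value; unmatched records get None.
    """
    for r in records:
        year = SNIP_YEAR[r["period"]]
        r["snip"] = None
        for issn in (r["issn_print"], r["issn_online"]):
            if issn is not None and (issn, year) in snip_table:
                r["snip"] = snip_table[(issn, year)]
                break
    return records


# Made-up input, for illustration only.
rows = [
    {"period": "2007-2009", "issn": "1111-1111", "title": "Journal A",
     "n_articles": 4, "qualis": "A1", "country": "Brazil"},
    {"period": "2007-2009", "issn": "2222-2222", "title": "Journal A",
     "n_articles": 6, "qualis": "A1", "country": "Brazil"},
    {"period": "2007-2009", "issn": "3333-3333", "title": "Magazine C",
     "n_articles": 2, "qualis": "C", "country": "Brazil"},
]
issn_kind = {"1111-1111": "print", "2222-2222": "online", "3333-3333": "print"}
snip_table = {("2222-2222", 2010): 0.75}

result = attach_snip(drop_non_journals(merge_duplicates(rows, issn_kind)), snip_table)
```

In this toy input, the two entries for "Journal A" collapse into one record with both ISSNs, the C-rated entry is dropped, and the SNIP value is found through the online ISSN. Matching on either ISSN matters because a journal may be listed under only one of them in the SNIP source.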