Data for "Comprehensive mapping of local and diaspora scientists for evaluation, planning and policy purposes: a database and analysis of 63951 Greek scientists"

Published: 13-01-2021| Version 1 | DOI: 10.17632/zbyctscmbn.1
Contributors:
,
Chara Koutsioumpa,
Angeliki Vakka,
Georgios Agoranos,
Chrysanthi Mantsiou,
Maria Kyriaki Drekolia,
Nikos Avramidis,
Despina G. Contopoulos-Ioannidis,
Konstantinos Drosatos,

Description

Research policy and planning for a given country may benefit from reliable data on both its scientific workforce as well as the diaspora of scientists for countries with substantial brain drain. Here we use a systematic approach using Scopus to generate a comprehensive country-level database of all scientists in Greece. Moreover, we expand that database to include also Greek diaspora scientists. The database that we have compiled includes 63951 scientists who have published at least 5 papers indexed in Scopus. Of those, 35116 have an affiliation in Greece. We validate the sensitivity and specificity of the database against different control sets of scientists. We also analyze the scientific disciplines of these scientists according to the Science Metrix classification (174 subfield disciplines) and provide detailed data on each of the 63951 scientists using multiple citation indicators and a composite thereof. These analyses demonstrate differential concentrations in specific subfields for the local versus the diaspora cohorts, as well as an advantage of the diaspora cohort in terms of citation indicators especially among top-impact researchers. The approach that we have taken can be applied to map also the scientific workforce of other countries and nations for evaluation, planning and policy purposes.

Files

Steps to reproduce

The dataset is produced using the methodology as outlined with dataset https://doi.org/10.17632/btchxktzyw. The Scopus snapshot used for this dataset in ICSR Lab is 2020-10-01 (Lab dataset v004.20201013070847). A curated list of author profiles is applied as a filter on the data in order to produce the subset of 63951 profiles.