An Augmented Dataset of Unicorn Companies and their Graph Visualizations
Description
The repository contains three groups of material.
First, it includes a tabular dataset of unicorn companies around the world, originally obtained from a Kaggle repository under a CC0 license (https://www.kaggle.com/datasets/deepcontractor/unicorn-companies-dataset). The source dataset was methodically enriched with additional data to fill missing cells, and then represented in a format that can be read into the yEd graph analytics software (http://www.yworks.com/products/yed). The steps are described in the paper and the presentation.
Second, it includes multiple yEd-readable *.graphml files, each constructed using a different layout algorithm.
Third, it includes a zip file that contains the Full Supplement to the paper: the dataset, the graphml files, the presentation of the research, zoomable complete graphs as pdf files, a graphml drawing of the applied graph analysis methodology, and a 30+ page Supplement document. The Supplement document contains 50+ references, the steps of the methodology (including the steps to produce the data), the settings and parameters for the graph layout algorithms, and additional analyses that illustrate the range of insights that can be derived from the data.
Files
Steps to reproduce
Starting from the original source data, the steps of data preparation, augmentation, graph representation, and analysis with graph visualizations are described in the research paper, as well as in the presentation within the Full Supplement.
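For readers who want to check a *.graphml file programmatically before opening it in yEd, the following is a minimal sketch using only the Python standard library. It is not part of the paper's methodology. The function name `summarize_graphml` and the inline sample graph are hypothetical and used for illustration only; the sketch assumes the files use the standard GraphML namespace and does not interpret any yEd-specific layout or styling data.

```python
import io
import xml.etree.ElementTree as ET

# Standard GraphML namespace (assumed to be the one used by the files).
GRAPHML_NS = "http://graphml.graphdrawing.org/xmlns"


def summarize_graphml(source):
    """Count nodes and edges in a GraphML document.

    `source` may be a file path or a binary file-like object.
    """
    root = ET.parse(source).getroot()
    node_ids = [n.get("id") for n in root.iter(f"{{{GRAPHML_NS}}}node")]
    edges = [
        (e.get("source"), e.get("target"))
        for e in root.iter(f"{{{GRAPHML_NS}}}edge")
    ]
    return {"nodes": len(node_ids), "edges": len(edges), "edge_list": edges}


# Tiny hypothetical graph, standing in for one of the repository files.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<graphml xmlns="http://graphml.graphdrawing.org/xmlns">
  <graph id="G" edgedefault="directed">
    <node id="n0"/>
    <node id="n1"/>
    <edge id="e0" source="n0" target="n1"/>
  </graph>
</graphml>
"""

if __name__ == "__main__":
    summary = summarize_graphml(io.BytesIO(SAMPLE.encode("utf-8")))
    print(summary["nodes"], "nodes,", summary["edges"], "edges")
```

To apply the sketch to a downloaded file, pass its path in place of the in-memory sample, for example `summarize_graphml("some_layout.graphml")`, where the file name is a placeholder for whichever graphml file is being inspected.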