Pygotham 2017

Published: 6 Oct 2017 | Version 1 | DOI: 10.17632/8kyckg3dh5.1
Contributor(s):

Description of this data

This dataset contains 4 files:

  1. A .csv containing 29,105 sentences from CC-BY papers that contain citations ("pygothamCleanDataset.csv").
  2. A community edition databricks notebook to process and explore the data as .dbc
  3. A community edition databricks notebook to view in HTML.
  4. Pygotham slides in PDF format.

Experiment data files

Steps to reproduce

Make sure to update all paths!

Please see this link for an archived copy of the notebook with all output:
https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/2644196477475309/2247597868200546/3108286398802724/latest.html

Related links

Latest version

  • Version 1

    2017-10-06

    Published: 2017-10-06

    DOI: 10.17632/8kyckg3dh5.1

    Cite this dataset

    Cox, Jessica; Harper, Corey (2017), “Pygotham 2017”, Mendeley Data, v1 http://dx.doi.org/10.17632/8kyckg3dh5.1

Statistics

Views: 3291
Downloads: 621

Categories

Data Analysis

Licence

CC BY 4.0 Learn more

The files associated with this dataset are licensed under a Creative Commons Attribution 4.0 International licence.

What does this mean?

This dataset is licensed under a Creative Commons Attribution 4.0 International licence. What does this mean? You can share, copy and modify this dataset so long as you give appropriate credit, provide a link to the CC BY license, and indicate if changes were made, but you may not do so in a way that suggests the rights holder has endorsed you or your use of the dataset. Note that further permission may be required for any content within the dataset that is identified as belonging to a third party.

Report