The data in this repository accompanies the manuscript "Python Source Code Vulnerability Detection with Named Entity Recognition", which has been submitted to the "DevSecOps: Advances for Secure Software Development" special issue of the journal "Computers & Security". This research is part of an in-progress dissertation at George Washington University. In addition to the data in this repository, the following NER models were created with this data to identify six vulnerability types in Python source code:
https://huggingface.co/mmeberg/RoRo_PyVulDet_NER
https://huggingface.co/mmeberg/RoCo_PyVulDet_NER
https://huggingface.co/mmeberg/DiDi_PyVulDet_NER
https://huggingface.co/mmeberg/CoRo_PyVulDet_NER
https://huggingface.co/mmeberg/CoCo_PyVulDet_NER
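As a rough illustration of how one of the models listed above might be tried out, the sketch below loads a checkpoint through the Hugging Face `transformers` library. It rests on assumptions not stated in this description: that the checkpoints are standard token-classification models that `pipeline("token-classification")` can load, and that `transformers` plus a backend such as PyTorch are installed. The helper `load_detector` and the sample snippet are hypothetical, and the label names the models emit are documented on the model pages and in the paper, not here.

```python
# Model IDs taken from the Hugging Face URLs listed in this description.
MODEL_IDS = [
    "mmeberg/RoRo_PyVulDet_NER",
    "mmeberg/RoCo_PyVulDet_NER",
    "mmeberg/DiDi_PyVulDet_NER",
    "mmeberg/CoRo_PyVulDet_NER",
    "mmeberg/CoCo_PyVulDet_NER",
]


def load_detector(model_id: str):
    """Return a token-classification pipeline for one of the listed models.

    Hypothetical helper. Assumption: the checkpoint is a standard
    token-classification model; this is not confirmed by the description.
    """
    if model_id not in MODEL_IDS:
        raise ValueError(f"Unknown model ID: {model_id!r}")
    # Imported lazily so the module can be inspected without transformers installed.
    from transformers import pipeline

    # aggregation_strategy="simple" merges sub-word tokens into labelled spans.
    return pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",
    )


if __name__ == "__main__":
    # Downloads the model on first use; requires network access.
    detector = load_detector(MODEL_IDS[0])
    # Hypothetical input, chosen only to show the call shape.
    snippet = "import os\nos.system(user_input)\n"
    for entity in detector(snippet):
        # Each aggregated result carries a label, character offsets, and a score.
        print(entity["entity_group"], entity["start"], entity["end"], entity["score"])
```

Whether raw source text is the right input, or whether the models expect the preprocessing used to build these datasets, should be checked against the paper before relying on any output.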
Steps to reproduce
Details on how we created these datasets are given in our paper, "Python Source Code Vulnerability Detection with Named Entity Recognition".