PyVulDet-NER

Published: 19 September 2023 | Version 1 | DOI: 10.17632/h22kxj6ydt.1
Contributor:
Melanie Ehrenberg

Description

The data in this repository is associated with a manuscript entitled "Python Source Code Vulnerability Detection with Named Entity Recognition". The manuscript has been submitted to the "DevSecOps: Advances for Secure Software Development" special issue of the journal "Computers & Security". This research is part of an in-progress dissertation at George Washington University. In addition to the data in this repository, the following NER models were created with this data to identify 6 vulnerability types in Python source code:
https://huggingface.co/mmeberg/RoRo_PyVulDet_NER
https://huggingface.co/mmeberg/RoCo_PyVulDet_NER
https://huggingface.co/mmeberg/DiDi_PyVulDet_NER
https://huggingface.co/mmeberg/CoRo_PyVulDet_NER
https://huggingface.co/mmeberg/CoCo_PyVulDet_NER
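
For readers who want to try one of the listed models, the following is a minimal sketch, not the authors' documented usage. It assumes each Hugging Face repository ships weights and a tokenizer that the `transformers` token-classification pipeline can load, and that `transformers` and a backend such as `torch` are installed. The helper names `load_detector` and `find_entities` and the sample snippet are hypothetical, and the label names the models emit are not stated in this description, so none are assumed here.

```python
# Model IDs taken from the Hugging Face URLs listed in the description.
MODEL_IDS = [
    "mmeberg/RoRo_PyVulDet_NER",
    "mmeberg/RoCo_PyVulDet_NER",
    "mmeberg/DiDi_PyVulDet_NER",
    "mmeberg/CoRo_PyVulDet_NER",
    "mmeberg/CoCo_PyVulDet_NER",
]


def load_detector(model_id):
    """Build a token-classification (NER) pipeline for one listed model.

    Assumption: the repository is loadable by the standard pipeline API.
    """
    # Imported lazily so this module can be imported without transformers.
    from transformers import pipeline

    return pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",  # merge sub-word tokens into spans
    )


def find_entities(detector, source_code):
    """Return (label, text, score) tuples for each span the model tags."""
    return [
        (entity["entity_group"], entity["word"], float(entity["score"]))
        for entity in detector(source_code)
    ]


if __name__ == "__main__":
    # Downloads the model on first use; requires network access.
    detector = load_detector(MODEL_IDS[0])
    # Hypothetical input: a snippet that builds a shell command from user data.
    sample = 'import os\nos.system("ls " + user_input)\n'
    for label, text, score in find_entities(detector, sample):
        print(f"{label}\t{score:.2f}\t{text!r}")
```

Keeping the pipeline construction separate from `find_entities` makes it straightforward to run the same snippet through each of the five models and compare what they tag.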

Steps to reproduce

Details on how we created these datasets are in our paper entitled “Python Source Code Vulnerability Detection with Named Entity Recognition”.

Institutions

George Washington University

Categories

Natural Language Processing, Vulnerability Detection, Bidirectional Encoder Representations From Transformers

Licence