Buddhist datasets
Published: 2 January 2024 | Version 1 | DOI: 10.17632/5hzs8w46jh.1
Contributor:
Tao He
Description
This dataset is released as the public data source for the manuscript entitled "A Novel Masking Model for Buddhist Literature Understanding by Using Generative Adversarial Networks". It contains the following files:

The Buddhist pretraining source documents:
--pretrainDataset.zip

The Buddhist functional word dictionary:
--function_words.txt

The Buddhist terminology dictionary:
--buddhist_words.txt

The Zen Text Segmentation (ZTS) dataset:
--ZTSdataset.zip
  --Train set: segDataTrain.npy
  --Validation set: segDataVal.npy
  --Test set: segDataTest.npy

The Zen Sentiment Classification (ZSC) dataset:
--ZSCdataset.zip
  --Train set: ZSCTrain.npy
  --Validation set: ZSCVal.npy
  --Test set: ZSCTest.npy
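The .npy splits named in the description can be inspected with NumPy once the archives are extracted. The following is a minimal sketch, not the authors' loading code. It assumes the archives unpack into directories named `ZTSdataset` and `ZSCdataset` (hypothetical paths), and the helper `load_splits` is introduced here for illustration only. The internal layout of the arrays is not documented on this page, so the sketch only reports each array's shape and dtype.

```python
from pathlib import Path

import numpy as np

# File names as listed in the dataset description.
ZTS_FILES = {"train": "segDataTrain.npy", "val": "segDataVal.npy", "test": "segDataTest.npy"}
ZSC_FILES = {"train": "ZSCTrain.npy", "val": "ZSCVal.npy", "test": "ZSCTest.npy"}


def load_splits(directory, files, allow_pickle=False):
    """Load every split file that exists in `directory`.

    Hypothetical helper. `allow_pickle` defaults to False; if NumPy raises a
    ValueError about object arrays, pass True only for files you trust,
    because unpickling can execute arbitrary code.
    """
    splits = {}
    for split, filename in files.items():
        path = Path(directory) / filename
        if path.exists():
            splits[split] = np.load(path, allow_pickle=allow_pickle)
    return splits


if __name__ == "__main__":
    # "ZTSdataset" and "ZSCdataset" are assumed extraction directories.
    for name, directory, files in [
        ("ZTS", "ZTSdataset", ZTS_FILES),
        ("ZSC", "ZSCdataset", ZSC_FILES),
    ]:
        for split, data in load_splits(directory, files).items():
            print(name, split, data.shape, data.dtype)
```

Splits whose files are missing are skipped, so the script runs even if only one archive has been extracted.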
Categories
Natural Language Processing, Chinese Language, Chinese Literature