WordNet-based word similarity reproducible experiments based on HESML V1R1 and ReproZip
Description
This dataset is provided as supplementary material for the paper by Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. Information Systems.

The dataset contains a ReproZip reproducible experiment file, "HESMLv1r1_reproducible_exps.rpz", which allows the experimental surveys on WordNet-based word similarity introduced in the three papers below to be reproduced exactly.

[1] Lastra-Díaz, J. J., & García-Serrano, A. (2015). A novel family of IC-based similarity measures with a detailed experimental survey on WordNet. Engineering Applications of Artificial Intelligence Journal, 46, 140–153. http://dx.doi.org/10.1016/j.engappai.2015.09.006
[2] Lastra-Díaz, J. J., & García-Serrano, A. (2015). A new family of information content models with an experimental survey on WordNet. Knowledge-Based Systems, 89, 509–526. http://dx.doi.org/10.1016/j.knosys.2015.08.019
[3] Lastra-Díaz, J. J., & García-Serrano, A. (2016). A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet (No. TR-2016-01). NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED). http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
Steps to reproduce
To reproduce the experiments contained in the HESMLv1r1_reproducible_exps.rpz file, follow the detailed instructions in the main companion paper: Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. Information Systems.
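The companion paper remains the authoritative guide. As a rough orientation only, a generic ReproZip unpacking workflow might look like the sketch below. It assumes the `reprounzip` command-line tool and its Docker unpacker plugin are installed. The target directory name `hesml_exps` and the choice of the `docker` unpacker are illustrative assumptions, not settings taken from the paper. By default the script only prints the commands; nothing runs unless `run_all` is called explicitly.

```python
import shutil
import subprocess

# File name as given in this dataset description.
PACK = "HESMLv1r1_reproducible_exps.rpz"
# Hypothetical directory name for the unpacked experiment (an assumption).
TARGET = "hesml_exps"


def build_commands(pack=PACK, target=TARGET, unpacker="docker"):
    """Return the reprounzip invocations for a generic unpack-and-run cycle.

    The unpacker choice ("docker") is an assumption; the companion paper
    may prescribe a different one.
    """
    return [
        # Inspect the package metadata before unpacking.
        ["reprounzip", "info", pack],
        # Unpack the experiment into the target directory.
        ["reprounzip", unpacker, "setup", pack, target],
        # Re-execute the packed experiment.
        ["reprounzip", unpacker, "run", target],
    ]


def run_all(commands):
    """Execute each command in order, stopping at the first failure."""
    if shutil.which("reprounzip") is None:
        raise RuntimeError("reprounzip was not found on PATH")
    for cmd in commands:
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for cmd in build_commands():
        print(" ".join(cmd))
```

Printing the commands first makes it easy to compare them against the instructions in the paper before executing anything.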