SM01: Research sample subsets
Research project SM01 (Parallel Semantic Crawler for manufacturing multilingual web...): the research sample sets mentioned in the "Evaluation" section of the paper, provided in spreadsheet and plain text formats, together with some extra information. Origin of the initial research data set: the research sample set was extracted from the CRM system of a company doing business in the domain of application.
Steps to reproduce
To run your own crawler over the sample sets used in our research, download the files in plain text format. The spreadsheet documents and the zip archive with internal reports contain extra information that you may want to explore later.
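As a rough sketch of how a plain text sample set might be fed to a crawler, the Python snippet below reads one file into a list of entries. It assumes the file holds one entry per line and is UTF-8 encoded; neither is stated in this description, so check the downloaded files first. The function name `load_sample_set` is hypothetical and not part of the published data set.

```python
from pathlib import Path


def load_sample_set(path):
    """Return the non-empty, de-duplicated lines of a plain-text sample file.

    Assumption: one entry per line, UTF-8 encoded. Adjust if the
    downloaded files use a different layout.
    """
    seen = set()
    entries = []
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        entry = raw.strip()
        # Skip blank lines and entries already seen, keeping the file order.
        if entry and entry not in seen:
            seen.add(entry)
            entries.append(entry)
    return entries
```

The returned list keeps the original order of the file, so it can be passed directly as a seed list to whichever crawler you are evaluating.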