English Handwritten Pages Dataset
Description
This dataset contains handwritten document images collected from volunteer undergraduate students of the Department of Computer Science and Engineering (CSE), Varendra University, Bangladesh. The dataset was created to support research in writer identification, writer verification, handwriting analysis, document authentication, and pattern recognition. Each participant provided handwritten pages that were digitized and organized under unique participant identifiers. The dataset includes multiple handwritten samples per writer, enabling the development and evaluation of machine learning and deep learning models for handwritten writer identification systems. All personally identifiable information was removed or replaced with anonymized identifiers before dataset publication. Each writer is assigned a unique dataset ID, and the images are stored using a standardized naming convention. The dataset may be used for research and educational purposes in the fields of computer vision, document analysis, pattern recognition, biometrics, forensic handwriting analysis, and artificial intelligence. Potential applications of this dataset include writer identification, writer verification, handwriting-based biometric authentication, forensic document examination, handwritten document analysis, and the development of machine learning and deep learning methods for handwriting recognition and classification.
Files
Steps to reproduce
1. Recruit volunteer participants and obtain informed consent for the use of handwritten samples in research. 2. Provide participants with blank sheets and instructions to write English text naturally in their own handwriting. 3. Collect multiple handwritten pages from each participant. 4. Digitize the handwritten pages using a scanner or high-resolution camera. 5. Review the collected images and remove low-quality or unreadable samples. 6. Assign an anonymized identifier to each participant to protect personal information. 7. Organize images into writer-specific folders and rename files using a standardized naming convention. 8. Use the resulting dataset for writer identification, writer verification, handwriting analysis, and related machine learning research.
Institutions
- Varendra UniversityRajshahi Division, Rajshahi