Local (Medical) Academic Word List
Description
This dataset contains all of the files and data that we accessed and used to create out initial local (medical) academic word list (L-AWL). We cannot include the actual corpora in this dataset because many files (e.g., lecture PPT slides) were copyrighted by instructors and contain identifying information. The dataset is broken up into steps, and each step has a folder with data, supporting files, and a walkthrough video. Please email us if something is missing or if any information contained in this dataset is not clear.
Files
Steps to reproduce
1. Open up AntWordProfiler. 2. Clear the level lists in the "Level List(s)" pane. 3. Add subcorpora to "User File(s)" pane. 4. Add level lists (NGSL 1-3 and NAWL) to "Level List(s)" pane. 5. In "Output Settings," select "Statistics," "Word Groups (Families)," and "Include words in user file(s) but not in level list(s)." 6. Click "Start." 7. After the analysis, in "Results," scroll down until you find "Groups NOT Found in Base Lists." 8. Copy-Paste the "Groups NOT Found in Base Lists" data into an Excel spreadsheet. 9. Save the Excel spreadsheet as a .CSV file.