Assessing vocabulary used in course wikipages in terms of Common European Framework of Reference for Languages (CEFR) level
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
As a former professor, the following article inspired me to look at the vocabulary used in some Canvas course rooms in terms of the Common European Framework of Reference for Languages (CEFR) levels:
Teachers’ receptive and productive vocabulary sizes in English-medium instruction (tandfonline.com)
A program to extract the vocabulary used in a course room and a program to prune the list of "words" and add information about CEFR levels. Note that this is primarily directed at courses in American English (with some words in Swedish - this support is very limited at present). The vocabulary is based on a third-year course in Internetwork, a 4th-year course in Voice over IP, a 4th-year course in research methodologies and scientific writing, a course in accelerated computing, and a course in data science. [The last two courses are based on material provided by Nvidia's Deep Learning Institute under a CC-BY license that has been converted into Canvas course rooms]. The courses have accompanying videos and these have been captioned and wikipages created for each PowerPoint slide - with the corresponding transcript from the video added to the wikipage. As a result, the input material includes a very large portion of the course content. It does not include content in files, quizzes, etc.
Hopefully, with some automated feedback to the teachers, the accessibility of the course material can be increased. It might even be useful for students to know the distribution of CEFR levels for the vocabulary in a course they are considering taking.
The first program is compute_unique_words_for_pages_in_course.py it takes a course_id as the only argument. The second program is prune_unique_words.py and it also takes a course_id as the only argument.
The code can be found at https://github.com/gqmaguirejr/Canvas-tools
In addition to outputting a number of files, the prune program gives a summary in the form shown below (for a course that has 187,072 words in it):