- 11 Modules
- 58 Module Sections
- 128 Pages
We created two data structures for easy lookup:
- One representing the course information (from course-axis.tsv)
- One containing user events (from tracklog_cleaned.tsv)
We mainly needed the timestamp, user ID, and vertical/element ID.
For the Events tree, we counted how many events happened on each page/section/module. Events include interactions with videos (eg. playing, pausing, skipping) and quiz problems (eg. checking, saving).
For the time spent tree, we found the average time spent on each page/section/module by taking the sum of time deltas between consecutive events and dividing that by the number of users. We used a 30 minute threshold to account for idling users, which was determined by plotting a histogram of time durations.
For the Students Leaving tree, we counted the number of students whose last event occured in each page/section/module.
We didn't include the navigation elements (page_close, seq_goto, seq_next, seq_prev).
Missing data: Most notably, we are missing page load data. While there is navigation event data (eg. moving between pages via the navigation bar), this data is flawed and difficult to parse because it does not contain page IDs. Also, the data does not capture navigation between sections. As a result, the first and last pages in each section would be underrepresented.
Released under the GNU General Public License, version 3.