Publication information

Data Set Size Analysis for Detecting the Urgency of Discussion Forum Posts

Authors

ŠVÁBENSKÝ Valdemar, BOUCHET François, TARRAZONA Francine, LOPEZ II Michael, BAKER Ryan S.

Year of publication 2024
Type Conference abstracts
Description In both Massive Open Online Courses (MOOCs) and private courses, instructors face a large number of queries in discussion forum posts that may merit a response. There has been ongoing research on how to employ machine learning to predict a post’s urgency in order to focus instructors’ attention. However, it is unclear how large a course must be to develop these models. We took a publicly available data set of 3,503 labeled forum posts and the code from one such prior study. We re-trained the six models described in that study with progressively smaller sample sizes to determine whether the models’ performance would be preserved. We demonstrate that random subsets as small as 10% of the original data set achieve performance comparable to the full data set in five out of six models.
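
The subsampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' code. The fraction list, the seed, and the helper names `subsample_sizes` and `random_subset` are assumptions made for illustration; only the data set size (3,503) and the 10% level come from the abstract. Integer IDs stand in for the labeled posts, and model training is left out.

```python
import random

def subsample_sizes(n_total, fractions):
    """Return how many posts are kept at each fraction (at least 1)."""
    return [max(1, round(n_total * f)) for f in fractions]

def random_subset(items, size, seed):
    """Draw a random subset without replacement, reproducibly."""
    rng = random.Random(seed)
    return rng.sample(items, size)

N_POSTS = 3503  # data set size stated in the abstract
# Assumed fractions: the abstract names only the 10% level.
FRACTIONS = [1.0, 0.5, 0.25, 0.1]

post_ids = list(range(N_POSTS))  # placeholder for the labeled posts
sizes = subsample_sizes(N_POSTS, FRACTIONS)
subsets = [random_subset(post_ids, s, seed=0) for s in sizes]

for frac, subset in zip(FRACTIONS, subsets):
    # Each model would be re-trained and evaluated on this subset here.
    print(f"{frac:.0%} of data -> {len(subset)} posts")
```

In an actual replication, each subset would be passed to the same training and evaluation routine so that scores can be compared across sample sizes.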
