DeepLMS: a deep learning predictive model for supporting online learning in the Covid-19 era

Dias, Sofia B.; Hadjileontiadou, Sofia J.; Diniz, José; Hadjileontiadis, Leontios J.

doi:10.1038/s41598-020-76740-9

Download PDF

Article
Open access
Published: 16 November 2020

DeepLMS: a deep learning predictive model for supporting online learning in the Covid-19 era

Sofia B. Dias¹^na1,
Sofia J. Hadjileontiadou²^na1,
José Diniz¹ &
…
Leontios J. Hadjileontiadis^3,4,5

Scientific Reports volume 10, Article number: 19888 (2020) Cite this article

12k Accesses
43 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Coronavirus (Covid-19) pandemic has imposed a complete shut-down of face-to-face teaching to universities and schools, forcing a crash course for online learning plans and technology for students and faculty. In the midst of this unprecedented crisis, video conferencing platforms (e.g., Zoom, WebEx, MS Teams) and learning management systems (LMSs), like Moodle, Blackboard and Google Classroom, are being adopted and heavily used as online learning environments (OLEs). However, as such media solely provide the platform for e-interaction, effective methods that can be used to predict the learner’s behavior in the OLEs, which should be available as supportive tools to educators and metacognitive triggers to learners. Here we show, for the first time, that Deep Learning techniques can be used to handle LMS users’ interaction data and form a novel predictive model, namely DeepLMS, that can forecast the quality of interaction (QoI) with LMS. Using Long Short-Term Memory (LSTM) networks, DeepLMS results in average testing Root Mean Square Error (RMSE) $<0.009$, and average correlation coefficient between ground truth and predicted QoI values $r\ge 0.97$ $(p<0.05)$, when tested on QoI data from one database pre- and two ones during-Covid-19 pandemic. DeepLMS personalized QoI forecasting scaffolds user’s online learning engagement and provides educators with an evaluation path, additionally to the content-related assessment, enriching the overall view on the learners’ motivation and participation in the learning process.

Driving STEM learning effectiveness: dropout prediction and intervention in MOOCs based on one novel behavioral data analysis approach

Article Open access 18 March 2024

Xiaona Xia & Wanxue Qi

Interpretable early warning recommendations in interactive learning environments: a deep-neural network approach based on learning behavior knowledge graph

Article Open access 25 May 2023

Xiaona Xia & Wanxue Qi

Embedding cognitive framework with self-attention for interpretable knowledge tracing

Article Open access 20 October 2022

Yanjun Pu, Wenjun Wu, … Pu Feng

Introduction

New designs of educational processes that include online learning have been flourishing in the last decades; some characteristic examples^1,2,3 include affective (a-), blended (b-), collaborative (c-), mobile (m-), game (g-), transformative (t-), Cloud (Cl-), and ubiquitous (u-) learning, among others. Online learning improves access to education and training, aiming at reducing temporal and spatial problems that can be met in the traditional form of education^4,5. In parallel, online learning has become one of the fastest growing industries, with a market growth rate over 900% since 2000, which is expected to reach in 2025 an impressive total market value of $325 billion⁶. Furthermore, as to the production and provision of online learning courses, the latter, when compared against the conventional Face-to-Face (F2F) ones, have an average consumption of 90% less energy and 85% fewer CO2 emissions produced per student⁷.

Online learning, though, asks for the combination of different delivery methodologies to contribute towards the optimization not only of the learning development, but also of deployment costs and time⁸. In this context, a key-factor that adds value to the quality of the learning experience is the quality of interaction (QoI) within an online learning environment (OLE). Apparently, effective integration of technology is needed⁹ to support QoI within OLEs. Hence, efficient blending of strategic decisions, adequate available resources, and quick thinking in implementation are necessary for the development of efficient online learning. Nowadays, this becomes more visible, when the worldwide emergency of the pandemic Coronavirus disease (Covid-19) impacts approximately 600 million learners across the Globe (https://en.unesco.org/covid19/educationresponse, accessed 19/10/2020), rigorously shifting traditional F2F teaching/learning to online one¹⁰.

Learning Management Systems (LMSs) frame a digital learning environment where the user’s learning behavior and it’s evaluation need to be efficiently amended¹¹. LMSs (e.g., Moodle, https://moodle.org/) are actually embedded within OLE, which usually offer quick access, huge data management and a variety of Web-based tools^12,13. As Herrington et al.¹⁴ state, the degree of interactions that take place within an educational context of reference is an essential predictor of its success. The QoI of the learner, within the LMS, is a strong efficacy indicator of the design and its ability to sustain online learning communities^15,16,17,18. In particular, designed procedures within the learning environment can activate and sustain interactions towards learning. Then, upon these interactions¹⁹, knowledge can be extracted concerning the student’s preferred learning patterns while interacting with leaning resources, and/or while collaborating in groups. In this respect, empirical findings suggest that the commitment to the course workflow²⁰, the connection time²¹ and the total number of accesses to the system²² are very important. Moreover, LMS-based online records can also be used to map individual- and population-level social jet lag, showing the potentiality of the LMS to provide behavioral information in terms of learning and attention deficits²³, or emergence of novel relationships between social structure and performance²⁴. From an educational data mining and learning analytics perspective, LMS was used to provide user track data within the online learning context, which were used as additional sources of information in: i) early detection of at-risk students on distance learning modules^25,26, ii) findings as to learning dispositions^27,28, iii) learning success and performance prediction^{29,30,31,32,33,34,35,36,37,38,39}, and iv) learner behavior and goal attainment in Massive Open Online Courses (MOOCs) prediction⁴⁰. Nonetheless, no evaluation of the QoI was performed, beyond merely descriptive statistics of the users’ interactions and their relation with the users’ performance, which were the main focus of the analysis.

The aforementioned place the need on the analysis of user’s LMS-based interactions related with their quality, so the latter could be used to explain the true nature of the users’ behavior when interacting within a LMS. So far, relevant research focused on QoI tends to examine LMS data statistics, including learner-teacher discussions and exchanges in online forums, to investigate the dimension, depth and category of interactions occurred⁴¹. A more extended and quantitative approach in QoI analysis was introduced by Dias and Diniz⁴². Their model, namely FuzzyQoI, considers the users’ (professors’ and students’) interactions, based on LMS use, and, by translating the knowledge of the experts in the field to fuzzy constructs, quantitatively estimates, a normalized index of the users’ QoI. As a result, the latter, can be used to identify users’ LMS interaction trends and provide personalised feedback to users. Another approach to evaluate the human interaction processes on a LMS-based online learning course was proposed by Dzandu and Tang⁴³. They used a semiotic framework as guide to identify syntactic, semantic, pragmatic and social context gaps or problems, focusing on only the human information interaction issues. Nevertheless, their approach was based on simple questionnaires, missing out the dynamic characteristics of LMS interactions. In an further effort, Dias et al.⁴⁴ suggested the use of a Fuzzy Cognitive Map (FCM) as a means to efficiently model the way LMS users interact with it, by estimating their QoI within a b-learning context. Their FCM-QoI model was used to analyse the QoI influential concepts’ contribution to self-sustained cycles (static analysis) and corresponding alterations, when the use of the LMS time period is considered (dynamic analysis), demonstrating potential to increase the flexibility and adaptivity of the QoI modeling and feedback approaches. In the work of Cerezo et al.⁴⁵, identical students’ LMS Moodle logs behaviours were grouped concerning effort, time spent working, and procrastination, in order to investigate the students’ asynchronous learning processes, matching their behaviours with different achievement levels. Although this approach tries to shed light upon the role of the LMS interaction in the students’ achievements, it lacks generalisation power and evaluates the LMS-based QoI mainly from the grading of the students’ achievements and not from the actual interaction quality per se.

The current work explores, for the first time, the predictive power that can be drawn from the analysis of the LMS-based QoI using Deep Learning. The proposed enhancement of LMS, namely DeepLMS, fills the gap in predictive use of LMS-based QoI to early inform effective feedback providers, i.e., educators, policy makers, relevant stakeholders, so to apply any corrective measure to increase the efficiency of the educational processes. In addition, DeepLMS acts as a metacognitive triggering tool to the learners, as it provides them with a prediction of their LMS-based QoI, so to reflect on their current QoI and proceed with any necessary personal corrective actions. By adopting a Long Short-Term Memory (LSTM) artificial Recurrent Neural Network (RNN) architecture⁴⁶, a LSTM-based predictor was employed to form the QoI prediction model of DeepLMS, trained and tested on experimental LMS-based QoI data drawn from three databases, i.e., DB1⁴², DB2, and DB3, that come from both the pre- (DB1) and during- (DB2, DB3) Covid-19 pandemic periods, and refer to different countries, sociocultural and educational settings. The derived experimental results across DB1-DB3 show efficient predictive performance of the DeepLMS to accurately predict the daily QoI values, despite any temporal and/or educational setting differences. An illustration of the DeepLMS-based QoI prediction process is depicted in Fig. 1.

Results

As the three examined databases come from different countries (Portugal, United Arab Emirates, Greece), and refer to different time periods, i.e., pre- (DB1) and during-Covid-19 pandemic (DB2, DB3), and systemic settings, i.e., macro: Higher-Educational Institution (HEI)’s level (DB1), meso: course level (DB2), and micro: focused discipline level (DB3), the performance of the proposed DeepLMS approach is separately presented per database.

DB1-related performance

Figure 2 depicts the predictive performance of the DeepLMS upon some excerpts of the QoI time series derived from the DB1 75 Professors (P#2, P#33, P#35, P#60, P#65, and P#70). In particular, the left column of Fig. 2 shows the QoI data used for training (from day 1 until day 323 where the vertical solid line lies) and for testing (day 324 until the day 358), whereas the right column zooms into the testing QoI data (blue solid line) and the DeepLMS predicted QoI (red dashed line). Moreover, in the right column of Fig. 2, the estimated correlation coefficient r between the testing and the estimated QoI data (see “Methods” section) for each case is also superimposed. These cases of QoI were selected to showcase the predictive performance of DeepLMS on QoI time series that have various patterns across the whole duration of the two academic semesters (358 days). In particular, for the case of P#2, an almost periodic pattern is noticeable, where the alteration between $QoI=0.11$ and $QoI=0.5$ is visible. Nevertheless, this is not evident in the rest of the cases, where there are sparse alterations of QoI with various frequency alterations between $QoI=0.11$ and $QoI=0.5$ values. As the LSTM network is capable of forgetting data that are not useful for its predictive performance, the predictive results shown in the right column of Fig. 2 and the corresponding r values, justify the efficient predictive performance of the proposed DeepLMS approach. One step further, in the right column of Fig. 2 for the case of P#55, out of all of the changes in slope across the depicted subfigures, this is the only one that contains a proactive prediction value, i.e., not just reacting to the QoI change (blue solid line) with a slight lag due to the use of the recurrent LSTM model.

Similarly to Fig. 2, Fig. 3 depicts the predictive performance of the DeepLMS upon some excerpts of the QoI time series derived from the DB1 1037 Students (S#55, S#60, S#155, S#310, S#612, and S#775). The same configuration as in Fig. 2 is also followed in Fig. 3, where again various cases of the QoI time series distribution across the two academic semesters are shown. The solid line at day 323 separates the data used for the training from the ones used for testing. Moreover, the estimated correlation coefficient r per case is also depicted. From the results presented in Fig. 3, the same level of high predictive performance of DeepLMS seen in Fig. 2 is sustained for the case of Students.

Figure 4 illustrates the distribution of the DeepLMS performance indices (see “Methods” section) across the whole set of QoI data per user type (DB1 Professors (75): Fig. 4a–c; DB1 Students (1037): Fig. 4d–f. In particular, the distribution of the Root Mean Square Error (RMSE) between the testing and the estimated QoI data is depicted in Fig. 4a,d for the case of DB1 Professors and Students, respectively. Moreover, the distribution of the correlation coefficient r between the testing and the estimated QoI data is depicted in Fig. 4b,e for the case of DB1 Professors and Students, respectively. Furthermore, the distribution of the correlation coefficient $r_d$ between the derivative of the testing and the derivative of the estimated QoI data is depicted in Fig. 4c,f for the case of DB1 Professors and Students, respectively. The median and the 95% Confidence Interval (CI) of the estimated RMSE, r and $r_d$ are tabulated in Table 1. From both Fig. 4a,d and Table 1, it is clear that the testing RMSE lies in quite satisfactory levels across the two users’ groups of DB1, showing an efficient predictive performance by the DeepLMS. This is further justified by the very strong correlation identified both between the amplitude of the testing and predicted QoI values (Fig. 4b,e, Table 1) and the trend of the testing and predicted QoI values (Fig. 4c,f, Table 1). The number of outliers (red crosses in Fig. 4 lying outside > 1.5 times the interquartile range, i.e., the box-plot whiskers) does not really affect the overall predictive performance of the DeepLMS, as expressed by the corresponding high median values and low 95% CIs (Table 1). Moreover, the difference in the number of outliers between Fig. 4a–f comes from the distinct difference in the number of users per group (DB1 75 Professors vs. 1037 Students).

Table 1 DeepLMS and baseline FCM-QoI⁴⁴ predictive performance indices across the whole set of QoI data per database and users’ group.

Full size table

DB2-related performance

Similarly to the results presented in the previous subsection for DB1, Fig. 5 depicts the predictive performance of the DeepLMS upon the QoI time series derived from the DB2 3 Professors (Fig. 5-top panel: P#1, P#2, and P#3) and from some excerpts from the DB2 180 Students (Fig. 5-bottom panel: S#25, S#39, S#58, S#158, S#171, and S#172). In both panels, the left column of Fig. 5 shows the QoI data used for training (before the vertical solid line) and for testing (after the vertical solid line), whereas the right column zooms into the testing QoI data (blue solid line) and the DeepLMS predicted QoI (red dashed line). Analogously to Figs. 2 and 3, the estimated correlation coefficient r between the testing and the estimated QoI data for each case is also superimposed on the right column of Fig. 5. As it can be seen from Fig. 5, the QoI time series involved in the testing part tends to converge to a constant value ($QoI=0.11$), whereas in the training QoI data there are notable alterations of QoI between $QoI=0.11$ and $QoI=0.5$ values. This difference in QoI values can potentially be attributed to the fact that the last days of the course were devoted to the demo presentation of students’ projects; hence, the focus was mostly placed on hands-on activities rather than LMS interactions. From the results depicted in Fig. 5, it is evident that the DeepLMS captures such change in the QoI values, exhibiting efficient performance in predicting the underlying trend.

Consonantly to Figs. 4 and 6 illustrates the distribution of the DeepLMS performance indices, i.e., RMSE, r, and $r_d$, across the whole set of QoI data per user type (DB2 Professors (3): Fig. 6a–c; DB2 Students (180): Fig. 6d–f). The corresponding values of the median and 95% CI of the estimated RMSE, r, and $r_d$ for the DeepLMS performance when using DB2 QoI data are tabulated in Table 1. From both Fig. 6 and Table 1, it is clear that DeepLMS sustains its efficient predictive performance reported in the case of DB1 also in the case of DB2, exhibiting quite satisfactory performance metrics across the two users’ groups of DB2. Apparently, the differences between DB1 and DB2 performance metrics are due to the different number of users per database (see Table 2); however, they are both quite acceptable.

Table 2 Characteristics of the databases used.

Full size table

DB3-related performance

Figure 7 depicts the predictive performance of the DeepLMS upon the QoI time series derived from the DB3 1 Professor (Fig. 7-top panel: P#1) and from some excerpts from the DB3 52 Students (Fig. 7-bottom panel: S#13, S#18, S#23, S#27, S#29, and S#40), presented at the same format as in Fig. 5. As it can be seen from Fig. 7, the QoI time series involved in the testing part, unlike the ones depicted in Fig. 5, exhibit alterations between $QoI=0.11$ and $QoI=0.5$ values similar to the ones observed in the training QoI data, resembling also the patterns followed in Figs. 2 and 3. This can potentially be explained by the difference in the way the focused discipline related to DB3 was evaluated during the exams period, as it involves more online research-based activities, rather than the hands-on ones seen in DB2. From the results depicted in Fig. 7, it is clear that the DeepLMS takes into account the QoI characteristics from the training period and successfully predicts the various QoI patterns seen in the testing period.

The distribution of the DeepLMS performance indices, i.e., RMSE, r, and $r_d$, across the whole set of QoI data for the case of DB3 Students (52) is illustrated in Figs. 8a–c, respectively. The distribution for the case of DB3 Professors was omitted, as there is only one Professor involved within the DB3. The corresponding values of the median and 95% CI of the estimated RMSE, r, and $r_d$ for the DeepLMS performance when using DB3 QoI data are tabulated in Table 1. The results presented both in Fig. 8 and Table 1, confirm efficient predictive performance of the DeepLMS when using QoI data from DB3, similarly to the cases of DB1 and DB2. Apparently, the different number of users per database (see Table 2) contributes to the differences seen in the DeepLMS performance metrics across DB1, DB2 and DB3; yet, in all cases, the DeepLMS predictive performance can be considered quite satisfactorily.

Discussion

In the unprecedented era of Covid-19, an alteration in the landscape for online education is clearly manifested by the hundreds of thousands of educators and learners setting out to academic cyberspace and OLEs. This is a paradigmatic change, a ‘black swan’ moment⁴⁷, as the unforeseen event of Covid-19 pandemic ushers the educational practices in video conferencing platforms (e.g., Zoom, WebEx, MS Teams) and LMS-based uses. Surely, there is a high variability in the way educators act online (often for the first time) for offering remote instruction to their students outside the physical classroom. The abrupt ending of in-person classes leading to online settings can speed up the adoption of OLEs as learning mediators. Nevertheless, this instructional change, in such a compressed time frame, has the risk to solely create an insipid copy of today’s best online learning practices. Possibly, this is due, in part, to the lack of investment in online education modality by many educational institutions and/or to underestimation of online learning as a core aspect of their learner experience⁴⁸. However, led by top universities, a noticeable change began a few years ago, as fully digital academic experiences started flourishing⁴⁹. The current crisis due to Covid-19 is accelerating this trend, revealing the need for Higher Education Institutions (HEIs) to promote faculty’s digital skills. The latter can be facilitated by the construction of a technological backbone, to mitigate the effects of this crisis and to welcome the online teaching/learning within a digital era. These digital competences can amalgamate the short-term response to crisis into an enduring digital transformation of education contexts.

In this disrupted educational landscape, the issues of how instructors and colleges treat student evaluation and how institutions treat student evaluations of professors have surfaced. Educators face a challenge/opportunity in trying to evaluate quality, even as the educational activity they are evaluating is mutating, in real time. DeepLMS comes into foreground as a means to offer quantitative metrics of the user’s LMS-based QoI, as an alternative to conventional evaluation metrics. The efficient predictive performance of the DeepLMS, as justified by the experimental results derived from three databases, involving pre- and during-Covid-19 pandemic data, establishes a reliable basis to construct a motivational, personalized feedback to the LMS users, so to readjust their interaction with the LMS, as an effort to increase the related QoI. The latter refers to the efficient engagement of the user with the online part of the learning process (nowadays almost the sole one), and provides educators with an evaluation path, in parallel to the content-related assessment, that could enrich their overall view about learner motivation and participation in the learning process. Moreover, based on the estimated dQoI(k) and its segmentation setting (see “Methods” section), personalized feedback can be provided to users that helps them get the most out of their interaction with the LMS and the related online material, and can have a significant impact on overall learning performance outcomes⁵⁰. Many forms of representation of the segmented dQoI(k) value can be employed (e.g., text, graphs, audiovisual); in all of them, however, a personalization in the way the feedback is communicated to users should be incorporated. For example, when $dQoI(k)\in [-1,-0.8]$, a text message of ‘Serious alert! Your QoI is expected to significantly fall!’, can be used as an intense warning; alternatively, it can be more constructive in the form of ‘From now on, please try to be more focused and more active in the online part of your course’. The former textual feedback is a descriptive interpretation of the dQoI(k) value per se, whereas the latter is a proactive interpretation that motivates learners to act constructively. This need for personalised interpretation stems from the fact that learners, usually, chose different paths to respond to learning challenges. For example, the ones with a positive orientation view feedback (either positive or negative) as information to be assimilated and accommodated. However, the ones with a negative orientation, perceive negative feedback as a ‘crushing blow’ and reflection of their poor ability⁵¹; most of such learners easily give up⁵⁰. Hence, the adaptation of the feedback path could better support the ultimate aim in feedback provision, which allows learners to take over the function of assessing themselves and others⁵².

Within the aforementioned context, yet from a more integrated perspective, the proposed DeepLMS approach can be augmented to become an ideal mechanism/feedback to support various stakeholder groups in the domain of education (including department heads, teachers, administrators, technical support staff, and learners). This can be achieved by aggregating the individual predictive user outputs. This process could lead to effective technology-enabled learning. Amongst its attributes, it should include a focus on enduring learning outcomes. This endurance is reinforced by the DeepLMS through its focusing on the QoI prediction, whose dynamic feedback to LMS users, gradually etches in them the optimized LMS interaction as an enduring learning outcome.

From the results presented in Figs. 2, 3, 4, 5, 6, 7 and 8 and Table 1, the proposed DeepLMS seems independent of the group type, as it shows a similar predictive performance both in Professors’ and Students’ QoI prediction (Wilcoxon rank sum test for DB1: $p=0.070$). In addition, cross-country/scale/time-period statistical analysis has resulted in non-significantly statistical differences of the performance of DeepLMS for different sociocultural and temporal settings (Wilcoxon rank sum test for $RMSE(DB1,DB2): p=0.207; RMSE(DB1,DB3): p=0.219; RMSE(DB2,DB3)$: p = 0.387). The same holds for the factors of sex and age, as linear regression tests did not show any influence of both on the prediction of QoI for Professors ($\{sex, age\}_{P-DB1}: \{p=0.363, p=0.113\}$) and Students ($\{sex, age\}_{S-DB1}: \{p=0.415, p=0.167\}; \{sex, age\}_{S-DB2}: \{p=0.465, p=0.673\}; \{sex, age\}_{S-DB3}: \{p=0.508, p=0.693\}$). Note that the statistics related to Professors were estimated for DB1 only, as the number of Professors in DB2 (3) and DB3 (1) is limited. Moreover, DeepLMS seems insensitive to the sparsity of the interaction, as it efficiently models the user’s LMS-based various interaction patterns, as expressed in the QoI time-series morphology across time (Figs. 2, 3, 5, 7). These patterns are governed by various academic calendar activities, e.g., lectures, mid-term exams, final exams, winter/spring/summer breaks, and/or external ones, especially for DB2 and DB3, as the lockdown due to Covid-19 pandemic (DB2: 26/3-24/4/2020; DB3: 11/3-4/5/2020) lies within their time duration (see Table 2). In spite of these, the DeepLMS acknowledges such data evolution, resulting in adequate predictive performance due to the ability of its embedded LSTM forecasting model to outperform classical time series methods in cases with long, interdependent and sparse time series⁵³. Clearly, the aforementioned results show increased generalization power in the performance of DeepLMS. Extending the demographics analysis in the bias domain, as Table 2 shows, the distribution of Male/Female was quite balanced, both in Professors and Students, along with their age, without any heterogeneity that would potentially produce data bias in LSTM learning. Hence, no historical bias (as no socio-technical issues were involved), no representation bias (sufficient number of users were involved and the significant spread of QoI data comes from users across three countries, with five courses with 30-40 different disciplines each course (macro level: DB1), one course (meso level: DB2) and one discipline (micro level: DB3)), no measurement bias (data were captured from users that all had equal access to the LMS Moodle pages after logged in), no evaluation bias (the evaluation was performed on an equal basis and with the same objective measures of (RMSE, r) as in the baseline algorithm (FCM-QoI⁴⁴)), no population bias (user population represented in the datasets is coming from a real-life setting (University) end expresses the original target population), no Simpson’s Paradox (the data were homogeneous and there were no subgroups in Professors’ and Students’ groups), no sampling bias (uniform sampling across all groups), no user-interaction bias (the LMS Moodle metrics involved in the production of the QoI are 112 (see Table S1) and provide a high variety to the user to interact with the LMS Moodle), and no self-selection bias (the data were analyzed after the users interacted with the LMS and they were totally unaware of the research; hence, they could not influence the results by selective self-participation). Consequently, there is no unfairness arising from biases in the data. Taking the bias issue further, another source of unfairness could potentially arise from the learning algorithm involved itself. To avoid such event, some techniques⁵⁴ could be explored to try to transform the data (pre-processing), so that the underlying discrimination is removed, or incorporating changes into the objective function or imposing a constraint (in-processing), or accessing a holdout set, which was not involved during the training of the model, and reassign the initially assigned labels by the model based on a function (post-processing). In the DeepLMS case, although no data bias was identified, in a broader perspective, flexibly fair representation in DeepLMS learning could be introduced in its future edition by creating a layer that disentangles the information that relate with sensitive attributes (e.g., demographics) and create a targeted learning for such sensitive latent variables, which potentially can bias the model, and incorporate such knowledge in a debias process (e.g., as in^55,56) at the higher QoI prediction layer.

When performing a relevant comparative analysis between the DeepLMS performance and the most related FCM-QoI model⁴⁴, that it is also based on the same QoI data drawn from the FuzzyQoI model⁴², it seems that the proposed DeepLMS achieves higher overall performance, when compared to the testing output of FCM-QoI. In particular, based on the predictive performance of both the DeepLMS and the FCM-QoI⁴⁴ tabulated in Table 1, it is apparent that the DeepLMS exhibits lower testing RMSE and higher r values in its predictive output, when compared with the ones from the FCM-QoI model⁴⁴. From a structural comparison, DeepLMS overcomes the training limitation of the FCM-QoI, i.e., its training is based on the mean values of QoI across users provided by the FuzzyQoI model; this, however, merges the specific characteristics of each user to an average behavior⁴⁴. On the contrary, the DeepLMS provides personalised predictions for the QoI of each user across the academic semesters.

From a more general perspective, DeepLMS aligns with the previous efforts that incorporate LSTM-based predictions in the context of online education, yet not at the exact same specific problem settings as in DeepLMS. Hence, the latter is well-positioned with the approaches related to: i) cross-domains analysis, e.g., MOOCs impact in different contexts⁵⁷, as DeepLMS could be easily adapted to a micro analysis of the QoI per discipline/course and transfer learning from one discipline to another at the same course (or courses with comparable content), as shown here with the application of DeepLMS to DB1-DB3, in a similar manner that was applied in MOOCs from different domains⁵⁷; ii) combination of learning patterns in the context domain with the temporal nature of the clickstream data⁵⁸, and identification of students at risk⁵⁹, as DeepLMS could be combined with an auto-encoder to capture both the underlying behavioral patterns and the temporal nature of the interaction data at various levels of the predicted QoI (e.g., low (<0.5) QoI (at risk level)); iii) predicting learning gains by incorporating skills discovery^60,61, as DeepLMS could provide the predicted QoI as an additional source of the user profile to his/her skills and learning gains; iv) user learning states and learning activities prediction from wearable devices⁶², as DeepLMS could easily be embedded in the expanded space of affective (a-) learning, and inform a more extended predictive model that would incorporate the learning state with the estimated QoI; v) increasing the communication of the instructional staff to learners based on individual predictions of their engagement during MOOCs^63,64, as DeepLMS could facilitate the coordination of the instructor with the learner based on the informed predicted QoI; and vi) predicting the learning paths/performance⁶⁵ and the teaching paths⁶⁶, as the DeepLMS could be extended in the context of affecting the learning/teaching path by the predicted QoI.

Despite the promising results of the proposed DeepLMS towards prediction of the user’s LMS-based QoI, certain limitations exist. In particular, no correlation analysis with the content evaluation outcome from, e.g., quizzes, mid-/final exams, was undertaken. In fact, this was left for a future endeavor, as the focus here was to explore the predictive performance of the DeepLMS in LMS-based QoI prediction, fostering the role of the latter as an additional, to conventional grading, assessment field. Moreover, the data used here refer to one (2009/2010) or half (2020) academic year; thus, exploration of the DeepLMS application and further validation of its predictive performance upon follow-up data, i.e., monitoring of the same users across sequential academic years, would shed light upon the consistency in the predictive performance of the DeepLMS across longer time periods.

The efficient performance of the DeepLMS was validated on real data, incorporating adequate number of users and LMS data logs from different countries and educational settings. Since the structure and training of the proposed DeepLMS are not restricted to a specific course content, actually they were tested on human kinetics (DB1), engineering design (DB2), and advanced signal processing (DB3) educational contents, it could easily be expanded to the analysis of LMS data coming from various fields, e.g., Social Sciences, Medical and/or Engineering Education⁶⁷. This would allow for the exploration of any dis/similarities and correlations in the LMS users’ QoI, from an institutional perspective. Moreover, as the Covid-19 pandemic shifted the use of LMS Moodle to Secondary Education Institutions (SEIs), as well, the DeepLMS could be used for comparing the LMS-based QoI across younger student groups and explore the age-related trends in LMS-based interaction.

As part of our future work on DeepLMS, we aim to perform a fusion of other measures of user’s quality in the online learning context at both SEIs and HEIs. This includes prediction of the Quality of Collaboration (QoC)⁶⁸ and Quality of Affective Engagement (QoAE)^1,69, in an effort to predict, in a holistic way, the various components that play significant role in the learning process, i.e., interaction, collaboration and affectiveness¹. The incorporation of Deep Learning-based predictions of QoC and QoAE, in parallel to the QoI ones, extends the work of the authors^70,71,72 from the concept of affective/blended/collaborative-teaching/learning (a/b/c-TEACH, http://abcteach.fmh.ulisboa.pt/) to the a/b/c/d(eep)-TEACH one. In the midst of the Covid-19 pandemic, such an AI-based scaffolding helps educators and learners move from quick fixes, and their possible consequence of regressing to poor practice, to maximum efficiency of the online learning tools available and truly support learning. Finally, as distinct time periods of pre-, during- and post-Covid-19 lockdown have been formed, the analysis of LMS data that emerged during these three periods seems promising, in particular for the identification of any effect on the QoI per se and its related prediction via the proposed DeepLMS model. This analysis will allow for further evaluation of the DeepLMS model predictive robustness against effects caused by time-related disruptors, such as Covid-19, in the context of education; ongoing efforts towards such direction are reported in⁷³.

Methods

The proposed DeepLMS approach explores the predictive power of Deep Learning in estimating the user’s LMS-based QoI within an online learning context, from his/her historical QoI data. This efficient QoI prediction feeds the feedback path (see Fig. 1), in an effort to provide metacognitive stimulus to learners and timely inform the educators as to their possible lack of motivation and course focus and/or adoption of unstructured online course interaction, alerting for preventive and corrective interventions. The performance of the DeepLMS was evaluated on QoI data estimated from $>647.000$ LMS Moodle interactions, as described below.

Dataset

The LMS Moodle data used in DeepLMS were drawn from three databases, i.e., DB1, DB2 and DB3. The users’ characteristics and their contribution in LMS interactions per database, along with the related HEI, country, time period, Covid-19 association, and scale, are tabulated in Table 2. In particular, DB1 refers to the data included in the work of Dias and Diniz⁴², with 610,775 in total users’ LMS interactions, across two academic semesters (358 days) of the 2009/2010 academic year. All users started to use LMS Moodle in that academic year. These contributions were provided by the users (75 Professors and 1,037 Students) within five b-learning-based undergraduate courses, i.e., Sport Sciences, Ergonomics, Dance, Sport Management and Psychomotor Rehabilitation, offered by a public HEI (Faculdade de Motricidade Humana, Portugal). DB2 includes overall 9,646 users’ LMS online learning interactions drawn from Khalifa University of Science and Technology (KUST), Abu Dhabi, UAE, during the Spring semester of 2020 (76 days). These contributions were provided by the users (3 Professors and 180 Students) during the course of Engineering Design. The latter is a freshman course on the basic principles of engineering design, applied on solving real-life problems via projects. DB3 includes overall 27,056 users’ LMS online learning interactions drawn from a discipline in the area of Advanced Signal Processing at the Department of Electrical and Computer Engineering (ECE), Aristotle University of Thessaloniki (AUTH), Greece, taught at the 4th year of ECE studies, during the Spring semester and Summer/Fall exam periods of 2020 (181 days). The LMS contributions come from one Professor and 52 Students; the discipline is focused in techniques and algorithms of advanced signal processing, as a means to propose efficient solutions in research problems. The set of the available 110 LMS Moodle metrics ($M_1-M_{110}$ in Fig. 1; see Supplementary Table S1) logged by the users were corresponded to 14 categories ($C_1-C_{14}$ in Fig. 1; see Supplementary Table S1) that formed the inputs to the FuzzyQoI model⁴². The latter outputted the QoI estimations per user across the whole time-span of the analysis, which was kept the same across all databases, i.e., 358 days as in DB1, by using linear interpolation in the cases of DB2 and DB3; yet, displaying the initial length (DB1: 76 days; DB3: 180 days) in all resulted illustrations (Figs. 2, 3, 5, 7). These QoI daily estimations were used as ground-truth inputs to an LSTM-based predictor (Fig. 1) for training and testing (see relevant subsections below). More details of the QoI estimation from the FuzzyQoI model can be found in the work of Dias and Diniz⁴².

It should be noted that all data used here were de-identified (any information that would allow individual’s identity was stripped out). DB1 data come from the two authors’ (S.D and J.D) previous work⁴², where they had ethics clearance for research purpose use; hence, no ethical approval is needed for their reuse here. The use of DB2 data was approved by the KUST Ethics Committee (Protocol #: H20-021, 17.6.2020), whereas access to DB3 for research purpose use was granted by the AUTH eLearning Administrator to the last author (L.H), who was the responsible Professor of the related discipline.

DeepLMS predictive performance evaluation

The predictive performance of the DeepLMS was separately evaluated for the two user types, i.e., Professors and Students (Table 2), analyzing their testing data in terms of: (a) the RMSE between the QoI values from the FuzzyQoI model⁴², i.e., $QoI^{FuzzyQoI}$, and the ones predicted by the DeepLMS, i.e., $QoI^{DeepLMS}$, (b) the correlation coefficient r between the $QoI^{FuzzyQoI}$ and $QoI^{DeepLMS}$, in order to evaluate the correctness in the estimation of the $QoI^{FuzzyQoI}$ values, and (c) the correlation coefficient $r_d$ between the derivative of the $QoI^{FuzzyQoI}$ and the derivative of the $QoI^{DeepLMS}$, in order to evaluate the correctness in the estimation of the $QoI^{FuzzyQoI}$ dynamics trend (increase/decrease). In both r and $r_d$ estimations, the value of $p\le 0.05$ was used for adopting them as statistically significant. Finally, the distributions of (a)-(c) across the whole set per user type were estimated (displayed as boxplots), in order to evaluate the overall predictive performance of the proposed DeepLMS approach.

User’s feedback path triggering

For the triggering of the user’s feedback path (Fig. 1), the difference, at instance k, between the $QoI^{FuzzyQoI}(k)$ and ${\hat{QoI}}^{DeepLMS}(k+1)$ is estimated, i.e., $dQoI(k)={\hat{QoI}}^{DeepLMS}(k+1)-QoI^{FuzzyQoI}(k)$, considering the use of an already trained LSTM network. As all estimated QoI values are normalized within [0,1], the estimated dQoI(k) ranges between $[-1,1]$. Positive dQoI(k) values can be used for a rewarding user’s feedback, whereas negative dQoI(k) values can be used for a warning one. Segmentation of the dQoI(k) range [−1,1] to different subsets, e.g., $[-1,-0.8), [-0.8,-0.5), [-0.5,-0.3), [-0.3,-0.1) [-0.1,0.1), [0.1,0.3), [0.3,0.5), [0.5,0.8)$ and [0.8, 1], could allow for flexibility in the granularity of the feedback construction.

Long short-term memory networks

An LSTM network is a subclass of RNNs⁴⁶, trying to circumvent RNNs’ inability to learn to recognise long-term dependencies in the data sequences. Hochreiter and Schmidhuber⁷⁴ addressed the latter by presenting the LSTM unit, whereas LSTM networks are constructed by combining several layers of LSTM units. Figure 9 shows the structure of an LSTM unit, and its sequence across time. Each LSTM unit consists of three gates that operate on the input vector, $x_t$, to generate the cell state, $C_t$, and the hidden state, $h_t$. From a physical interpretation, the cell state can be viewed as the memory of the cell, while the gates control the flow of information in and out of the memory. In addition, the input gate determines the incorporation of new information, the forget gate determines which information should be discarded, and the output gate controls the information that passes along to the next layer. Following the interconnections presented in Fig. 9, the following formulas per category of the variables hold:

1.
Gating variables:
$$\begin{aligned} f_t&=\sigma (W_fx_t+U_fh_{t-1}+b_f) \end{aligned}$$
(1)
$$\begin{aligned} i_t&=\sigma (W_ix_t+U_ih_{t-1}+b_i) \end{aligned}$$
(2)
$$\begin{aligned} o_t&=\sigma (W_ox_t+U_oh_{t-1}+b_o) \end{aligned}$$
(3)
2.
Candidate (memory) cell state variable:
$$\begin{aligned} \tilde{C_t}=\tanh {(W_cx_t+U_ch_{t-1}+b_c)} \end{aligned}$$
(4)
3.
Cell and hidden state variables:
$$\begin{aligned} C_t&=f_t\quad \circ \quad C_{t-1}+i_t\quad \circ \quad \tilde{C_t} \end{aligned}$$
(5)
$$\begin{aligned} h_t&=o_t\circ \tanh {(C_t)} \end{aligned}$$
(6)

where {W, U} and b are the learnable weights and bias of the LSTM layer, respectively, for the input and the recurrent connections for the input/output/forget gates and cell state; $\circ $ is the element-wise product of two vectors; $\sigma $ is a sigmoid function given by $\sigma (x)=(1+e^{-x})^{-1}$ to compute the gate activation function, whereas the hyperbolic tangent function (tanh) is used to compute the state activation function.

Implementation issues

The final network was implemented in Matlab 2020a (The Mathworks, Natick, USA), and trained using the Adaptive Moment Estimation (Adam) optimizer⁷⁵. The final selection of the hyperparameters of the network was based on the results from early test runs with different settings; the one which provided most promising predictive performance was finally chosen. In particular, the final network consisted of four layers, i.e., the sequence input layer, the LSTM layer with 1200 hidden units, the fully connected layer and the regression output layer, and was trained for 300 epochs. With this selection, the estimated training RMSE was converging to values less than 0.001 across the 300 iterations (Fig. 10). To prevent the gradients from exploding, the gradient threshold was set to 1. The initial learn rate was set to 0.005 and the learn rate was dropped after 150 epochs by multiplying the initial rate by a factor of 0.2. The size of the mini-batch used for each training iteration to evaluate the gradient of the loss function and update the weights was set equal to 128.

A common issue that should be considered during training any kind of neural network is overfitting, due to the highly flexible nature of the network. In order to reduce the negative effects of overfitting, apart from the dropout process described above, regularisation techniques can also be applied to reduce the generalization error. In this vein, the $L^2$ norm regularization was also adopted here⁷⁶. This technique, also known as Tikhonov regularization and ridge regression in statistics, is a specific way of regularizing a cost function with the addition of a complexity-representing term. In the case of $L^2$ regularization in neural networks, the term is simply the squared Euclidean norm of the weight matrix of the hidden layer of the network. $L^2$ regularization usually results in much smaller weights across the entire model. An additional parameter, $\lambda $, is added to allow control of the strength of the regularization; here the value of $\lambda =0.0005$ was used.

Training and testing

The model was trained and tested on a High Performance Computing infrastructure at KUST, Abu Dhabi, UAE (equipped with 84 Nodes, 168 Processors, 2016 Cores, 21.5 TB Mem, 23+ TB NFS), using 24 Ivy Bridge processing nodes (2x Intel Xeon E5-2697 v2, 12Core 2.7GHz, 256 GB Memory/300 GB Local storage), running in parallel. Training was realized using the first 90% of the QoI sequence per user, whereas testing was applied on its last 10%. At each time step of the input sequence, the LSTM network learnt to predict the value of the next time step (see Fig.1).

Data availability

All data generated and analysed during this work are available from https://github.com/LeontiosH/DeepLMS/tree/DeepLMS-data.

Code availability

All codes used in this work are available from https://github.com/LeontiosH/DeepLMS/tree/Matlab-code.

References

Picard, R. W. et al. Affective learning-a manifesto. BT Technol. J. 22, 253–269 (2004).
Article Google Scholar
Ponce, O. A., Gómez, J. & Pagán, N. Current scientific research in the humanities and social sciences: central issues in educational research. Eur. J. Sci. Theol. 15, 81–95 (2019).
Google Scholar
Alexander, B. et al. EDUCAUSE Horizon Report 2019 Higher Education Edition. Tech. Rep., EDU19 (2019).
Anderson, T. The Theory and Practice of Online Learning (Athabasca University Press, Edmonton, 2008).
Google Scholar
Panigrahi, R., Srivastava, P. R. & Sharma, D. Online learning: adoption, continuance, and learning outcome—a review of literature. Int. J. Inf. Manag. 43, 1–14 (2018).
Article Google Scholar
Meskhi, B., Ponomareva, S. & Ugnich, E. E-learning in higher inclusive education: needs, opportunities and limitations. Int. J. Educ. Manag. 33, 424–437 (2019).
Article Google Scholar
Roy, R., Potter, S. & Yarrow, K. Towards sustainable higher education: environmental impacts of conventional campus, print-based and electronic/open learning systems. In Distance Education and Technology: Issues and Practice (eds Murphy, D. et al.) 129–145 (Open University of Hong Kong Press, Kowloon, 2004).
Google Scholar
Oliver, M. & Trigwell, K. Can ‘blended learning’ be redeemed?. E-learning Digit. Media 2, 17–26 (2005).
Google Scholar
Garrison, D. R. & Kanuka, H. Blended learning: uncovering its transformative potential in higher education. Internet Higher Educ. 7, 95–105 (2004).
Article Google Scholar
Sun, L., Tang, Y. & Zuo, W. Coronavirus pushes education online. Nat. Mater. 19, 687 (2020).
Article ADS CAS PubMed Google Scholar
Hijón-Neira, R. & Velázquez-Iturbide, Á. From the discovery of students access patterns in e-learning including Web 2.0 resources to the prediction and enhancements of students outcome. In E-learning, Experiences and Future, Chap. 14 (ed. Soomro, S.) 275–294 (IntechOpen, London, 2010).
Google Scholar
Conole, G., De Laat, M., Dillon, T. & Darby, J. ‘Disruptive technologies’, ‘pedagogical innovation’: whats new? Findings from an in-depth study of students’ use and perception of technology. Comput. Educ. 50, 511–524 (2008).
Article Google Scholar
Redecker, C. Review of learning 2.0 practices: Study on the impact of Web 2.0 innovations of education and training in Europe. Tech. Rep., European Commission EUR 23664 EN – Joint Research Centre – Institute for Prospective Technological Studies (2009).
Herrington, J., Reeves, T. C. & Oliver, R. Immersive learning technologies: realism and online authentic learning. J. Comput. Higher Educ. 19, 80–99 (2007).
Article Google Scholar
Anderson, T., Liam, R., Garrison, D. R. & Archer, W. Assessing teaching presence in a computer conferencing context. J. Asynchronous Learn. Netw. 5, 1–17 (2001).
Google Scholar
Kidd, T. Key aspects affecting students’ perception regarding the instructional quality of online and web based courses. Instr. Technol. 2, 55–61 (2005).
Google Scholar
Lim, C. & Lee, S. Pedagogical usability checklist for ESL/EFL e-learning websites. J. Converg. Inf. Technol. 2, 67–76 (2007).
Google Scholar
Grant, M. R. & Thornton, H. R. Best practices in undergraduate adult-centered online learning: mechanisms for course design and delivery. J. Online Learn. Teach. 3, 346–356 (2007).
Google Scholar
Sheard, J. I., Albrecht, D. W. & Butbul, E. ViSION: visualizing student interactions online. In Australasian World Wide Web Conference, 48–58 (Southern Cross University, 2005).
Chen, N.-S. & Lin, K.-M. Factors affecting e-learning for achievement. In IEEE International Conference on Advanced Learning Technologies, Kazan, Russia 200–205 (2002).
Kickul, J. & Kickul, G. New pathways in e-learning: the role of student proactivity and technology utilization. In 45rd Annual Meeting of the Midwest Academy of Management Conference, Indiana, USA (2002).
Ramos, C. & Yudko, E. “Hits” (not “discussion posts”) predict student success in online courses: a double cross-validation study. Comput. Educ. 50, 1174–1182 (2008).
Article Google Scholar
Smarr, B. L. & Schirmer, A. E. 3.4 million real-world learning management system logins reveal the majority of students experience social jet lag correlated with decreased performance. Sci. Rep. 8, 1–9 (2018).
Article CAS Google Scholar
Vaquero, L. M. & Cebrian, M. The rich club phenomenon in the classroom. Sci. Rep. 3, 1–8 (2013).
Article CAS Google Scholar
Wolff, A., Zdrahal, Z., Herrmannova, D., Kuzilek, J. & Hlosta, M. Developing predictive models for early detection of at-risk students on distance learning modules. In Machine Learning and Learning Analytics Workshop at The 4th International Conference on Learning Analytics and Knowledge (LAK14), 24–28 Mar 2014, Indianapolis, Indiana, USA (2014).
Hung, J.-L., Shelton, B. E., Yang, J. & Du, X. Improving predictive modeling for at-risk student identification: a multistage approach. IEEE Trans. Learn. Technol. 12, 148–157 (2019).
Article Google Scholar
Tempelaar, D. T., Rienties, B. & Giesbers, B. In search for the most informative data for feedback generation: learning analytics in a data-rich context. Comput. Hum. Behav. 47, 157–167 (2015).
Article Google Scholar
Tempelaar, D. T., Rienties, B. & Nguyen, Q. Towards actionable learning analytics using dispositions. IEEE Trans. Learn. Technol. 10, 6–16 (2017).
Article Google Scholar
Gašević, D., Dawson, S., Rogers, T. & Gasevic, D. Learning analytics should not promote one size fits all: the effects of instructional conditions in predicting academic success. Internet Higher Educ. 28, 68–84 (2016).
Article Google Scholar
Conijn, R., Snijders, C., Kleingeld, A. & Matzat, U. Predicting student performance from LMS data: a comparison of 17 blended courses using Moodle LMS. IEEE Trans. Learn. Technol. 10, 17–29 (2016).
Article Google Scholar
Pardo, A., Han, F. & Ellis, R. A. Combining university student self-regulated learning indicators and engagement with online learning events to predict academic performance. IEEE Trans. Learn. Technol. 10, 82–92 (2016).
Article Google Scholar
Saqr, M., Fors, U. & Nouri, J. Using social network analysis to understand online Problem-Based Learning and predict performance. PLoS ONE 13, e0203590 (2018).
Article PubMed PubMed Central CAS Google Scholar
Larrabee Sønderlund, A., Hughes, E. & Smith, J. The efficacy of learning analytics interventions in higher education: a systematic review. Br. J. Educ. Technol. 50, 2594–2618 (2019).
Article Google Scholar
Herodotou, C. et al. The scalable implementation of predictive learning analytics at a distance learning university: insights from a longitudinal case study. Internet Higher Educ. 45, 100725 (2020).
Article Google Scholar
Jovanović, J., Dawson, S., Joksimović, S. & Siemens, G. Supporting actionable intelligence: reframing the analysis of observed study strategies. In Proceedings of the Tenth International Conference on Learning Analytics & Knowledge 161–170 (2020).
Abdous, M., Wu, H. & Yen, C.-J. Using data mining for predicting relationships between online question theme and final grade. J. Educ. Technol. Soc. 15, 77–88 (2012).
Google Scholar
Aldowah, H., Al-Samarraie, H. & Fauzy, W. M. Educational data mining and learning analytics for 21st century higher education: a review and synthesis. Telemat. Inform. 37, 13–49 (2019).
Article Google Scholar
Kostopoulos, G., Karlos, S. & Kotsiantis, S. Multiview learning for early prognosis of academic performance: a case study. IEEE Trans. Learn. Technol. 12, 212–224 (2019).
Article Google Scholar
Viberg, O., Hatakka, M., Bälter, O. & Mavroudi, A. The current landscape of learning analytics in higher education. Comput. Hum. Behav. 89, 98–110 (2018).
Article Google Scholar
Kizilcec, R. F., Pérez-Sanagustín, M. & Maldonado, J. J. Self-regulated learning strategies predict learner behavior and goal attainment in massive open online courses. Comput. Educ. 104, 18–33 (2017).
Article Google Scholar
Ping, T. A., Cheng, A. Y. & Manoharan, K. Students’ interaction in the online learning management systems: a comparative study of undergraduate and postgraduate courses. In Proceedings of the AAOU-2010 Annual Conference 1–14 (2010).
Dias, S. B. & Diniz, J. A. FuzzyQoI model: a fuzzy logic-based modelling of users’ quality of interaction with a learning management system under blended learning. Comput. Educ. 69, 38–59 (2013).
Article Google Scholar
Dzandu, M. D. & Tang, Y. Beneath a learning management system-understanding the human information interaction in information systems. Procedia Manuf. 3, 1946–1952 (2015).
Article Google Scholar
Dias, S. B., Hadjileontiadou, S. J., Hadjileontiadis, L. J. & Diniz, J. A. Fuzzy cognitive mapping of lms users’ quality of interaction within higher education blended-learning environment. Expert Syst. Appl. 42, 7399–7423 (2015).
Article Google Scholar
Cerezo, R., Sánchez-Santillán, M., Paule-Ruiz, M. P. & Núñez, J. C. Students’ LMS interaction patterns and their relationship with achievement: a case study in higher education. Comput. Educ. 96, 42–54 (2016).
Article Google Scholar
Bengio, Y., Simard, P. & Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 5, 157–166 (1994).
Article CAS PubMed Google Scholar
Taleb, N. N. The Black Swan: The Impact of the Highly Improbable Vol. 2 (Random House, New York, 2007).
Google Scholar
Fee, K. Delivering E-Learning: A Complete Strategy for Design Application and Assessment (Kogan Page Ltd, London, 2009).
Google Scholar
Kim, H. J., Hong, A. J. & Song, H.-D. The roles of academic engagement and digital readiness in students’ achievements in university e-learning environments. Int. J. Educ. Technol. Higher Educ. 16, 21 (2019).
Article Google Scholar
Yorke, M. Formative assessment in higher education: moves towards theory and the enhancement of pedagogic practice. Higher Educ. 45, 477–501 (2003).
Article Google Scholar
Poyatos-Matas, C. & Allan, C. Providing feedback to online students: a new approach. In Higher Education in A Changing World, Annual International HERDSA Conference 3–7 (2005).
Light, G., Calkins, S. & Cox, R. Learning and Teaching in Higher Education: The Reflective Professional (Sage, Thousand Oaks, 2009).
Google Scholar
Laptev, N., Yosinski, J., Li, L. E. & Smyl, S. Time-series extreme event forecasting with neural networks at uber. Int. Conf. Mach. Learn. 34, 1–5 (2017).
Google Scholar
d’Alessandro, B., O’Neil, C. & LaGatta, T. Conscientious classification: a data scientist’s guide to discrimination-aware classification. Big Data 5, 120–134 (2017).
Article PubMed Google Scholar
Creager, E. et al. Flexibly fair representation learning by disentanglement. In Proceedings of International Conference on Machine Learning 1436–1445 (2019).
Amini, A., Soleimany, A. P., Schwarting, W., Bhatia, S. N. & Rus, D. Uncovering and mitigating algorithmic bias through learned latent structure. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society 289–295 (2019).
Wei, X., Lin, H., Yang, L. & Yu, Y. A convolution-LSTM-based deep neural network for cross-domain MOOC forum post classification. Information 8, 92 (2017).
Article Google Scholar
Ding, M., Yang, K., Yeung, D.-Y. & Pong, T.-C. Effective feature learning with unsupervised learning for improving the predictive models in massive open online courses. In Proceedings of the 9th International Conference on Learning Analytics & Knowledge 135–144 (2019).
Aljohani, N. R., Fayoumi, A. & Hassan, S.-U. Predicting at-risk students using clickstream data in the virtual learning environment. Sustainability 11, 7238 (2019).
Article Google Scholar
student models for interventions. Mao, Y. Deep learning vs. Bayesian knowledge tracing. J. Educ. Data Min. 10, 28–54 (2018).
Google Scholar
Doleck, T., Lemay, D. J., Basnet, R. B. & Bazelais, P. Predictive analytics in education: a comparison of deep learning frameworks. Educ. Inf. Technol. 25, 1951–1963 (2020).
Article Google Scholar
Zhou, Z. et al. Applying deep learning and wearable devices for educational data analytics. In 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI) 871–878 (IEEE, 2019).
Le, C. V., Pardos, Z. A., Meyer, S. D. & Thorp, R. Communication at scale in a mooc using predictive engagement analytics. In International Conference on Artificial Intelligence in Education 239–252 (Springer, 2018).
Xiong, F., Zou, K., Liu, Z. & Wang, H. Predicting learning status in MOOCs using LSTM. In Proceedings of the ACM Turing Celebration Conference-China 1–5 (2019).
Zhou, Y., Huang, C., Hu, Q., Zhu, J. & Tang, Y. Personalized learning full-path recommendation model based on LSTM neural networks. Inf. Sci. 444, 135–152 (2018).
Article Google Scholar
Ahad, M. A., Tripathi, G. & Agarwal, P. Learning analytics for IoE based educational model using deep learning techniques: architecture, challenges and applications. Smart Learn. Environ. 5, 1–16 (2018).
Article Google Scholar
Lawton, D. et al. Online learning based on essential concepts and formative assessment. J. Eng. Educ. 101, 244–287 (2012).
Article Google Scholar
Dias, S. B., Hadjileontiadou, S. J., Diniz, J. A. & Hadjileontiaids, L. J. Towards a hybrid world-the Fuzzy Quality of Collaboration/Interaction (FuzzyQoC/I) hybrid model in the semantic Web 3.0. In International Conference on Computer Supported Education, vol. 2, 187–195 (SCITEPRESS, 2015).
Landowska, A. Affective learning manifesto-10 years later. In European Conference on e-Learning 281 (Academic Conferences International Limited, 2014).
Hadjileontiadou, S. J., Dias, S. B., Diniz, J. A. & Hadjileontiadis, L. J. Fuzzy Logic-Based Modeling in Collaborative and Blended Learning (Information Science Reference, 2015).
Dias, S. B., Diniz, J. A. & Hadjileontiadis, L. J. Towards an Intelligent Learning Management System Under Blended Learning: Trends, Profiles and Modeling Perspectives (Springer, Berlin, 2013).
Google Scholar
Dias, S. B., Hadjileontiadou, S., Diniz, J. A. & Hadjileontiadis, L. Towards an intelligent learning management system: the A/B/C-TEACH approach. In International Conference on Technology and Innovation in Learning, Teaching and Education 397–411 (Springer, 2018).
Hadjileontiadou, S. J., Dias, S. B., Diniz, J. A. & Hadjileontiadis, L. J. FuzzyQoI-based estimation of the Quality of Interaction in online learning amid Covid-19: a Greek case-study. In Proceedings of the 2nd International Conference on Technology and Innovation in Learning, Teaching and Education (TECH-EDU 2020), online (December 2–4, 2020).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
Merity, S., Keskar, N. S. & Socher, R. Regularizing and optimizing LSTM language models. arXiv preprint arXiv:1708.02182 (2017).

Download references

Acknowledgements

The authors would like to thank the Assistant Professor Carlos Alberto Rosa Ferreira at the Faculdade de Motricidade Humana, Universidade de Lisboa, Lisbon, Portugal, for his contribution in LMS Moodle data handling and reformatting. Moreover, the authors acknowledge the eLearning Administrators of KUST and AUTH for their assistance in the DB2 and DB3 access, respectively. Finally, the authors would like to acknowledge the help of Dr. Ana Balula in the manuscript proof-reading.

Author information

These authors contributed equally: Sofia B. Dias and Sofia J. Hadjileontiadou.

Authors and Affiliations

CIPER, Faculdade de Motricidade Humana, Universidade de Lisboa, Lisbon, Portugal
Sofia B. Dias & José Diniz
Department of Primary Education, Democritus University of Thrace, Alexandroupolis, Greece
Sofia J. Hadjileontiadou
Department of Electrical Engineering and Computer Science, Khalifa University of Science and Technology, Abu Dhabi, UAE
Leontios J. Hadjileontiadis
Healthcare Engineering Innovation Center, Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, UAE
Leontios J. Hadjileontiadis
Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloníki, Greece
Leontios J. Hadjileontiadis

Authors

Sofia B. Dias
View author publications
You can also search for this author in PubMed Google Scholar
Sofia J. Hadjileontiadou
View author publications
You can also search for this author in PubMed Google Scholar
José Diniz
View author publications
You can also search for this author in PubMed Google Scholar
Leontios J. Hadjileontiadis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.D, S.H., and L.H. conceived the study protocol; L.H. developed the algorithms and trained the model networks; S.D, S.H., and L.H. analysed the data. All authors discussed the results and contributed to the manuscript.

Corresponding author

Correspondence to Leontios J. Hadjileontiadis.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dias, S.B., Hadjileontiadou, S.J., Diniz, J. et al. DeepLMS: a deep learning predictive model for supporting online learning in the Covid-19 era. Sci Rep 10, 19888 (2020). https://doi.org/10.1038/s41598-020-76740-9

Download citation

Received: 02 April 2020
Accepted: 27 October 2020
Published: 16 November 2020
DOI: https://doi.org/10.1038/s41598-020-76740-9

This article is cited by

Anxiety about the pandemic and trust in financial markets
- Roy Cerqueti
- Valerio Ficcadenti
The Annals of Regional Science (2024)
Recent advances in Predictive Learning Analytics: A decade systematic review (2012–2022)
- Nabila Sghir
- Amina Adadi
- Mohammed Lahmer
Education and Information Technologies (2023)
A systematic review on trends in using Moodle for teaching and learning
- Sithara H. P. W. Gamage
- Jennifer R. Ayres
- Monica B. Behrend
International Journal of STEM Education (2022)
Three-dimensional DenseNet self-attention neural network for automatic detection of student’s engagement
- Naval Kishore Mehta
- Shyam Sunder Prasad
- Sanjay Singh
Applied Intelligence (2022)
COVID-19 early detection for imbalanced or low number of data using a regularized cost-sensitive CapsNet
- Malihe Javidi
- Saeid Abbaasi
- Mahdi Jampour
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.