MMARAU Institutional Repository

Evaluating the Performance of Tree-Based Predictive Models as Programme Recommenders for University Entrants in Kenya.

Show simple item record

dc.contributor.author Kibuthi J. Kabiru, Ratemo C. Makiya , Anduvare, E.M
dc.date.accessioned 2025-01-16T08:19:20Z
dc.date.available 2025-01-16T08:19:20Z
dc.date.issued 2024-10
dc.identifier.uri http://hdl.handle.net/123456789/17500
dc.description.abstract Enrolling for the wrong programme by university students has, to an extent, contributed to the high rates of discontinuation on academic grounds, repeat year cases, change of programme after registration, interuniversity transfers, deferments to change programme, drop out cases, suspension over exam irregularities as well as to strikes. This study focused on finding a technological solution for reducing these cases by evaluating three tree-based predictive models and recommending the most predictive model to implement as a programme recommender. Data was collected in five selected public universities in Kenya using Google Forms. The respondents were 308 translating to 308 rows of data with 36 columns. Numpy, Pandas, Matplotlib, Sklearn, Seaborn, Scipy, Plotly python analytics libraries were deployed using Jupyter Notebook for Anaconda. The cleaned and processed dataset features had categorical variables thus one-hot-encoding technique was employed. Data was split for training and testing with the random_state set to 42. Gini index criteria was implemented. The three models were evaluated on their performance from the optimally split data for training and test with a 80:20 ratio. Random Forest (RF) came out the most predictive at 99.3% followed by Gradient Boosting (XG Boost) at 90% then Decision Tree (DT) at 80.93%. The testing accuracy score for RF was 81.72%, XGBoost was at 75.72% and DT was at 76.34%. Confusion matrix criterion was implemented to evaluate the performance of the three models. The results of this study have demonstrated the high accuracy level of RF as the most predictive tree-based model for this real-world University crisis. The model is recommended for development as a system to be integrated into the KUCCPS portal. The integrated system is dubbed Programme Recommender which if launched would highly predict the best programme of study for application by university entrants. en_US
dc.language.iso en en_US
dc.subject Random Forest, Gradient Boosting, Python, KUCCPS, Programme Recommender. en_US
dc.title Evaluating the Performance of Tree-Based Predictive Models as Programme Recommenders for University Entrants in Kenya. en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account