Researcher ORCID Identifier
0000-0001-5081-1758
Graduation Year
2022
Date of Submission
4-2022
Document Type
Open Access Senior Thesis
Degree Name
Bachelor of Arts
Department
Computer Science
Second Department
Mathematical Sciences
Reader 1
Mark Huber
Terms of Use & License Information
Rights Information
© 2022 Emanuel Jarquin
Abstract
In the modern era, sports betting is becoming increasingly popular. This is especially true in the realm of soccer (or ‘football’ as it is known outside the United States). As a result, the concept of attempting to predict the outcomes of soccer matches using machine learning has garnered much attention in recent years. In this thesis, I utilize well-known machine learning techniques to predict the outcomes of El Clásico matchups and compare the predictive performance of these techniques. The predictive methods employed for this thesis are random forests using the party package in R and extreme gradient boosting using the xgboost package. The dataset that will be used has been created using historical soccer data that includes match and team statistics.
Recommended Citation
Jarquin, Emanuel, "Predicting Outcomes of El Clásico Using Random Forests and Extreme Gradient Boosting" (2022). CMC Senior Theses. 3067.
https://scholarship.claremont.edu/cmc_theses/3067
Data Repository Link
https://github.com/ejarquin22/Senior-Thesis.git