Graduation Year


Date of Submission


Document Type

Campus Only Senior Thesis

Degree Name

Bachelor of Arts


Mathematical Sciences

Reader 1

Beth Trushkowsky

Rights Information

© 2016 Rachelle L. Holmgren


Extracting meaningful insights from massive datasets to guide business decisions requires specialized skills in data analysis. Unfortunately, the supply of these skills has not kept pace with demand, which is driven by the massive amount of data society generates each day. As a result, businesses are left with large amounts of unanalyzed data that could have supported their decision making. Automating the analysis of this data would help address a key challenge many companies face: a shortage of appropriate analytical skills. This paper examines the process of automating data analysis and the challenges involved. Central challenges include removing outliers without context, transforming data into a format compatible with the chosen analysis method, and analyzing the results of the model.

This thesis is restricted to current faculty, students, and staff of the Claremont Colleges.