Parallelization of Matrix Factorization for Recommender Systems
| dc.contributor.author | Baum, Julia | |
| dc.contributor.author | Cook, Cynthia | |
| dc.contributor.author | Curtis, Michael | |
| dc.contributor.author | Edgerton, Joshua | |
| dc.contributor.author | Rabidoux, Scott | |
| dc.contributor.author | Raim, Andrew M. | |
| dc.contributor.author | Neerchal, Nagaraj K. | |
| dc.contributor.author | Bell, Robert M. | |
| dc.date.accessioned | 2018-10-25T14:35:11Z | |
| dc.date.available | 2018-10-25T14:35:11Z | |
| dc.date.issued | 2010 | |
| dc.description.abstract | Recommender systems are emerging as important tools for improving customer satisfaction by mathematically predicting user preferences. Several major corporations including Amazon.com and Pandora use these types of systems to suggest additional options based on current or recent purchases. Netflix uses a recommender system to provide its customers with suggestions for movies that they may like, which are based on their previous ratings. In 2006, Netflix released a large data set to the public and offered one million dollars for significant improvements on their system. In 2009, BellKor’s Pragmatic Chaos, a team of seven, won the prize by combining individual methods. Dr. Robert Bell, with whom we collaborated, was a member of the winning team and provided us with the data set used in this project, consisting of a sparse matrix with rows of users and columns of movies. The entries of the matrix are ratings given to movies by certain users. The objective is to obtain a model that predicts future ratings a user might give for a specific movie. This model is known as a collaborative filtering model, which encompasses the average movie rating (mu), the rating bias of the user (b), the overall popularity of a movie (a), and the interaction between user preferences (p) and movie characteristics (q). Two methods, Alternating Least Squares and Stochastic Gradient Descent, were used to estimate each parameter in this non-linear regression model. Each method fits characteristic vectors for movies and users from the existing data. The overall focus of this project is to explore the two methods, and to investigate the suitability of parallel computing utilizing the cluster Tara in the UMBC High Performance Computing Facility. | en_US |
| dc.description.uri | https://userpages.umbc.edu/~gobbert/papers/REU2010Team2.pdf | en_US |
| dc.format.extent | 10 pages | en_US |
| dc.genre | Technical Report | en_US |
| dc.identifier | doi:10.13016/M2N29PB0D | |
| dc.identifier.uri | http://hdl.handle.net/11603/11685 | |
| dc.language.iso | en_US | en_US |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Mathematics Department Collection | |
| dc.relation.ispartof | UMBC Faculty Collection | |
| dc.relation.ispartof | UMBC Student Collection | |
| dc.relation.ispartofseries | HPCF Technical Report;HPCF-2010-22 | |
| dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
| dc.subject | Parallelization | en_US |
| dc.subject | Matrix Factorization | en_US |
| dc.subject | Recommender Systems | en_US |
| dc.subject | UMBC High Performance Computing Facility (HPCF) | en_US |
| dc.subject | Alternating Least Squares | en_US |
| dc.subject | Stochastic Gradient Descent | en_US |
| dc.subject | mathematically predicting user preference | |
| dc.title | Parallelization of Matrix Factorization for Recommender Systems | en_US |
| dc.type | Text | en_US |
