Parallelization of Matrix Factorization for Recommender Systems

dc.contributor.authorBaum, Julia
dc.contributor.authorCook, Cynthia
dc.contributor.authorCurtis, Michael
dc.contributor.authorEdgerton, Joshua
dc.contributor.authorRabidoux, Scott
dc.contributor.authorRaim, Andrew M.
dc.contributor.authorNeerchal, Nagaraj K.
dc.contributor.authorBell, Robert M.
dc.date.accessioned2018-10-25T14:35:11Z
dc.date.available2018-10-25T14:35:11Z
dc.date.issued2010
dc.description.abstractRecommender systems are emerging as important tools for improving customer satisfaction by mathematically predicting user preferences. Several major corporations including Amazon.com and Pandora use these types of systems to suggest additional options based on current or recent purchases. Netflix uses a recommender system to provide its customers with suggestions for movies that they may like, which are based on their previous ratings. In 2006, Netflix released a large data set to the public and offered one million dollars for significant improvements on their system. In 2009, BellKor’s Pragmatic Chaos, a team of seven, won the prize by combining individual methods. Dr. Robert Bell, with whom we collaborated, was a member of the winning team and provided us with the data set used in this project, consisting of a sparse matrix with rows of users and columns of movies. The entries of the matrix are ratings given to movies by certain users. The objective is to obtain a model that predicts future ratings a user might give for a specific movie. This model is known as a collaborative filtering model, which encompasses the average movie rating (mu), the rating bias of the user (b), the overall popularity of a movie (a), and the interaction between user preferences (p) and movie characteristics (q). Two methods, Alternating Least Squares and Stochastic Gradient Descent, were used to estimate each parameter in this non-linear regression model. Each method fits characteristic vectors for movies and users from the existing data. The overall focus of this project is to explore the two methods, and to investigate the suitability of parallel computing utilizing the cluster Tara in the UMBC High Performance Computing Facility.en_US
dc.description.urihttps://userpages.umbc.edu/~gobbert/papers/REU2010Team2.pdfen_US
dc.format.extent10 pagesen_US
dc.genreTechnical Reporten_US
dc.identifierdoi:10.13016/M2N29PB0D
dc.identifier.urihttp://hdl.handle.net/11603/11685
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Mathematics Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofseriesHPCF Technical Report;HPCF-2010-22
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectParallelizationen_US
dc.subjectMatrix Factorizationen_US
dc.subjectRecommender Systemsen_US
dc.subjectUMBC High Performance Computing Facility (HPCF)en_US
dc.subjectAlternating Least Squaresen_US
dc.subjectStochastic Gradient Descenten_US
dc.subjectmathematically predicting user preference
dc.titleParallelization of Matrix Factorization for Recommender Systemsen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
REU2010Team2.pdf
Size:
204.5 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: