Skip to content

icgncl/Anomaly-Detection-in-Recommender-Systems

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Anomaly Detection in Recommendation Systems

It is a term project repository for CENG790 course in METU. In this project, the aim is detect anomalies in the Amazon Review Product dataset between user rating on the product and review text. For example, an user might give a low rating for a product because of courier company, not because of product. Firstly, the product ratings were analyzed according to rating and year. total_ratings_per_year

Also in the project, the product review texts were analyzed semantically with BERT. The output of NLP model is "modelRating" and the dataset has already product ratings which were given by users. With these two data, anomalies were detected in the project.

anomaly_detection

Then we have used ALS in order to recommend new magazines. The model was finetuned, best parameters found as iteration: 20, rank: 24, regParam: 0.5 for ALS. Finally, ALS model was used with original product ratings and ratings generated from product review texts with NLP.

The RMSE value with original product ratings: 1.649
The RMSE value with ratings generated from product review texts with NLP: 1.504

Dataset

In this project, Amazon Review Dataset[1] is used. The dataset was already divided to categories. In this project, "Magazine Subscriptions" category is used; however, the all categories can be used with this project.

References:

[1] Justifying recommendations using distantly-labeled reviews and fined-grained aspects. Jianmo Ni, Jiacheng Li, Julian McAuley Empirical Methods in Natural Language Processing (EMNLP), 2019

About

It is a term project repository for CENG790 course in METU

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages