Friend-recommendation-using-movie-data

This project is composed of two phases-

Querying of the movie dataset for simple statistics retrieval.
Friend recommendation based on user input and the dataset of users and their choices.

There are two datasets - ip.txt(User ratings dataset) and impfields.txt(Movie dataset) (ip.txt is of the form -> user_id, movie_id, rating, timestamp)

Querying of the dataset is done for 3 questions- Q1) Number of movies by a director Q2) Number of action movies in a year Q3) Avg rating of movies by a director

The friend recommendation system is built using the Hadoop framework. It consists of two mappers and two reducers. Mapper1 converts the dataset into key, value pairs of user_id, movie_id Reducer1 combines all the movies watched by a user Mapper2 will just pass the output of Reducer1 in the form of user_id, (movie_id1, movie_id2,...) Reducer2 will find the similarity between the mapper output and the data (movies liked) given by the user.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
bin		bin
input		input
input123		input123
output		output
output1		output1
outputq1		outputq1
outputq2		outputq2
outputq3		outputq3
src		src
BDA Report.docx		BDA Report.docx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bin

bin

input

input

input123

input123

output

output

output1

output1

outputq1

outputq1

outputq2

outputq2

outputq3

outputq3

src

src

BDA Report.docx

BDA Report.docx

README.md

README.md

Repository files navigation

Friend-recommendation-using-movie-data

About

Releases

Packages

Languages

philomathic-guy/Friend-recommendation-using-movie-data

Folders and files

Latest commit

History

Repository files navigation

Friend-recommendation-using-movie-data

About

Topics

Resources

Stars

Watchers

Forks

Languages