YouTube video recommendation using content base collaborative filtering in python

YouTube video recommendation using content base collaborative filtering in python

PROJECT ID: PYTHON30

PROJECT NAME: YouTube video recommendation using content base collaborative filtering in python

PROJECT CATEGORY: MCA / BCA / BCCA / MCM / POLY / ENGINEERING

PROJECT ABSTRACT:

Recommender systems are among the most popular applications of data science today. They are used to predict the "rating" or "preference" that a user would give to an item. Almost every major tech company has applied them in some form. Amazon uses it to suggest products to customers, YouTube uses it to decide which video to play next on autoplay, and Facebook uses it to recommend pages to like and people to follow.

What's more, for some companies like Netflix, Amazon Prime, Hulu, and Hotstar, the business model and its success revolves around the potency of their recommendations. Netflix even offered a million dollars in 2009 to anyone who could improve its system by 10%.

There are also popular recommender systems for domains like restaurants, movies, and online dating. Recommender systems have also been developed to explore research articles and experts, collaborators, and financial services. YouTube uses the recommendation system at a large scale to suggest you videos based on your history. For example, if you watch a lot of educational videos, it would suggest those types of videos.

But what are these recommender systems?

Broadly, recommender systems can be classified into 3 types:

Simple recommenders: offer generalized recommendations to every user, based on movie popularity and/or genre. The basic idea behind this system is that movies that are more popular and critically acclaimed will have a higher probability of being liked by the average audience. An example could be IMDB Top 250.

Content-based recommenders: suggest similar items based on a particular item. This system uses item metadata, such as genre, director, description, actors, etc. for movies, to make these recommendations. The general idea behind these recommender systems is that if a person likes a particular item, he or she will also like an item that is similar to it. And to recommend that, it will make use of the user's past item metadata. A good example could be YouTube, where based on your history, it suggests you new videos that you could potentially watch.

Collaborative filtering engines: these systems are widely used, and they try to predict the rating or preference that a user would give an item-based on past ratings and preferences of other users. Collaborative filters do not require item metadata like its content-based counterparts.

Simple Recommenders

As described in the previous section, simple recommenders are basic systems that recommend the top items based on a certain metric or score. In this section, you will build a simplified clone of IMDB Top 250 Movies using metadata collected from IMDB.

The following are the steps involved:

1. Decide on the metric or score to rate movies on.

2. Calculate the score for every movie.

3. Sort the movies based on the score and output the top results.

SOFTWARE REQUIREMENTS:

OS : Windows

Python IDE : Python 2.7.x and above

Language : Python Programming

Database : MYSQL

HARDWARE REQUIREMENTS:

RAM : 4GB and Higher

Processor : Intel i3 and above

Hard Disk : 500GB Minimum

Setting up Software Environment

Python is a high level, interpreted, interactive and object-oriented scripting language.

Python is designed to be highly readable and has fewer syntactical constructions than other languages. Python is used in the development of this model. In this experiment, the following python libraries are used to develop the machine learning models:

• NLTK: It is a python package which works with human language data and provides an easy-to-use interface to different lexical resources like WordNet and text processing libraries. These lexical resources are used for classification, tokenization, stemming, tagging, parsing, and semantic reasoning [23].

• Pandas: It is a python package which acts as a data analysis tool and deals with data structures. Pandas carry out entire data analysis workflow in Python without having to switch to a more domain specific language like R [46].

• Tweepy: It is used in accessing the Twitter API by establishing the connection and to gather tweets from Twitter [24]. This module is used to stream live tweets directly from Twitter in real-time.

• Numpy: NumPy is the fundamental package for computing with Python. It is used to add support to multi-dimensional arrays and matrices, with a large collection of high-level mathematical functions [47].

• scikit-learn: It is a simple and efficient tool for data mining and data analysis [47]. • matplotlib python library which generates plots, histograms, power spectra, bar charts, etc.

In this work matplotlib.pyplot module is used to plot the metrics [47].

• Gensim It is used to automatically extract semantic topics from documents, as efficiently as possible. Gensim is designed to process raw, unstructured text data. The algorithms in Gensim, such as Word2Vec where it automatically discovers the semantic structure of phrase by examining statistical co-occurrence patterns within a corpus of training documents. These algorithms are unsupervised. Once these statistical patterns are found, any plain text documents can be succinctly expressed in the new, semantic representation and queried for topical similarity against other documents [48].

• Keras: Keras is a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. It was developed with a focus on enabling fast experimentation. Being able to go from idea to result with the least possible delay is key to doing good research [49]

TABLE OF CONTENTS

· Title Page

· Declaration

· Certification Page

· Dedication

· Acknowledgements

· Table of Contents

· List of Tables

· Abstract

CHAPTER SCHEME

CHAPTER ONE: INTRODUCTION

CHAPTER TWO: OBJECTIVES

CHAPTER THREE: PRELIMINARY SYSTEM ANALYSIS

· Preliminary Investigation

· Present System in Use

· Flaws In Present System

· Need Of New System

· Feasibility Study

· Project Category

CHAPTER FOUR: SOFTWARE ENGINEERING AND PARADIGM APPLIED

· Modules

· System / Module Chart

CHAPTER FIVE: SOFTWARE AND HARDWARE REQUIREMENT

CHAPTER SIX: DETAIL SYSTEM ANALYSIS

· Data Flow Diagram

· Number of modules and Process Logic

· Data Structures and Tables

· Entity- Relationship Diagram

· System Design

· Form Design

· Source Code

· Input Screen and Output Screen

CHAPTER SEVEN: TESTING AND VALIDATION CHECK

CHAPTER EIGHT: SYSTEM SECURITY MEASURES

CHAPTER NINE: IMPLEMENTATION, EVALUATION & MAINTENANCE

CHAPTER TEN: FUTURE SCOPE OF THE PROJECT

CHAPTER ELEVEN: SUGGESTION AND CONCLUSION

CHAPTER TWELE: BIBLIOGRAPHY& REFERENCES

Other Information

PROJECT SOFWARE	ZIP
PROJECT REPORT PAGE	60 -80 Pages
CAN BE USED IN	Marketing (MBA)
PROJECT COST	1500/- Only
PDF SYNOPSIS COST	250/- Only
PPT PROJECT COST	300/- Only
PROJECT WITH SPIRAL BINDING	1750/- Only
PROJECT WITH HARD BINDING	1850/- Only
TOTAL COST (SYNOPSIS, SOFTCOPY, HARDBOOK, and SOFTWARE, PPT)	2500/- Only
DELIVERY TIME	1 OR 2 Days (In case Urgent Call: 8830288685)
SUPPORT / QUERY	www.projectsready.in
CALL	8830288685
Email	help@projectsready.in
[Note: We Provide Hard Binding and Spiral Binding only Nagpur Region]	Download

Search This Blog

Mca project download with source code

YouTube video recommendation using content base collaborative filtering in python

Other Information

Comments

Post a Comment

Popular posts from this blog

Online Salon & Spa Booking System

Fake Review Identification in php

E-Post Office System in php