Project Description

The overall goal of the project is to analyze and mine a Yelp review text data set to discover useful knowledge and solve a real-world data mining problem. I will work on a restaurant review data set provided by Yelp.com. The project builds on knowledge of various interesting data science and computer science topics, such as pattern discovery, clustering, text retrieval, text mining, and data visualization - all of which are necessary to effectively mine the data to discover interesting insights and meet the requirements described below.

Requirements

Opinion Visualization

DESCRIPTION

Explore and visualize the review content to understand what people have said in those reviews.

SUCCESS MEASURE

It can be shown that this requirement is met by producing a visualization of a topic model. A topic model can be used to help convey what reviewers have written about in these reviews. A topic model can also be used to help convey the similarities between topics

Cuisine Map Construction

DESCRIPTION

Mine the data set to understand the landscape of different types of cuisines and their similarities.

SUCCESS MEASURE

It can be shown that this requirement is met by constructing a cuisine map. A cuisine map can help convey the landscape of different types of cuisines and their similarities, including what cuisines are available and their relations. A visualization incorporating clustering can help convey major categories of cuisines.

Recommendation of Restaurants to Help People Decide Where to Dine Based on Dishes

DESCRIPTION

Mine the data set to rank restaurants for a specific dish.

SUCCESS MEASURE

It can be shown that this requirement is met by ranking dish names of a particular cuisine based on reviews that mention the dish names, e.g., by ranking the popular dishes with positive comments on the top. I can also rank restaurants offering particular dishes based on reviews about the dish so that the restaurant reviews with many positive comments about the dish are highly ranked. This requirement can be accomplished by creating a visualization showing the ranking of the recommended restaurants.

Recommendation of Restaurants to Help People Decide Where to Dine Based on Hygiene

DESCRIPTION

Use machine learning on hygiene data to predict the hygiene condition of a restaurant.

SUCCESS MEASURE

It can be shown that this requirement is met by training a classifier over the data to make binary predictions.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
requirement-1		requirement-1
requirement-2		requirement-2
requirement-3		requirement-3
requirement-4		requirement-4
utility-scripts		utility-scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project Description

Requirements

Opinion Visualization

DESCRIPTION

SUCCESS MEASURE

Cuisine Map Construction

DESCRIPTION

SUCCESS MEASURE

Recommendation of Restaurants to Help People Decide Where to Dine Based on Dishes

DESCRIPTION

SUCCESS MEASURE

Recommendation of Restaurants to Help People Decide Where to Dine Based on Hygiene

DESCRIPTION

SUCCESS MEASURE

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

josediaznunez/yelp-review-data-set-project

Folders and files

Latest commit

History

Repository files navigation

Project Description

Requirements

Opinion Visualization

DESCRIPTION

SUCCESS MEASURE

Cuisine Map Construction

DESCRIPTION

SUCCESS MEASURE

Recommendation of Restaurants to Help People Decide Where to Dine Based on Dishes

DESCRIPTION

SUCCESS MEASURE

Recommendation of Restaurants to Help People Decide Where to Dine Based on Hygiene

DESCRIPTION

SUCCESS MEASURE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages