A seq2seq model (a BERT-based encoder-decoder, built with HuggingFace) was trained on the Yelp Open Dataset to generate Yelp reviews.
- ~1.5M reviews (subset of the ~8M reviews from Yelp Open Dataset)
- ~100k businesses
- stars: the business's rating (1-5)
- funny: number of funny votes received
- elite level: total number of years the reviewer held elite status
- name: the business's name
- city: the city where the business is located
- categories: the business's categories
The six input features are concatenated into a single string for each example. The model's target output is the corresponding Yelp review text.
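A minimal sketch of the feature concatenation step described above. The field order, field names, and separator here are illustrative assumptions, not taken from the project's notebooks:

```python
def build_input(example):
    """Concatenate the six input features into a single string.

    The field order and the " | " separator are assumptions for
    illustration; the training notebook may format inputs differently.
    """
    fields = [
        str(example["stars"]),
        str(example["funny"]),
        str(example["elite_level"]),
        example["name"],
        example["city"],
        example["categories"],
    ]
    return " | ".join(fields)

example = {
    "stars": 5,
    "funny": 2,
    "elite_level": 3,
    "name": "Joe's Diner",
    "city": "Las Vegas",
    "categories": "Restaurants, Diners",
}
print(build_input(example))
# -> 5 | 2 | 3 | Joe's Diner | Las Vegas | Restaurants, Diners
```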
During training, the Yelp review is truncated to 128 tokens. During inference, the model generates tokens until it emits an EOS token or reaches a predetermined maximum length.
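The inference stopping rule can be sketched as a simple decoding loop. Here `next_token` is a hypothetical stand-in for the real model's per-step prediction, used only to show the EOS/max-length termination logic:

```python
EOS = "<eos>"  # assumed end-of-sequence marker

def generate(next_token, max_length=128):
    """Generate tokens until EOS is produced or max_length is reached."""
    tokens = []
    while len(tokens) < max_length:
        tok = next_token(tokens)
        if tok == EOS:  # stop early when the model signals end of sequence
            break
        tokens.append(tok)
    return tokens

# Dummy stand-in "model": emits "word" five times, then EOS.
def dummy_next_token(tokens):
    return "word" if len(tokens) < 5 else EOS

print(generate(dummy_next_token))
# -> ['word', 'word', 'word', 'word', 'word']
```

In the actual notebooks this loop is handled by the HuggingFace generation utilities rather than written by hand.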
To train a new model, use the seq2seq_train notebook. The Yelp Open Dataset needs to be downloaded first.
To generate reviews, use the seq2seq_predict notebook. The model trained in this project is downloaded automatically within the notebook, so you don't need to provide your own trained model.