sezan92/CaptionGeneration

Caption Generation using VGG16 and LSTM

MD Muhaimin Rahman contact: sezan92[at]gmail[dot]com

In this project, I worked on caption generation for images from the Flickr_8k dataset. I took extensive help from Jason Brownlee's blog article on the same dataset, but I thought some code blocks were unnecessarily complex, so I changed them for my project. The architecture is taken mainly from Google's paper.

Some Examples: Caption1 Caption2 Caption3 Caption8 Caption13

Dataset

I used the Flickr8k dataset, which I cannot redistribute. You have to fill out this form, and they will give you the dataset. Keep the folders Flicker8k_Dataset and Flickr_Text inside the dataset folder.
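Before running anything, it may help to confirm that the folder layout matches the description above. The following is a minimal sketch using only the Python standard library; the helper name `missing_dataset_dirs` and the default root `dataset` (relative to the working directory) are assumptions for illustration, not part of the project code.

```python
from pathlib import Path

# Folder names as given in this README (note the spelling "Flicker8k_Dataset").
REQUIRED_DIRS = ("Flicker8k_Dataset", "Flickr_Text")


def missing_dataset_dirs(root="dataset"):
    """Return the required folder names that are not present under `root`.

    `root` defaults to a `dataset` folder in the current working directory,
    which is an assumption about where the scripts are run from.
    """
    root = Path(root)
    return [name for name in REQUIRED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_dataset_dirs()
    if missing:
        print("Missing folders inside 'dataset':", ", ".join(missing))
    else:
        print("Dataset folders found.")
```

An empty list means both folders are in place.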

Further Improvement:

So far, I have only trained on features extracted from the VGG16 model, so VGG16 itself is not trained. I think a fully trainable model should improve the results, and I look forward to working on that in the future, God willing.
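The difference between the two approaches can be sketched as follows. This is only an illustration, assuming Keras through TensorFlow and assuming the features come from VGG16's 4096-dimensional `fc2` layer; this README does not state either, and `build_encoder` is a hypothetical helper, not a function from the project.

```python
def build_encoder(trainable=False, weights="imagenet"):
    """Return VGG16 cut off at its `fc2` layer (a 4096-d feature vector).

    trainable=False: frozen feature extractor, as described above.
    trainable=True:  VGG16 weights are updated along with the captioner.
    """
    # Imported lazily so this file can be loaded without TensorFlow installed.
    from tensorflow.keras.applications import VGG16
    from tensorflow.keras.models import Model

    base = VGG16(weights=weights)  # include_top=True keeps the fc1/fc2 layers
    encoder = Model(inputs=base.input, outputs=base.get_layer("fc2").output)
    encoder.trainable = trainable  # applies to every layer in the encoder
    return encoder
```

With a frozen encoder, the features can be computed once per image and stored, which keeps training cheap. With `trainable=True`, the images have to pass through VGG16 on every training step, so memory and compute costs rise considerably.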
