GitHub

Caption Generation using VGG16 and LSTM

MD Muhaimin Rahman contact: sezan92[at]gmail[dot]com

In this project, I have tried to work on Caption generation of Images of Flickr_8k dataset. I took extensive help from Jason Brownlee's Blog article on the same dataset. But I thought some codeblocks were unnecessarily complex . So I changed them for my project. The main architecture is mainly taken from Googles paper,

Some Examples:

Dataset

I have used Flickr8k dataset, which I cannot redistribute. You have to fillup this form and they will give you the dataset. You have to keep the folders Flicker8k_Dataset and Flickr_Text inside the dataset folder.

Further Improvement:

Till now, I have used features extracted from the VGG16 Model and trained on them . I think fully trainable model should improve the results which I am looking forward to work in future , God Willing.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Caption+Generation_Test.py		Caption+Generation_Test.py
Caption0.jpg		Caption0.jpg
Caption1.jpg		Caption1.jpg
Caption10.jpg		Caption10.jpg
Caption11.jpg		Caption11.jpg
Caption12.jpg		Caption12.jpg
Caption2.jpg		Caption2.jpg
Caption3.jpg		Caption3.jpg
Caption4.jpg		Caption4.jpg
Caption5.jpg		Caption5.jpg
Caption6.jpg		Caption6.jpg
Caption7.jpg		Caption7.jpg
Caption8.jpg		Caption8.jpg
Caption9.jpg		Caption9.jpg
Caption_Generation.py		Caption_Generation.py
Caption_Generation_VGG.ipynb		Caption_Generation_VGG.ipynb
FlowChart2.jpeg		FlowChart2.jpeg
README.md		README.md
README.md~		README.md~
model.png		model.png

sezan92/CaptionGeneration

Folders and files

Latest commit

History

Repository files navigation

Caption Generation using VGG16 and LSTM

Dataset

Further Improvement:

About

Resources

Stars

Watchers

Forks

Languages