CS 228 Final Project

Enhancing Image Captioning with Deep Learning Models

Saul Gonzalez - sgonz081

Shaheriar Malik - smali032

Dataset: https://www.kaggle.com/datasets/hsankesara/flickr-image-dataset

Abstract

Image captioning is a challenging task that involves generating descriptive text for images, which makes it more complex than image classification. To tackle it, we adopt a well-established approach that combines Convolutional Neural Networks (CNNs) with Long Short-Term Memory (LSTM) networks, further enhanced by an Attention layer within the decoder. This enables us to generate coherent and meaningful captions. We also use multiprocessing during the image retrieval and preprocessing stages, which substantially reduces training time: multiple images are fetched and preprocessed simultaneously, taking advantage of multi-core hardware.
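To illustrate what the Attention layer in the decoder does at a single decoding step, here is a minimal sketch in plain Python. It is not taken from the project notebook: the function names `softmax` and `attend`, the dot-product scoring, and the toy feature values are all assumptions chosen for brevity, and the actual model may well use a different scoring function (for example, additive attention).

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(
    region_features: Sequence[Sequence[float]],
    decoder_state: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Return (attention weights, context vector) for one decoding step.

    region_features: one feature vector per image region (CNN encoder output).
    decoder_state:   the LSTM hidden state at the current step.
    Scoring is a plain dot product, which is an assumption of this sketch.
    """
    # Score each image region against the current decoder state.
    scores = [
        sum(f * h for f, h in zip(region, decoder_state))
        for region in region_features
    ]
    # Normalize the scores so the weights sum to 1.
    weights = softmax(scores)
    # The context vector is the weighted sum of the region features.
    dim = len(region_features[0])
    context = [
        sum(w * region[d] for w, region in zip(weights, region_features))
        for d in range(dim)
    ]
    return weights, context


# Toy input: three image regions with 2-dimensional features (made-up numbers).
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
state = [2.0, 0.0]
weights, context = attend(regions, state)
```

In a captioning decoder, the context vector would typically be combined with the LSTM state before predicting the next word, so that each word can focus on different image regions.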

Below are a diagram of our model and three randomly picked images with captions generated by our model:
