Skip to content

Explaining the contents of an image in the form of speech through caption generation using Inception-V3 model , LSTM model ,Goggle Text-To-Speech API and playsound library .

License

Notifications You must be signed in to change notification settings

SARIT42/image-Annotation-Speech

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Image-Annotation-Speech

Explaining the contents of an image in the form of speech through caption generation using Inception-V3 model for image feature extraction, LSTM model for caption generation and Goggle Text-To-Speech API and playsound library for text to speech conversion.

To view/edit the full model,visit my kaggle notebook : Image Annotation Kaggle Notebook

Upvotes & Suggestions are appreciated!

About

Explaining the contents of an image in the form of speech through caption generation using Inception-V3 model , LSTM model ,Goggle Text-To-Speech API and playsound library .

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published