Logo
  • Home
  • About Us
    • Aim and Scope
    • Research Area
    • Impact Factor
    • Indexing
  • For Authors
    • Authors Guidelines
    • How to publish paper?
    • Download Paper format
    • Submit Manuscript
    • Processing Charges
    • Download Copyrights Form
    • Submit Payment-Copyrights
  • Archives
    • Current Issues
    • Past Issues
    • Conference Issues
    • Special Issues
    • Advance Search
  • IJARIIE Board
    • Join as IJARIIE Board
    • Advisory Board
    • Editorial Board
    • Sr. Reviewer Board
    • Jr. Reviewer Board
  • Proposal
    • Conferece Proposal
    • Special Proposal
    • Faqs
  • Contact Us
  • Payment Detail

Call for Papers:Vol.12 Issue.2

Submission
Last date
28-Apr-2026
Acceptance Status In One Day
Paper Publish In Two Days
Submit ManuScript

News & Updates

Submit Article

Dear Authors, Article publish in our journal for Volume-12,Issue-2. For article submission on below link: Submit Manuscript


Join As Board

Dear Reviewer, You can join our Reviewer team without given any charges in our journal. Submit Details on below link: Join As Board


Paper Publication Charges

IJARIIE APP
Download Android App

For Authors

  • How to Publish Paper
  • Submit Manuscript
  • Processing Charges
  • Submit Payment

Archives

  • Current Issue
  • Past Issue

IJARIIE Board

  • Member Of Board
  • Join As Board

Downloads

  • Authors Guidelines
  • Manuscript Template
  • Copyrights Form

Android App

Download IJARIIE APP
  • Authors
  • Abstract
  • Citations
  • Downloads
  • Similar-Paper

Authors

Title: :  "VocPix: Image Captioning using CNN and LSTM"
PaperId: :  19942
Published in:   International Journal Of Advance Research And Innovative Ideas In Education
Publisher:   IJARIIE
e-ISSN:   2395-4396
Volume/Issue:    Volume 9 Issue 2 2023
DUI:    16.0415/IJARIIE-19942
Licence: :   IJARIIE is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Author NameAuthor Institute
Bhushan Arun AmbhoreSinhgad College Of Engineering, Pune
Linal Anil PatilSinhgad College Of Engineering, Pune
Manasi Shekhar PatilSinhgad College Of Engineering, Pune
Niket Sitaram SharmaSinhgad College Of Engineering, Pune
Hitesh E. ChaudhariSinhgad College Of Engineering, Pune

Abstract

Computer Engineering
Deep learning, CNN, LSTM, Machine learning, Neural Networks, Text-To-Speech
With billions of users on sites like Facebook, Twitter, Instagram, and YouTube, social media has become an essential component of modern life. Social media has fundamentally altered how we communicate, share information, and engage with one another. It has also changed how we both consume and produce material. Therefore, an image caption generator has become essential in today's culture because it is necessary for social media addicts or people who are blind. It is a kind of algorithm or software program that makes use of Deep Learning techniques and analyses an image's visual information before converting it into plain language. It can be used as a plugin on the popular social networking sites of today to suggest appropriate captions for users to include with their postings. The goal of the suggested research is to create an image caption, also known as a description of an image, and to translate it into different languages, using CNN-LSTM architecture. To use the current word as input for the prediction of the following word, CNN layers will help in retrieving input data, and LSTM will extract essential information as it processes input. Python 3 and machine learning will be the programming languages used. This study will go into great detail about the many Neural networks involved, including their structures and functions. The program that converts created captions into spoken words is called a text-to-speech synthesizer, and it uses Natural Language Processing (NLP) to analyze and process the text. The text is subsequently converted into a synthesized speech representation using digital signal processing (DSP) technology. Here, we've created a practical text-to-speech synthesizer in the form of an easy-to-use application that reads aloud generated captions as synthesized speech. The proposed deep learning approach aims at generating the best caption for a particular image by analyzing and extracting various features from images and converting that textual caption into speech using Text-To-Speech (TTS).

Citations

Copy and paste a formatted citation or use one of the links to import into a bibliography manager and reference.

IJARIIE Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. ""VocPix: Image Captioning using CNN and LSTM"" International Journal Of Advance Research And Innovative Ideas In Education Volume 9 Issue 2 2023 Page 2873-2880
MLA Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. ""VocPix: Image Captioning using CNN and LSTM"." International Journal Of Advance Research And Innovative Ideas In Education 9.2(2023) : 2873-2880.
APA Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, & Hitesh E. Chaudhari. (2023). "VocPix: Image Captioning using CNN and LSTM". International Journal Of Advance Research And Innovative Ideas In Education, 9(2), 2873-2880.
Chicago Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. ""VocPix: Image Captioning using CNN and LSTM"." International Journal Of Advance Research And Innovative Ideas In Education 9, no. 2 (2023) : 2873-2880.
Oxford Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. '"VocPix: Image Captioning using CNN and LSTM"', International Journal Of Advance Research And Innovative Ideas In Education, vol. 9, no. 2, 2023, p. 2873-2880. Available from IJARIIE, http://ijariie.com/AdminUploadPdf/_VocPix__Image_Captioning_using_CNN_and_LSTM__ijariie19942.pdf (Accessed : 19 September 2024).
Harvard Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. (2023) '"VocPix: Image Captioning using CNN and LSTM"', International Journal Of Advance Research And Innovative Ideas In Education, 9(2), pp. 2873-2880IJARIIE [Online]. Available at: http://ijariie.com/AdminUploadPdf/_VocPix__Image_Captioning_using_CNN_and_LSTM__ijariie19942.pdf (Accessed : 19 September 2024)
IEEE Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari, ""VocPix: Image Captioning using CNN and LSTM"," International Journal Of Advance Research And Innovative Ideas In Education, vol. 9, no. 2, pp. 2873-2880, Mar-App 2023. [Online]. Available: http://ijariie.com/AdminUploadPdf/_VocPix__Image_Captioning_using_CNN_and_LSTM__ijariie19942.pdf [Accessed : 19 September 2024].
Turabian Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. ""VocPix: Image Captioning using CNN and LSTM"." International Journal Of Advance Research And Innovative Ideas In Education [Online]. volume 9 number 2 (19 September 2024).
Vancouver Bhushan Arun Ambhore, Linal Anil Patil, Manasi Shekhar Patil, Niket Sitaram Sharma, and Hitesh E. Chaudhari. "VocPix: Image Captioning using CNN and LSTM". International Journal Of Advance Research And Innovative Ideas In Education [Internet]. 2023 [Cited : 19 September 2024]; 9(2) : 2873-2880. Available from: http://ijariie.com/AdminUploadPdf/_VocPix__Image_Captioning_using_CNN_and_LSTM__ijariie19942.pdf
BibTex EndNote RefMan RefWorks

Number Of Downloads


Last download on 9/19/2024 2:09:48 PM

Save in Google Drive

Similar-Paper

TitleArea of ResearchAuther NameAction
STATE AWARE MULTI-HOP ROUTING VIA DIGITAL TWIN FOR IOT NETWORKSINTERNET OF THINGS (WSN)PATEL NENSI RAKESHBHAI Download
Soil Quality Assessment Using Machine Learning & IoTComputer EngineeringKiran D Kshirsagar Download
Unsupervised Contribution-Oriented Learning Model for Social Influence DetectionComputer EngineeringSnehal Mahjaan Download
DESIGN AND IMPLEMENTATION OF A BLUETOOTH-CONTROLLED ROBOTIC CAR USING ARDUINOComputer Mr. Swapnil Sanjay Bafana Download
AI-DRIVEN DEEPFAKE IDENTIFICATION IN REAL TIMEComputer EngineeringPavan Gajanan Bhonde Download
RESQ-BOTComputer Engineering Tiparkar Prathamesh Navnath Download
A Critical Review and Modern Contextualization of the 2009 Distributed Real-Time Computer Network Architecture (DRNA)Computer EngineeringNandishwar EN Download
Block-Chain Based Document Verification System using IPFSComputer EngineeringAkash Santosh Devade Download
A COMPREHENSIVE REVIEW OF DUAL FEATURE-BASED INTRUSION DETECTION SYSTEM FOR IoT NETWORK SECURITYComputer Science and EngineeringShrinidhi Hegde Download
Civica AI: A Politician-Centric Grievance Redressal and Service DirectoryComputer EngineeringKaranjule Dhanashri Bhausaheb Download
A Deep Learning Framework for Mood-Based Music Recommendation via Facial Expression AnalysisComputer Vaibhav Ashok Bhangare Download
GREEN NETWORKING: ENERGY-EFFICIENT PROTOCOLS AND SUSTAINABLE NETWORK DESIGN: A COMPREHENSIVE REVIEWComputer Science and EngineeringPradeep Nayak Download
DIABETIC RETINOPATHY DETECTION USING MACHINE LEARNINGComputer EngineeringSiddharth Shukracharya Rokade Download
PERSONALITY PREDICTION USING MLComputer EngineeringTanvi Dashrath Bhagat Download
Crop Disease Detectioncomputer Mansi Sunil Sansare Download
12
For Authors
  • Submit Paper
  • Processing Charges
  • Submit Payment
Archive
  • Current Issue
  • Past Issue
IJARIIE Board
  • Member Of Board
  • Join As Board
Privacy and Policy
Follow us

Contact Info
  • +91-8401209201 (India)
  • +86-15636082010 (China)
  • ijariiejournal@gmail.com
  • M-20/234 Ami Appt,
    Nr.Naranpura Tele-Exch,
    Naranpura,
    Ahemdabad-380063
    Gujarat,India.
Copyright © 2026. IJARIIE. All Rights Reserved.