Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning