Manara - Qatar Research Repository
Browse

Text-to-picture tools, systems, and approaches: a survey

Download (2 MB)
journal contribution
posted on 2022-11-22, 21:13 authored by Jezia Zakraoui, Moutaz Saleh, Jihad Al Ja’am

Text-to-picture systems attempt to facilitate high-level, user-friendly communication between humans and computers while promoting understanding of natural language. These systems interpret a natural language text and transform it into a visual format as pictures or images that are either static or dynamic. In this paper, we aim to identify current difficulties and the main problems faced by prior systems, and in particular, we seek to investigate the feasibility of automatic visualization of Arabic story text through multimedia. Hence, we analyzed a number of well-known text-to-picture systems, tools, and approaches. We showed their constituent steps, such as knowledge extraction, mapping, and image layout, as well as their performance and limitations. We also compared these systems based on a set of criteria, mainly natural language processing, natural language understanding, and input/output modalities. Our survey showed that currently emerging techniques in natural language processing tools and computer vision have made promising advances in analyzing general text and understanding images and videos. Furthermore, important remarks and findings have been deduced from these prior works, which would help in developing an effective text-to-picture system for learning and educational purposes.

Other Information

Published in: Multimedia Tools and Applications
License: https://creativecommons.org/licenses/by/4.0
See article on publisher's website: http://dx.doi.org/10.1007/s11042-019-7541-4

History

Language

  • English

Publisher

Springer Science and Business Media LLC

Publication Year

  • 2019

Institution affiliated with

  • Qatar University

Usage metrics

    Manara - Qatar Research Repository

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC