Facebook
TwitterWe introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The dataset includes 81,743 unique photos in 20,211 sequences, aligned to descriptive and story language. VIST is previously known as "SIND", the Sequential Image Narrative Dataset (SIND).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This version of the Bloom Library data is developed specifically for the Visual Story Telling (VIST) task. It includes data from 363 languages across 36 language families, with many of the languages represented being extremely low resourced languages.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Video Storytelling is a dataset for generating text story/summarization for videos containing social events. It consists of 105 videos from four categories: birthday, camping, Christmas and wedding. For each video, we provide at least 5 human-written stories.
Please cite the following paper if you use the Video Storytelling dataset in your work (papers, articles, reports, books, software, etc):
Facebook
TwitterVisual storytelling refers to the manner of describing a set of images rather than a single image, also known as multi-image captioning. Visual Storytelling Task (VST) takes a set of images as input and aims to generate a coherent story relevant to the input images. In this dataset, we bridge the gap and present a new dataset for expressive and coherent story creation. We present the Sequential Storytelling Image Dataset (SSID), consisting of open-source video frames accompanied by story-like annotations. In addition, we provide four annotations (i.e., stories) for each set of five images. The image sets are collected manually from publicly available videos in three domains: documentaries, lifestyle, and movies, and then annotated manually using Amazon Mechanical Turk. In summary, SSID dataset is comprised of 17,365 images, which resulted in a total of 3,473 unique sets of five images. Each set of images is associated with four ground truths, resulting in a total of 13,892 unique ground truths (i.e., written stories). And each ground truth is composed of five connected sentences written in the form of a story.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Visual and digital storytelling methods can reposition research participants as coproducers of knowledge, foster engagement and collaboration with marginalized peoples, and offer greater depth of self-expression. However, these methods are constituted in complex terrains of power. Without continual attenuation to power imbalances, the methods will contribute to the silencing and erasure of marginalized communities. This study outlines how reflexivity as a methodological tool and part of the Cultured-Centered Approach can enable the interrogation of terrains of power, allowing for the continual opening of democratic possibilities and community ownership of visual and digital storytelling infrastructures. Excerpts from the “Poverty Is Not Our Future” campaign illustrate the argument. The campaign's cocreated audio-visual advertisements communicate everyday stories of poverty among residents living in a poor suburban site in Auckland, Aotearoa New Zealand, and serve as a visual narrative of resistance to dominant structures. This study contributes to critical theorizing of culture and communication and the coconstruction of visual stories.
Facebook
Twitterhttps://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/
BLOOM VIST is a visual storytelling of books that consists of 62 languages indigenous to SEA. This dataset is owned by Bloom, a free, open-source software developed by SIL International and associated with Bloom Library, app, and services. This dataset is released with the LICENSE family of Creative Commons (although each story datapoints has its licensing in more detail, e.g cc-by, cc-by-nc, cc-by-nd, cc-by-sa, cc-by-nc-nd, cc-by-nc-sa). Before using this dataloader, please accept the… See the full description on the dataset page: https://huggingface.co/datasets/SEACrowd/bloom_vist.
Facebook
TwitterIntelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models (CVPR 2024)
This is the StorySalon dataset proposed in StoryGen. For the open-source PDF data, you can directly download the frames, corresponding masks, descriptions and original story narratives. For the data extracted from YouTube videos, we also provide their corresponding masks, descriptions and original story narratives in this repository. However, you need to refer to… See the full description on the dataset page: https://huggingface.co/datasets/haoningwu/StorySalon.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Supporting data for the article "The Illustrated Page: Analyzing Illustrations of Historical Children’s Books Using Citizen Science" (CHR 2025)
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Story maps have emerged as a popular storytelling device in recent years with cartographers and journalists leveraging geospatial web technologies to create unique spatial narratives. However, empirical research analyzing the design of story maps remains limited. Two recently proposed design frameworks provide promising avenues to characterize story maps in terms of elements of vivid cartography and techniques of map-based storytelling. In this article, I conducted a quantitative content analysis on 117 story maps of COVID-19 to operationalize map-based storytelling and vividness frameworks and to identify common design traits in contemporary story maps. My findings indicated that most story maps are longform infographics that use scrolling to advance the narrative. Stories applied a variety of attention, dosing, and mood techniques to enrich the storytelling experience. Story maps were primarily vivid through their use of color and novelty. Overall, most story maps utilized only a fraction of the map-based storytelling framework techniques. This research also demonstrated that it is challenging to analyze story maps based on these frameworks. Finally, this article improves the frameworks by proposing two new story map techniques and suggesting avenues of refinement.
Facebook
Twitterhttps://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global smart storytelling device market was valued at $4.2 billion in 2025 and is projected to reach $10.8 billion by 2034, expanding at a compound annual growth rate (CAGR) of 11.1% during the forecast period from 2026 to 2034. Smart storytelling devices encompass a broad spectrum of technology-enabled products including AI-driven audio companions, augmented-reality (AR) picture books, voice-activated interactive readers, and multisensory narrative platforms designed for children, adults, and elderly users across education, entertainment, and healthcare settings. The market is driven by converging trends: an accelerating shift toward experiential and personalized learning, the rapid integration of natural language processing (NLP) and machine learning algorithms into consumer electronics, heightened parental awareness of screen-time management, and a post-pandemic surge in at-home educational technology adoption. In 2025, over 320 million households globally reported using some form of interactive learning or storytelling device for children aged 2 to 12 years, underscoring the enormous addressable market. The proliferation of affordable broadband connectivity and the rollout of 5G networks in key markets have further enabled cloud-based content delivery, allowing devices to access near-unlimited narrative libraries without on-device storage constraints. Governments in markets such as the United States, Germany, Japan, South Korea, and India have introduced digital literacy initiatives and early childhood education mandates that explicitly include smart device integration, creating a favorable regulatory environment. Additionally, the growing prevalence of e-commerce platforms has dramatically lowered the barrier to market entry for emerging brands while simultaneously expanding consumer reach for established players such as LeapFrog, VTech, and Luka Inc. The convergence of physical and digital storytelling formats, exemplified by near-field communication (NFC)-enabled figurines paired with narrative apps, has opened new product innovation corridors that are expected to sustain double-digit revenue growth throughout the forecast period.
Facebook
Twitterhttps://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
📚 Russian Storytelling Video Dataset (700 participants)
This dataset contains full-body videos of 700 native Russian speakers engaged in unscripted storytelling. Participants freely tell personal stories, express a wide range of emotions, and naturally use facial expressions and hand gestures. Each video captures authentic human behavior in high resolution with high-quality audio.
📊 Sample
📺 Preview Video (10 Participants)
To get a quick impression of… See the full description on the dataset page: https://huggingface.co/datasets/MaratDV/video-dataset.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive analysis and strategic guide for mortgage lending professionals seeking to implement advanced content marketing strategies in 2025. This dataset provides detailed insights into video marketing, interactive content, visual storytelling, and specialized SEO techniques specifically tailored for the mortgage industry. The content addresses the evolution from generic blog posts to personalized, engaging content that builds trust with modern borrowers who expect authentic, helpful guidance through their home-buying journey.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This study investigates how storytelling strategies, particularly visual elements, platform-specific narratives, and authentic brand messages, influence consumer engagement and brand awareness. Employing a mixed-methods design, the research emphasizes quantitative data gathered through online questionnaires distributed via Google Forms. This approach allows for standardized responses from a diverse participant pool and facilitates the use of descriptive and inferential statistical methods to analyze the impact of storytelling on consumer engagement and brand awareness. Supplementary qualitative interviews provide additional context and depth. Key findings reveal that visually compelling storytelling increases consumer engagement, though clear visuals alone do not substantially enhance brand recall, suggesting that certain dimensions of visual storytelling are more influential than others. Moreover, digital media platforms significantly moderate the effectiveness of storytelling, indicating that tailoring narratives to platform-specific features amplifies consumer interaction. A positive, albeit not strong, correlation between narrative strategies and consumer trust suggests that real-life success stories and testimonials are beneficial in fostering credibility for both established and emerging brands. These results underscore the importance of integrating emotionally resonant content, creative storytelling approaches, and transparent communication to strengthen engagement and trust. By refining visual elements, leveraging platform-specific strategies, and adopting trust-building narratives, brands can more effectively captivate consumers and amplify their market presence. Keywords: Brand Storytelling, Consumer Engagement, Brand Awareness, Narrative Strategies, Visual Storytelling and Consumer Behavior.
Facebook
Twitterhttps://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy
Storyboarding Software Market Overview: The Storyboarding Software Market Size was valued at 935.9 USD Million in 2024. The Storyboarding Software Market is expected to grow from 1,023 USD Million in 2025 to 2,500 USD Million by 2035. The Storyboarding Software Market CAGR (growth rate) is expected to be around 9.3% during the forecast period (2025 - 2035). Key Storyboarding Software Market Trends Highlighted The Global Storyboarding Software Market is witnessing significant growth driven by the increasing need for visual communication in various industries, including film, animation, advertising, and gaming. The rapid adoption of digital tools for content creation has become a key market driver, enabling creators to streamline their storytelling process and enhance collaboration. Additionally, the integration of artificial intelligence and machine learning capabilities into storyboarding software is providing users with innovative features, such as automated layout suggestions and real-time feedback, further attracting professionals looking to improve their workflow. Opportunities within the global market include the expansion of software offerings tailored for specific sectors, like education and corporate training, which require customized storyboarding solutions. With the rise of e-learning and remote working arrangements, educational institutions and businesses are seeking tools that facilitate clear and effective communication of ideas through storyboards. Innovations in cloud-based technology are also creating avenues for collaboration and sharing, making it easier for teams to work together from different locations, thus improving productivity and the quality of final outputs. Trends in recent times indicate a shift towards user-friendly interfaces and templates designed for novice users, enabling a broader demographic to utilize storyboarding software.The increasing focus on visual storytelling in marketing strategies is compelling companies to explore these tools, aiming to create engaging content that resonates with their audience. As the demand for high-quality content continues to rise across various platforms, the Global Storyboarding Software Market is expected to expand, reflecting the ongoing transformation in digital content creation practices worldwide. Source: Primary Research, Secondary Research, WGR Database and Analyst Review Storyboarding Software Market Segment Insights: Storyboarding Software Market Regional Insights The Regional segmentation of the Global Storyboarding Software Market reveals that North America dominates the market with a significant valuation of 460 USD Million in 2024 and is projected to reach 1,150 USD Million by 2035. This region's growth can be attributed to its robust technology adoption and high demand for innovative content creation solutions across various industries. Europe is experiencing steady expansion driven by increasing digital storytelling trends and a growing emphasis on media and entertainment applications. In APAC, the market shows moderate increase as businesses recognize the value of visual planning tools for project management and creative processes.Meanwhile, South America is beginning to see gradual growth, with businesses slowly adopting storyboarding software to enhance their creative outputs. The Middle East and Africa, although smaller in scale, are also witnessing a progressive shift towards embracing digital storytelling techniques as companies aim to improve their communication strategies. In summary, North America remains the leader in the Global Storyboarding Software Market segment, along with Europe and APAC showing significant potential for future growth. Source: Primary Research, Secondary Research, WGR Database and Analyst Review North America : North America is witnessing a surge in the adoption of storyboarding software, primarily in the automotive and healthcare sectors. The integration of AIoT technologies is driving efficiency and innovation. Policies like the AI in Transportation initiative encourage advancements, with investment in smart manufacturing projected to reach USD 40 billion by 2025. Europe : Europe is focusing on enhancing user experiences in the creative sectors through advanced storyboarding tools. Urban surveillance and smart city initiatives support the integration of such technologies. Policies like the European Green Deal emphasize sustainability, driving investments in smart technologies to reach USD 50 billio
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In visual narratives like comics, the most overt form of perspective-taking comes in panels that directly depict the viewpoints of characters in the scene. We therefore examined these subjective viewpoint panels (also known as point-of-view panels) in a corpus of over 300 annotated comics from Asia, Europe, and the United States. In line with predictions that Japanese manga use a more “subjective” storytelling style than other comics, we found that more manga use subjective panels than other comics, with high proportions of subjective panels also found in Chinese, French, and American comics. In addition, panels with more “focal” framing, i.e. micro panels showing close ups and/or amorphic panels showing views of the environment, had higher proportions of subjective panels than panels showing wider views of scenes. These findings further show that empirical corpus analyses provide evidence of cross-cultural variation and reveal relationships across structures in the visual languages of comics.
Facebook
TwitterAttribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
This dataset is introduced by the paper "Understanding, Categorizing and Predicting Semantic Image-Text Relations". If you are using this dataset it in your work, please cite: @inproceedings{otto2019understanding, title={Understanding, Categorizing and Predicting Semantic Image-Text Relations}, author={Otto, Christian and Springstein, Matthias and Anand, Avishek and Ewerth, Ralph}, booktitle={In Proceedings of ACM International Conference on Multimedia Retrieval (ICMR 2019)}, year={2019} } To create the full tar use the following command in the command line: cat train.tar.part* > train_concat.tar Then simply untar it via tar -xf train_concat.tar The jsonl files contain metadata of the following format: id, origin, CMI, SC, STAT, ITClass, text, tagged text, image_path License Information: This dataset is composed of various open access sources as described in the paper. We thank all the original authors for their work. Pitt Image Ads Dataset: http://people.cs.pitt.edu/~kovashka/ads/ Image-Net challenge: http://image-net.org/ Visual Storytelling Dataset (VIST): http://visionandlanguage.net/VIST/ Wikipedia: https://www.wikipedia.org/ Microsoft COCO: http://cocodataset.org/#home
Facebook
Twitter
According to our latest research, the global visual content market size reached USD 72.7 billion in 2025, reflecting robust expansion driven by the growing digital ecosystem and the rising adoption of visual storytelling across industries. The market is registering a strong CAGR of 9.8% and is forecasted to reach USD 161.6 billion by 2034. This impressive growth trajectory is primarily propelled by the increasing demand for engaging, high-quality visual content in marketing, education, entertainment, and e-commerce, as organizations and individuals alike recognize the unparalleled impact of visuals in capturing attention and conveying information efficiently.
One of the most significant growth factors in the visual content market is the surging adoption of digital marketing strategies across diverse industries. Brands and businesses are increasingly leveraging visual content such as images, videos, infographics, and animations to enhance their digital presence, improve brand recall, and boost customer engagement. The proliferation of social media platforms like Instagram, TikTok, and YouTube has further intensified the need for visually appealing content, as these platforms prioritize visuals in their algorithms and user experiences. Moreover, the shift towards mobile-first content consumption has made bite-sized, visually rich formats such as GIFs and short videos indispensable for marketers aiming to capture and retain the fleeting attention of modern consumers. This trend is expected to continue driving the demand for visual content, as organizations seek innovative ways to differentiate themselves in a crowded digital landscape.
Another critical driver for the visual content market is the rapid advancement in content creation technologies, including artificial intelligence (AI), machine learning, and augmented reality (AR). These technologies have democratized the creation of high-quality visual assets, enabling even small businesses and individual content creators to produce professional-grade visuals without extensive technical expertise or large budgets. AI-powered tools can now automate tasks such as image enhancement, video editing, and content personalization, significantly reducing production times and costs. Generative AI in particular has emerged as a transformative force in 2025, with platforms integrating text-to-image and text-to-video capabilities that allow rapid, scalable content production. Additionally, the integration of AR and interactive visuals is opening new avenues for immersive storytelling, particularly in sectors like education, entertainment, and e-commerce. As these technologies continue to evolve, they are expected to further accelerate the adoption of visual content across a broader range of applications and end-users.
The increasing importance of data-driven decision-making is also fueling the growth of the visual content market. Organizations are leveraging visual analytics and infographics to simplify complex data sets and facilitate more effective communication of insights to stakeholders. Infographics and data visualizations have become essential tools for businesses, educators, and media organizations seeking to present information in a clear, compelling, and easily digestible manner. This trend is particularly pronounced in sectors such as publishing, finance, and healthcare, where the ability to quickly interpret and act on data is critical. As the volume and complexity of data continue to grow, the demand for visually intuitive content formats is expected to rise correspondingly, further boosting the market.
In the travel industry, Visual Content Management for Travel has become a vital component for engaging potential travelers and enhancing their experience. With the rise of digital platforms, travel agencies and tourism boards are leveraging visual content to showcase destinations, accommodations, and experiences in a more immersive way. High-quality images and videos allow potential travelers to visualize
Facebook
TwitterVisual storytelling engagement reaching 7.2% for Turkish travel influencers.
Facebook
Twitterhttps://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/
Illustrations by tubik.arts from https://www.behance.net/gallery/201500787/Character-Illustrations-for-Visual-Storytelling To be used for educational purposes. Original images are copyrighted.
Facebook
TwitterFounded in 2015, MGL Infographic operates in Media & Entertainment offering infographic design services that communicate data, ideas, or stories visually. The agency is comprised of experienced professionals including infographic scientists, data miners, and data analysts focused on delivering quality visual content. Customer satisfaction remains a top priority as they collaborate closely with clients to develop unique visual solutions. MGL Infographic aspires to go beyond conventional expectations in visual storytelling.
Facebook
TwitterWe introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The dataset includes 81,743 unique photos in 20,211 sequences, aligned to descriptive and story language. VIST is previously known as "SIND", the Sequential Image Narrative Dataset (SIND).