ACM CHI Conference on Human Factors in Computing Systems 2026
University of Oulu • Carleton University • IST, University of Lisbon & IHA-NOVA FCSH / IN2PAST, Portugal
Figure 1: Default images in Midjourney. Varied, seemingly unrelated prompts lead to visually similar outputs (the "default image"), motivating our exploration of this behavior in text-to-image generation.
In the creative practice of text-to-image (TTI) generation, images are synthesized from textual prompts. By design, TTI models always yield an output, even if the prompt contains unknown terms. In such cases, the model may generate default images: images that closely resemble each other across many unrelated prompts. Studying default images is valuable for designing better solutions for prompt engineering and TTI generation.
We present the first investigation into default images on Midjourney. We describe an initial study, in which we manually created input prompts that trigger default images, followed by several ablation studies. Building on these, we conduct a computational analysis of over 750,000 images, revealing consistent default images across unrelated prompts. We also conduct an online user study investigating how default images may affect user satisfaction.
Dataset: https://huggingface.co/datasets/tti-dev/default-images
Default images are visually similar outputs resulting from dissimilar prompts. They occur when TTI models encounter ambiguous or unknown terms (e.g., words outside the training data) and "collapse" to a specific region in the latent space.
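The definition above (similar images, dissimilar prompts) can be illustrated with a small sketch. This is not the method used in the paper. It assumes each generation already has a prompt embedding and an image embedding from some encoder. The function names, the thresholds `image_min` and `prompt_max`, and the toy vectors are all hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def default_image_candidates(items, image_min=0.9, prompt_max=0.3):
    """Return id pairs whose images are similar while their prompts are not.

    items: list of (item_id, prompt_embedding, image_embedding) tuples.
    image_min, prompt_max: hypothetical thresholds, not values from the paper.
    """
    pairs = []
    for (id_a, p_a, i_a), (id_b, p_b, i_b) in combinations(items, 2):
        images_alike = cosine(i_a, i_b) >= image_min
        prompts_differ = cosine(p_a, p_b) <= prompt_max
        if images_alike and prompts_differ:
            pairs.append((id_a, id_b))
    return pairs


# Toy, made-up embeddings purely for illustration (not real data).
items = [
    ("a", [1.0, 0.0], [0.9, 0.1]),    # prompt A, image near "default"
    ("b", [0.0, 1.0], [0.88, 0.12]),  # unrelated prompt, near-identical image
    ("c", [1.0, 0.1], [0.0, 1.0]),    # prompt close to A, different image
]
print(default_image_candidates(items))
```

In this toy setup only the pair `a`/`b` is flagged, because their prompts are orthogonal while their image vectors nearly coincide. The all-pairs loop is quadratic, so a collection with hundreds of thousands of images would need an approximate nearest-neighbor index or clustering instead.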
We systematically created 130 prompts across six categories likely to trigger default images:
We analyzed over 750,000 images collected from Midjourney Discord channels.
Results from our user study (N=48) showing mean satisfaction scores (1-7 scale) across different conditions.
Figure 8: Users are significantly dissatisfied when default images appear (Q4) or when the image deviates noticeably from the prompt (Q2).
Through affinity diagramming, we identified specific recurring motifs. These images appear repeatedly for unrelated prompts.
Lady-Birdhead
Floating-Head
Psychedelic-Eye
Eagle-Circle
Growth-Face
Animal-Bush
Standing-Lady
Mirror-Lady
Headpiece-Lady
Fantasy-Castle
Figure 4: Set of default images. We assigned descriptive labels to these recurring outputs.
@inproceedings{defaultimages,
title={An Exploration of Default Images in Text-to-Image Generation},
author={Simonen, Hannu and Kiviniemi, Atte and Johnston, Hannah and Barranha, Helena and Oppenlaender, Jonas},
year={2026},
booktitle={ACM CHI Conference on Human Factors in Computing Systems},
publisher={ACM},
address={New York, NY, USA},
doi={10.1145/3772318.3790681},
eprint={2505.09166},
archivePrefix={arXiv},
url={https://arxiv.org/abs/2505.09166},
}