Evalytic Bench: The Prompt Trap

Rank	Model	Visual Quality	Prompt Adherence	Text Rendering	Overall	Confidence
1	flux-schnell (winner)	4.2	4.7	3.9	4.3 (n=10)	94%
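The ranking row can be reproduced from the per-image score cards in this report. The sketch below assumes the aggregates are plain unweighted means, which matches the published numbers but is not confirmed as Evalytic's actual aggregation method; the variable names are illustrative only.

```python
from statistics import mean

# (visual_quality, prompt_adherence, text_rendering) per image, in report order.
scores = [
    (4.0, 5.0, 5.0), (5.0, 4.0, 5.0),  # cat: short, detailed
    (3.0, 5.0, 1.0), (4.0, 4.0, 5.0),  # coffee shop: short, detailed
    (5.0, 5.0, 5.0), (5.0, 5.0, 5.0),  # neon sign: short, detailed
    (5.0, 5.0, 5.0), (4.0, 5.0, 5.0),  # mountain lake: short, detailed
    (4.0, 5.0, 1.0), (3.0, 4.0, 2.0),  # robot: short, detailed
]

# Judge confidence (%) for the same cells, in the same order.
confidences = [
    (90, 100, 100), (95, 90, 100),
    (85, 95, 90), (85, 90, 100),
    (95, 100, 100), (95, 95, 100),
    (100, 100, 100), (80, 95, 100),
    (90, 95, 90), (90, 90, 90),
]

# Assumption: each column of the ranking table is an unweighted mean.
dimension_means = [round(mean(column), 1) for column in zip(*scores)]
overall = round(mean(s for row in scores for s in row), 1)
mean_confidence = round(mean(c for row in confidences for c in row))

print(dimension_means, overall, mean_confidence)
```

Under that assumption the three dimension means, the overall score, and the confidence figure all agree with the ranking row.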

A cat sitting on a windowsill at sunset

flux-schnell

4.7/5

1215ms · $0.0030

Score Details

Visual Quality: 4.0/5 (90% conf)
The image exhibits good visual quality with a pleasing composition and vibrant sunset colors. However, some minor inconsistencies in the cat's fur texture and paw rendering prevent it from achieving an excellent score.

Prompt Adherence: 5.0/5 (100% conf)
The image perfectly matches the prompt, depicting a cat sitting on a windowsill with a beautiful sunset in the background. All elements of the prompt are clearly and accurately represented.

Text Rendering: 5.0/5 (100% conf)
No text was expected to be rendered in the image based on the prompt, and no text is present. Therefore, the text rendering quality is excellent.
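The rationale above awards 5.0 for text rendering because no text was expected and none appeared. Five of the ten images in this report are scored that way, which lifts the Text Rendering average. As a rough reader-side re-cut (not something Evalytic reports), the sketch below splits the scores by whether the judge's rationale describes text in the image; the `has_text` flags are hand-labelled from the rationales.

```python
from statistics import mean

# (text_rendering score, has_text) per image, in report order.
# has_text is a hand-labelled assumption: True when the judge's rationale
# describes text in the image (license plate, neon sign, newspaper).
rows = [
    (5.0, False), (5.0, False),  # cat
    (1.0, True), (5.0, False),   # coffee shop (license plate in the short-prompt image)
    (5.0, True), (5.0, True),    # neon sign
    (5.0, False), (5.0, False),  # mountain lake
    (1.0, True), (2.0, True),    # robot with newspaper
]

with_text = [score for score, has_text in rows if has_text]
without_text = [score for score, has_text in rows if not has_text]

mean_all = round(mean(score for score, _ in rows), 1)
mean_with_text = round(mean(with_text), 1)
mean_without_text = round(mean(without_text), 1)

print(mean_all, mean_with_text, mean_without_text)
```

With only five images in each group, the split is indicative rather than conclusive.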

A fluffy orange tabby cat sitting gracefully on a white wooden windowsill during golden hour sunset, warm light streaming through glass, bokeh background of city skyline, photorealistic, 8K, ultra detailed

flux-schnell

4.7/5

1206ms · $0.0030

Score Details

Visual Quality: 5.0/5 (95% conf)
The image exhibits excellent visual quality, featuring a sharply rendered cat with realistic fur texture and expressive eyes. The dramatic sunset lighting is beautifully captured, creating a warm glow and effective rim lighting on the subject, contributing to a professional and coherent aesthetic.

Prompt Adherence: 4.0/5 (90% conf)
The image strongly adheres to most aspects of the prompt, featuring a fluffy orange cat on a white windowsill during golden hour with a bokeh city background. The only minor deviation is that the windowsill's texture isn't distinctly wooden.

Text Rendering: 5.0/5 (100% conf)
No text was expected in this image based on the provided prompt. The image successfully renders the scene without any unintended text artifacts.

Coffee shop interior, morning light

flux-schnell

3.0/5

1199ms · $0.0030

Score Details

Visual Quality: 3.0/5 (85% conf)
The image has an appealing aesthetic with warm lighting and a good composition, but suffers from several noticeable rendering inconsistencies. Key elements like the person and the car outside the window lack detail and appear somewhat smudged or generic.

Prompt Adherence: 5.0/5 (95% conf)
The image perfectly captures a coffee shop interior with beautiful morning light. The warm sunbeams streaming through the window and the long shadows create an authentic morning ambiance.

Text Rendering: 1.0/5 (90% conf)
The image contains a license plate on a car outside the window, but the text on it is completely garbled and unreadable. No other text is present in the image, and the expected text content from the prompt is not rendered.

Hyperrealistic coffee shop interior with exposed brick walls, reclaimed wood tables, barista making pour-over coffee, golden morning light streaming through floor-to-ceiling windows, steam rising, warm tones, cinematic composition, shot on Hasselblad

flux-schnell

4.3/5

1426ms · $0.0030

Score Details

Visual Quality: 4.0/5 (85% conf)
The image exhibits high visual quality, primarily due to its excellent golden hour lighting and warm, inviting atmosphere. While the overall scene is coherent and well-composed, minor issues include a slightly blurry and generic face on the person, and some background elements lacking fine detail.

Prompt Adherence: 4.0/5 (90% conf)
The image strongly adheres to most aspects of the prompt, depicting a coffee shop interior with a barista making pour-over coffee, exposed brick, and beautiful golden morning light. However, the windows are large but not strictly 'floor-to-ceiling' as specified, and the 'hyperrealistic' quality could be slightly more pronounced in some details.

Text Rendering: 5.0/5 (100% conf)
No text was expected in the image based on the provided prompt, and indeed, no text is present in the generated image. The image successfully avoids generating any garbled or unintended text.

A neon sign reading 'HELLO' on a brick wall

flux-schnell

5.0/5

772ms · $0.0030

Score Details

Visual Quality: 5.0/5 (95% conf)
The image exhibits excellent visual quality with professional rendering. The neon effect is highly realistic, featuring a bright core and a convincing glow that illuminates the surrounding brick wall naturally. There are no visible artifacts, distortions, or blurring.

Prompt Adherence: 5.0/5 (100% conf)
The image perfectly matches the prompt, featuring a clear neon sign that spells 'HELLO' mounted on a brick wall. All specified elements are present and accurately depicted.

Text Rendering: 5.0/5 (100% conf)
The text 'HELLO' is perfectly rendered, crisp, and accurate. Each letter is clearly formed with a consistent neon effect, making it easily readable.

A bright pink and blue neon sign reading 'HELLO' mounted on a dark red exposed brick wall in a trendy bar, slight glow and reflections, moody atmospheric lighting, shallow depth of field, professional photography

flux-schnell

5.0/5

751ms · $0.0030

Score Details

Visual Quality: 5.0/5 (95% conf)
The image exhibits excellent visual quality with sharp and coherent rendering of the neon sign. The realistic glow effect beautifully illuminates the detailed brick wall, creating a convincing and vibrant scene with no noticeable artifacts.

Prompt Adherence: 5.0/5 (95% conf)
The image perfectly matches all elements of the prompt, featuring a bright pink and blue neon 'HELLO' sign on a dark red brick wall with moody lighting, reflections, and a shallow depth of field, consistent with professional photography in a trendy bar setting.

Text Rendering: 5.0/5 (100% conf)
The text 'Hello' is perfectly rendered, crisp, and accurate. All letters are clearly legible, correctly spelled, and free from any rendering artifacts or distortions.

Mountain landscape with a lake

flux-schnell

5.0/5

1020ms · $0.0030

Score Details

Visual Quality: 5.0/5 (100% conf)
The image exhibits excellent visual quality, characterized by sharp details, natural colors, and a coherent composition. There are no visible artifacts, distortions, or inconsistencies typically associated with AI generation.

Prompt Adherence: 5.0/5 (100% conf)
The image is an excellent representation of the prompt, featuring a clear mountain landscape with a prominent lake in the foreground. All elements requested in the prompt are perfectly depicted.

Text Rendering: 5.0/5 (100% conf)
No text was expected or found within the image. The provided 'expected text content' was a description of the image, not text to be rendered.

Epic panoramic mountain landscape with crystal clear alpine lake reflecting snow-capped peaks, wildflowers in foreground, dramatic cumulus clouds, golden hour lighting, National Geographic quality, medium format film look

flux-schnell

4.7/5

756ms · $0.0030

Score Details

Visual Quality: 4.0/5 (80% conf)
The image presents a high-quality, aesthetically pleasing landscape with vibrant colors and good overall composition. However, upon closer inspection, minor AI-generated artifacts are visible, such as somewhat repetitive or smoothed textures on the distant mountains and slightly unnatural shapes in some foreground flowers.

Prompt Adherence: 5.0/5 (95% conf)
The image perfectly captures all key elements of the prompt, including the epic panoramic mountain landscape, crystal clear alpine lake with reflections, wildflowers in the foreground, and dramatic cumulus clouds. The lighting effectively conveys a golden hour feel, and the overall quality aligns with a National Geographic aesthetic and a medium format film look.

Text Rendering: 5.0/5 (100% conf)
No text was expected in this image based on the provided prompt, and indeed, no text is present in the generated image. The image focuses solely on the landscape elements.

A robot reading a newspaper

flux-schnell

3.3/5

778ms · $0.0030

Score Details

Visual Quality: 4.0/5 (90% conf)
The image exhibits high visual quality with excellent detail and rendering of the robot's metallic and plastic surfaces, along with consistent lighting. However, the text on the newspaper is largely illegible and nonsensical, which detracts slightly from overall realism.

Prompt Adherence: 5.0/5 (95% conf)
The image perfectly depicts a robot holding and looking at a newspaper, fulfilling all aspects of the prompt. The robot's design is detailed, and its posture clearly conveys the act of reading.

Text Rendering: 1.0/5 (90% conf)
The text on the newspaper is completely unreadable and appears as garbled characters, not forming any coherent words or sentences. It is clearly placeholder text rather than actual legible content.

A highly detailed humanoid robot with brushed steel skin and glowing blue eyes sitting in a mid-century modern chair reading The New York Times newspaper, soft directional lighting, photorealistic CGI render, octane render, 4K

flux-schnell

3.0/5

1041ms · $0.0030

Score Details

Visual Quality: 3.0/5 (90% conf)
The image features good overall rendering, lighting, and material quality for the robot and environment. However, the prominent text in the newspaper contains significant artifacts, including misspellings and unreadable gibberish, which detracts from the overall visual quality.

Prompt Adherence: 4.0/5 (90% conf)
The image largely adheres to the prompt, depicting a highly detailed humanoid robot with brushed steel skin and glowing blue eyes, sitting in a mid-century modern chair with appropriate lighting and render quality. The primary deviation is a text hallucination on the newspaper.

Text Rendering: 2.0/5 (90% conf)
While 'The New York Times' is correctly rendered on the right page of the newspaper, the left page displays 'The letrek Times', which is a significant misspelling and garbling of the expected text. This indicates partial readability with significant errors.
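Each subject in this report was generated twice, once from a short prompt and once from a detailed one. A minimal sketch of that paired comparison, using the per-image overall scores listed above, is shown below. The pairing and the names are illustrative; with one image per prompt and five pairs, the differences are far too few for any statistical claim.

```python
from statistics import mean

# Overall score per image as shown in the report: (short prompt, detailed prompt).
pairs = {
    "cat on windowsill": (4.7, 4.7),
    "coffee shop": (3.0, 4.3),
    "neon sign": (5.0, 5.0),
    "mountain lake": (5.0, 4.7),
    "robot with newspaper": (3.3, 3.0),
}

# Positive delta means the detailed prompt scored higher.
deltas = {name: round(detailed - short, 1) for name, (short, detailed) in pairs.items()}

improved = sum(1 for d in deltas.values() if d > 0)
unchanged = sum(1 for d in deltas.values() if d == 0)
worse = sum(1 for d in deltas.values() if d < 0)

mean_short = round(mean(short for short, _ in pairs.values()), 2)
mean_detailed = round(mean(detailed for _, detailed in pairs.values()), 2)

for name, delta in deltas.items():
    print(f"{name}: {delta:+.1f}")
print(improved, unchanged, worse, mean_short, mean_detailed)
```

On these numbers the detailed prompt helped in one pair, made no difference in two, and scored lower in two.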

Cost Summary

Category	Requests	Cost
fal.ai generation	10	$0.0300
gemini-2.5-flash judge	30	$0.0150
Total		$0.0450
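The cost table can be checked with a few lines of arithmetic. In the sketch below, the $0.0030 generation price is taken from the per-image lines in this report. The per-request judge price is derived by dividing the judge total by its 30 requests, which assumes every judge call costs the same.

```python
from decimal import Decimal

images = 10
generation_price = Decimal("0.0030")  # per image, from the per-image lines
judge_requests = 30                   # 3 dimensions x 10 images
judge_total = Decimal("0.0150")

generation_total = images * generation_price
# Assumption: every judge request is billed at the same price.
judge_price = judge_total / judge_requests

total = generation_total + judge_total
cost_per_image = total / images  # generation plus three judge calls

print(generation_total, judge_price, total, cost_per_image)
```

Decimal is used here so that the sums compare exactly with the dollar amounts in the table.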

Configuration

Models	flux-schnell
Judge	gemini-2.5-flash
Dimensions	visual_quality, prompt_adherence, text_rendering
Pipeline	text2img
Evalytic Version	0.2.1
Platform	darwin-arm64
