Controlled comparison · precision only

Krea 2 Turbo — fp8 vs int8 on a single NVIDIA GeForce RTX 3090

Resolution 1024²Seed 42 Runs/model 9 cleanDate 2026-06-24
Controlled variable: only the diffusion-model precision changes (fp8_scaled → int8 ConvRot). Same model, same er_sde sampler, same 8 steps, seed, prompt, and resolution — so the difference is attributable to precision alone.
1.92× faster
int8 generates the same image in 48% less time than fp8 on this RTX 3090
+97%
throughput (it/s)
7.1 s
saved / image
+0.3 GB
VRAM vs fp8

Throughput — it/s (higher = faster)

Seconds per image (lower = faster)

Mean peak VRAM

Avg power draw

Per-run consistency (clean runs)

Flat, separated lines = stable measurement; the gap between them is the speedup.

Same prompt · same seed (42) · fp8 vs int8

Identical seed both sides — visual quality is comparable; the difference is speed, not output.
fp8_scaled 14.82 s · 0.65 it/s
int8 ConvRot 7.70 s · 1.27 it/s