Stable Diffusion lineage (Stability AI + community)
Open weights · Hugging Face · ComfyUI / Auto1111
SD 1.5 (2022, latent diffusion), SDXL (2023, larger), SD 3 (2024, flow matching, MMDiT). Open-weights. The reason there's a vibrant local-image-generation community on consumer hardware. ComfyUI + Automatic1111 + Forge are the UI ecosystem.
Flux (Black Forest Labs · 2024+)
Open weights for dev/schnell · API for pro
Successor team to Stable Diffusion's original authors. Flux.1-dev (dev license), Flux.1-schnell (Apache 2.0), Flux.1-pro (API only). Best open-weights image quality of 2024-2025. Heavy flow-matching architecture (MMDiT 12B params).
DALL-E 3 (OpenAI · 2023)
API only · ChatGPT consumer
OpenAI's third-generation image model. API only. Strong prompt adherence + text rendering. Available through ChatGPT + API. Quietly capped by safety filters that can be more restrictive than open-weights options.
Imagen 4 family (Google · 2024)
API only · Google AI Studio + Vertex
Three variants: Imagen 4 (standard), Imagen 4 Fast (lower latency), Imagen 4 Ultra (highest quality). Google AI Studio + Vertex AI access. Strong on text-in-image rendering. Note: different model than Nano Banana Pro below.
Nano Banana Pro · Gemini 3 Pro Image (Google · 2024-2025)
API only · Google AI Studio · generativelanguage.googleapis.com
Google's multimodal Gemini-family image model — used as the image generation engine on atomeons.com. Different lineage from Imagen 4: Nano Banana Pro is the image branch of the Gemini transformer family, not a dedicated diffusion model. Strong on prompt adherence + brand-consistent style.
Sora (OpenAI · video · 2024)
ChatGPT consumer · API limited
OpenAI's video diffusion model. Multi-second video clips from text. Released as ChatGPT consumer feature December 2024. Quality leader for short-clip text-to-video at announcement. Heavy compute per generation.
Veo 2 / Veo 3 (Google · video · 2024+)
API · Google Vertex AI
Google DeepMind's video model. Multi-second clips from text. Strong on physical-world coherence. Available through Google Vertex AI. Often paired with Imagen 4 for stills + Veo for motion.
MusicLM / MusicGen / Suno / Udio (audio · 2023+)
Mostly API · some open-weights (Stable Audio, MusicGen)
Audio diffusion is a different model family. Suno + Udio are the consumer-facing music generators. Stability Audio (Stability AI) is the open-weights alternative. Less attention than image/video in 2025-2026 but actively shipping.