Skip to main content

DALL·E Mini – Generate Images From Any Text Prompt

In this article, we explore Craiyon (also known as DALL·E mini), which is an artificial intelligence model that generates images from any prompt you give.
Created on May 16|Last edited on January 28
Watch The Story Behind DALL·E mini on Gradient Dissent with Boris Dayma
💡
Craiyon (previously DALL·E mini) is an AI model that can generate images from any prompt you give. The model is still training and keeps improving over time.
In this article, we explain how to use it, how it works, and share some examples.

Table of Contents



How To Use It

Generate your own images on craiyon.com.

If you're a developer, you can also refer to our GitHub repository to run or train your own models!

How Does It Work?

History

OpenAI had the first impressive model for generating images with DALL·E.
Craiyon/DALL·E mini is an attempt at reproducing those results with an open-source model.

The Simple Explanation

The model is trained by looking at millions of images from the internet with their associated captions. Over time, it learns how to draw an image from a text prompt.
Some concepts are learned from memory as they may have seen similar images. However, it can also learn how to create unique images that don't exist, such as "the Eiffel tower is landing on the moon," by combining multiple concepts together.
Several models are combined together to achieve these results:
  • an image encoder that turns raw images into a sequence of numbers with its associated decoder
  • a model that turns a text prompt into an encoded image
  • a model that judges the quality of the images generated for better filtering

The Technical Explanation

Refer to our technical report for more details.

What Is DALL·E Mega?

DALL·E mega is the largest version of DALL·E mini trained and contains several improvements:
The model is still training.


See DALL·E mega training journal for more details.

Is the Model Available?

Yes!
While the model is still training, the latest version is regularly pushed:
It can also be downloaded programmatically with:
import wandb
run = wandb.init()
artifact = run.use_artifact('dalle-mini/dalle-mini/mega-1:latest')
artifact_dir = artifact.download()
See the notebooks from our GitHub repository for how to use it or just go to our app!

Results From the Latest Model




Resources

Acknowledgements

  • 🤗 Hugging Face for organizing the FLAX/JAX community week, with a stellar coordination and support from Patrick von Platen and Suraj Patil
  • Google TPU Research Cloud (TRC) program for providing computing resources
  • Weights & Biases for providing the infrastructure for experiment tracking and model management
  • Authors of original DALL·E mini: Boris Dayma, Suraj Patil, Pedro Cuenca, Khalid Saifullah, Tanishq Abraham, Phúc Lê Khắc, Luke Melas, Ritobrata Ghosh
  • Contributions and support from Rohan Anil, Phil Wang, Katherine Crowson, the Gradio team and the LAION & EleutherAI communities
Follow @borisdayma for the latest updates!



Subscribe to our Newsletter for more ML News like DALLE.

Duncan Paolo Kmet
Duncan Paolo Kmet •  
Very Awesome BOIIIIIISSS
Reply
Sully Dawood
Sully Dawood •  
Anyone have insights on commercial use?
Reply
Liam Ames
Liam Ames •  
Each time I try to use it I receive a message saying that there is too many traffic!
1 reply
Frank W.
Frank W. •  
Amazing! One question: Are the generated pictures free to use?
2 replies
Mohammad Bilal Shaikh
Mohammad Bilal Shaikh •  
excellent work mates!
Reply
James Webb
James Webb •  
Amazing work!
Reply
Rana Saleh
Rana Saleh •  
Great job
Reply
Iterate on AI agents and models faster. Try Weights & Biases today.