Skip to content

Lesson 10

Puru - a bunch of images of (spherical) linear interpolation between two different starting points

Student examples of making different pictures

  • Going from an old car prompt to a new car prompt
  • dinosaur -> to a bird
  • dog -> unicorn
  • classic optimizer efforts, couple of hours

Recap

  • With Handwritten digits,
  • start with a 7, add some noise and get a "noisy 7"
  • and feed this into Unet, and try to predict the noise
  • we can also pass in the actual number 7 as "guidance" to help it along
  • Turn captions into embeddings, via using existing images + captions (alt tab)
  • Then train a text encoder + image encoder
  • compare the dot product
  • contrastive loss
  • take the text encoder and feed into

--image--

Doing inference

  • put a prompt + some noise
  • Unet will predict noise
  • will subtract gradually, and then resubmit the updated photo
  • used to take 1000 steps, now takes 60 (maybe less!)

--image--

Papers

A lot of interest + work still being focused on this work.

Progressive Distillation for Fast Sampling of Diffusion Models On Distillation of guided Diffusion Models Imagic: Text-Based Real Image Editing with Diffusion Models

Overview of "Progressive Distillation for Fast Sampling of Diffusion Models"

Hand-written Notes