Kandinsky Deforum


In the past few years, generative models that work with various data modalities have grown markedly in popularity. One of the most challenging tasks in this area is synthesizing video from text, which is both time-consuming and resource-intensive. The core of the proposed animation approach is an extension of Kandinsky with Deforum features, which opens up new generative possibilities for the text2image model.

Method Description

Animation generation involves three steps:

1. Generation of a reference frame by Kandinsky

2. Small transformation of the previous frame

3. Processing of the transformed image by the diffusion model in image-to-image mode
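
The three steps above can be sketched as a simple loop. This is a minimal illustration, not the project's actual code: `animate`, `translate`, `fake_text2img`, and `fake_img2img` are hypothetical names, frames are plain nested lists rather than real images, and a wrap-around pixel shift stands in for whatever transformation is applied in practice. In real use, the two stand-in callables would be replaced by calls to the Kandinsky text-to-image and image-to-image models.

```python
from typing import Callable, List

# A frame is a grayscale image stored as rows of pixel values (illustrative only).
Frame = List[List[float]]


def translate(frame: Frame, dx: int, dy: int) -> Frame:
    """Shift a frame by (dx, dy) pixels, wrapping around the edges.

    Stand-in for the "small transformation" of step 2 (an assumption;
    the real transformation may be a zoom, rotation, or 3D warp).
    """
    h, w = len(frame), len(frame[0])
    return [[frame[(y - dy) % h][(x - dx) % w] for x in range(w)] for y in range(h)]


def animate(
    prompt: str,
    n_frames: int,
    text2img: Callable[[str], Frame],
    img2img: Callable[[str, Frame], Frame],
    dx: int = 1,
    dy: int = 0,
) -> List[Frame]:
    """Build an animation frame by frame, following the three steps."""
    frames = [text2img(prompt)]                  # step 1: reference frame
    for _ in range(n_frames - 1):
        warped = translate(frames[-1], dx, dy)   # step 2: transform previous frame
        frames.append(img2img(prompt, warped))   # step 3: image-to-image pass
    return frames


# Hypothetical stand-ins for the diffusion model, so the sketch runs anywhere.
def fake_text2img(prompt: str) -> Frame:
    # Returns a 4x4 horizontal gradient instead of a generated image.
    return [[float(x) for x in range(4)] for _ in range(4)]


def fake_img2img(prompt: str, image: Frame) -> Frame:
    # A real model would re-noise and denoise the image; here it is returned unchanged.
    return [row[:] for row in image]


frames = animate("a red square", 3, fake_text2img, fake_img2img)
```

Because each new frame is derived from the previous one, the transformation accumulates over time, while the image-to-image pass is what keeps each warped frame looking like a plausible image for the prompt.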

Authors

Said Azizov, Igor Pavlov, Andrey Kuznetsov, Denis Dimitrov, Mikhail Shoitov, Angelina Kuts, Arseny Shakhmatov, Tatiana Paskova, Vladimir Arkhipkin, Sergey Nesteruk, Yulia Agafonova, Anastasia Lysenko