Diffusion Model

Compositional Capability of Diffusion Model

Diffusion Model은 text-to-image generation task에서 뛰어난 성능을 보여주고 있지만, 여러 단어로 이루어져 있거나 여러 단어들의 관계성을 포함한 텍스트에 대해서는 텍스트에 일치하는 이미지를 생성하기 어려워 합니다. Diffusion Model의 이러한 한계를 극복하고 복잡한 텍스트에 대해서도 텍스트와 일치하는 이미지를 생성할 수 있도록 연구를 하고 있습니다.

Ex) A big hippopotamus and a small cat, The square box is next to the circular canister.

Accepted Papers

Page updated

Google Sites

Report abuse