Sora research report

Sora research report

Sora research report

notion image

An Image is Worth 16x16 Words- Transformers for Image Recognition at Scale

notion image

Attention Is All You Need

notion image

Auto-Encoding Variational Bayes

notion image

Denoising Diffusion Probabilistic Models

notion image

Elucidating the Design Space of Diffusion-Based Generative Models

notion image

Generating Long Videos of Dynamic Scenes

notion image

High-Resolution Image Synthesis with Latent Diffusion Models

notion image

Imagen Video- High Definition Video Generation with Diffusion Models

notion image

Improved Denoising Diffusion Probabilistic Models

notion image

Masked Autoencoders Are Scalable Vision Learners

notion image

MoCoGAN- Decomposing Motion and Content for Video Generation

notion image

Scalable Diffusion Models with Transformers

notion image

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

notion image

SDEdit- Guided Image Synthesis and Editing with Stochastic Differential Equations

notion image

ViViT- A Video Vision Transformer

notion image

notion image

Adversarial Video Generation on Complex Datasets

notion image

Align your Latents- High-Resolution Video Synthesis with Latent Diffusion Models

notion image

Deep Unsupervised Learning using Nonequilibrium Thermodynamics

notion image

Diffusion Models Beat GANs on Image Synthesis

notion image

Generating Videos with Scene Dynamics

notion image

Generative Pretraining From Pixels

notion image

Hierarchical Text-Conditional Image Generation with CLIP Latents

notion image

Improving Image Generation with Better Captions

notion image

Language Models are Few-Shot Learners

notion image

NÜWA- Visual Synthesis Pre-training for Neural visUal World creAtion

notion image

Patch n' Pack- NaViT, a Vision Transformer for any Aspect Ratio and Resolution

notion image

Photorealistic Video Generation with Diffusion Models

notion image

Recurrent Environment Simulators

notion image

Unsupervised Learning of Video Representations using LSTMs

notion image

VideoGPT- Video Generation using VQ-VAE and Transformers

notion image

Zero-Shot Text-to-Image Generation