For the third meeting of the Vision-Language Club, we were grateful to host Hila Chefer for a talk on Google's new text-to-video diffusion model - Lumiere!
In this talk, I will present Lumiere, our latest text-to-video model from Google Research. Lumiere is designed for synthesizing videos that portray realistic, diverse, and coherent motion - a pivotal challenge in video synthesis. We achieve this by introducing a Space-Time U-Net architecture that generates the entire temporal duration of the video at once, in a single pass through the model. This is in contrast to existing video models, which synthesize distant keyframes followed by temporal super-resolution - an approach that inherently makes global temporal consistency difficult to achieve.
During this talk, we will delve into the Space-Time U-Net architecture proposed by Lumiere, comparing it to existing text-to-video models. Additionally, we will explore the broad range of applications facilitated by our model, including image-to-video generation, video stylization, video editing, cinemagraphs, and more.
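To make the architectural contrast concrete, here is a minimal, hypothetical sketch of the core idea behind a Space-Time U-Net: the network downsamples and upsamples the clip in both space *and* time, so the full temporal duration is processed in one forward pass rather than via keyframes plus a separate temporal super-resolution stage. This is not Google's Lumiere code; all module names, channel sizes, and shapes below are illustrative assumptions.

```python
# Hypothetical sketch of joint space-time down/up-sampling (NOT Lumiere's code).
import torch
import torch.nn as nn

class SpaceTimeDownBlock(nn.Module):
    """Jointly downsamples time (T) and space (H, W) with 3D convolutions."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv = nn.Conv3d(in_ch, out_ch, kernel_size=3, padding=1)
        # stride 2 along T, H, and W halves both temporal and spatial extent
        self.down = nn.Conv3d(out_ch, out_ch, kernel_size=3,
                              stride=(2, 2, 2), padding=1)
        self.act = nn.SiLU()

    def forward(self, x):  # x: (batch, channels, T, H, W)
        return self.down(self.act(self.conv(x)))

class SpaceTimeUpBlock(nn.Module):
    """Mirrors the down block: upsamples T, H, and W back toward full size."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.up = nn.ConvTranspose3d(in_ch, out_ch, kernel_size=4,
                                     stride=(2, 2, 2), padding=1)
        self.act = nn.SiLU()

    def forward(self, x):
        return self.act(self.up(x))

# Toy end-to-end pass: an 80-frame clip is handled as a single tensor, so the
# network "sees" the whole temporal duration at once - no distant keyframes,
# no separate temporal super-resolution model.
video = torch.randn(1, 8, 80, 64, 64)    # (batch, channels, frames, H, W)
down, up = SpaceTimeDownBlock(8, 16), SpaceTimeUpBlock(16, 8)
h = down(video)                           # -> (1, 16, 40, 32, 32)
out = up(h)                               # -> (1, 8, 80, 64, 64)
print(h.shape, out.shape)
```

The key design point the sketch illustrates is the temporal stride: by compressing the frame axis inside the network, a much longer clip fits in memory for a single pass, which is what makes globally consistent motion easier to achieve than stitching keyframes after the fact.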
Hila is a PhD candidate at Tel Aviv University, advised by Prof. Lior Wolf, and a research intern at Google in Tel Aviv. Her research centers on computer vision and multi-modal learning, with a particular focus on developing methods to understand deep neural networks and leveraging those insights to enhance the expressiveness, robustness, and fairness of such models.