About this episode
General Partner Anjney Midha explores the cutting-edge world of text-to-video AI with AI researchers Andreas Blattmann and Robin Rombach. Released in November, Stable Video Diffusion is their latest open-source generative video model, overcoming challenges in size and dynamic representation. In this episode Robin and Andreas share why translating text to video is complex, the key role of datasets, current applications, and the future of video editing. Topics Covered: 00:00 - Text to Video: The Next Leap in AI Generation 02:41 - The Stable Diffusion backstory 04:25 - Diffusion vs autoregressive models 06:09 - The benefits of single step sampling 09:15 - Why generative video? 11:19 - Understanding physics through AI video 12:20 - The challenge of creating generative video 15:36 - Data set selection and training 17:50 - Structural consistency and 3D objects 19:50 - Incorporating LoRAs 21:24 - How should creators think about these tools? 23:46 - Open challenges in video generation 25:42 - Infrastructure challenges and future research Resources: Find Robin on Twitter: https://twitter.com/robrombach Find Andreas on Twitter: https://twitter.com/andi_blatt Find Anjney on Twitter: https://twitter.com/anjneymidha Stay Updated: Find a16z on Twitter: https://twitter.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Subscribe on your favorite podcast app: https://a16z.simplecast.com/ Follow our host: https://twitter.com/stephsmithio Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures. Stay Updated: Find a16z on X Find a16z on LinkedIn Listen to the a16z Show on Spotify Listen to the a16z Show on Apple Podcasts Follow our host: https://twitter.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.