VideoPoet

Short description: Large language model to create videos

VideoPoet is a large language model developed by Google Research in 2023 for video generation.[1][2][3][4] It can animate still images when prompted.[5] The model accepts text, images, and video as prompt input, with the aim of generating output in any format from any type of input. It is in a private testing phase.

References

  1. Krithika, K. L. (December 20, 2023). "Google Unveils VideoPoet, a New LLM for Video Generation". https://analyticsindiamag.com/google-unveils-videopoet-a-new-llm-for-video-generation/. 
  2. "Google has introduced VideoPOET breaking new ground in coherent video generation". Gizmochina. https://www.gizmochina.com/2023/12/21/google-videopoet-10-second-coherent-video-generation/.
  3. Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu (December 21, 2023). "VideoPoet: A Large Language Model for Zero-Shot Video Generation". arXiv:2312.14125 [cs.CV].
  4. "VideoPoet – Google Research". https://sites.research.google/videopoet/. 
  5. Franzen, Carl (December 20, 2023). "Google's new multimodal AI video generator VideoPoet looks incredible". https://venturebeat.com/ai/googles-new-videopoet-multimodal-ai-video-generation-model-looks-incredible/.