Text-to-Video model

From HandWiki
Revision as of 21:11, 8 February 2024 by Scavis

A text-to-video model is an artificial intelligence model that takes a natural-language text description as input and produces a video as output. The idea grew out of text-to-image models, which generate images from text. One approach to video prediction, which aims to render realistic objects against a stable background, uses a recurrent neural network as a sequence-to-sequence model, connected to a convolutional neural network that encodes and decodes each frame pixel by pixel,[1] so that the video is generated with deep learning.[2]
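The recurrent, frame-by-frame pipeline described above can be illustrated with a minimal sketch. It is not the method of any cited model: the names `embed_text`, `conv3x3` and `generate_video`, the 4×4 frame size, the hidden-state size and the random, untrained weights are all assumptions made for illustration, and the decoder is a plain linear layer rather than a convolutional one for brevity. Because nothing is trained, the output is noise; the sketch only shows how a text vector, a per-frame convolutional encoder and a recurrent state could fit together.

```python
import math
import random

H, W, HIDDEN = 4, 4, 8      # toy frame size and hidden-state size (assumed values)
rng = random.Random(0)      # fixed seed so the untrained weights are reproducible


def rand_matrix(rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def conv3x3(frame, kernel):
    """Same-size 2-D convolution with zero padding and ReLU (toy frame encoder)."""
    out = [[0.0] * W for _ in range(H)]
    for i in range(H):
        for j in range(W):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    y, x = i + di, j + dj
                    if 0 <= y < H and 0 <= x < W:
                        acc += frame[y][x] * kernel[di + 1][dj + 1]
            out[i][j] = max(acc, 0.0)
    return out


def embed_text(prompt):
    """Hypothetical text encoder: folds character codes into a fixed-size vector."""
    vec = [0.0] * HIDDEN
    for k, ch in enumerate(prompt.lower()):
        vec[k % HIDDEN] += (ord(ch) % 32) / 32.0
    return [math.tanh(v) for v in vec]


# Untrained, randomly initialised parameters (illustrative only).
KERNEL = rand_matrix(3, 3)
W_IN = rand_matrix(HIDDEN, H * W)
W_REC = rand_matrix(HIDDEN, HIDDEN)
W_OUT = rand_matrix(H * W, HIDDEN)


def generate_video(prompt, num_frames):
    """Roll the recurrent cell forward, decoding one frame per step."""
    hidden = embed_text(prompt)                 # text conditions the initial state
    frame = [[0.0] * W for _ in range(H)]       # start from a blank frame
    video = []
    for _ in range(num_frames):
        # Encode the previous frame with the convolution, then flatten it.
        features = [p for row in conv3x3(frame, KERNEL) for p in row]
        # Simple tanh recurrent update combining frame features and old state.
        pre = [a + b for a, b in zip(matvec(W_IN, features), matvec(W_REC, hidden))]
        hidden = [math.tanh(v) for v in pre]
        # Decode the state back to pixel intensities in (0, 1) with a sigmoid.
        pixels = [1.0 / (1.0 + math.exp(-v)) for v in matvec(W_OUT, hidden)]
        frame = [pixels[r * W:(r + 1) * W] for r in range(H)]
        video.append(frame)
    return video


video = generate_video("a red ball rolling", num_frames=3)
print(len(video), len(video[0]), len(video[0][0]))  # frames, rows, columns
```

Each generated frame is fed back into the encoder on the next step, which is what makes the loop a sequence model over frames. A real system would learn all four weight sets from paired text and video data.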

Methodology

Models

Several text-to-video models exist, some of them open source. The code for CogVideo is published on GitHub.[3] Meta Platforms offers text-to-video generation through Make-A-Video (makeavideo.studio).[4][5][6] Google developed Imagen Video for text-to-video generation.[7][8][9][10][11]

Antonia Antonova presented another model.[12]

References

  1. "Leading India". https://www.leadingindia.ai/downloads/projects/VP/vp_16.pdf. 
  2. Narain, Rohit (2021-12-29). "Smart Video Generation from Text Using Deep Neural Networks" (in en-US). https://www.datatobiz.com/blog/smart-video-generation-from-text/. 
  3. THUDM (2022-10-12). "CogVideo". https://github.com/THUDM/CogVideo. Retrieved 2022-10-12.
  4. Davies, Teli (2022-09-29). "Make-A-Video: Meta AI's New Model For Text-To-Video Generation" (in en). https://wandb.ai/telidavies/ml-news/reports/Make-A-Video-Meta-AI-s-New-Model-For-Text-To-Video-Generation--VmlldzoyNzE4Nzcx. 
  5. Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt" (in en). https://betterprogramming.pub/this-ai-can-create-video-from-text-prompt-6904439d7aba. 
  6. "Meta’s Make-A-Video AI creates videos from text". https://www.fonearena.com/blog/375627/meta-make-a-video-ai-create-videos-from-text.html. 
  7. "Google takes on Meta, introduces own video-generating AI". The Economic Times. https://m.economictimes.com/tech/technology/google-takes-on-meta-introduces-own-video-generating-ai/articleshow/94681128.cms.
  8. Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt" (in en). https://betterprogramming.pub/this-ai-can-create-video-from-text-prompt-6904439d7aba. 
  9. "Nuh-uh, Meta, we can do text-to-video AI, too, says Google". https://www.theregister.com/AMP/2022/10/06/google_ai_imagen_video/. 
  10. "Papers with Code - See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction" (in en). https://paperswithcode.com/paper/see-plan-predict-language-guided-cognitive. 
  11. "Papers with Code - Text-driven Video Prediction" (in en). https://paperswithcode.com/paper/text-driven-video-prediction. 
  12. "Text to Video Generation" (in en-US). https://antonia.space/text-to-video-generation.