Text-to-Video model
Text-to-Video is a state of the art artificial intelligence technology which needs only text as input for the output as video. The inspiration came from text-to-image models which deliver images as output from text as input. Video prediction on making objects realistic in a stable background is performed by using recurrent neural network for a sequence to sequence model with a connector convolutional neural network encoding and decoding each frame pixel by pixel,[1] creating video using deep learning.[2]
Methodology
- Data collection and data set preparation using clear video from kinetic human action video.
- Training the convolutional neural network for making video.
- Keywords extraction from text using natural-language programming .
- Testing of Data set in conditional generative model for existing static and dynamic information from text by variational autoencoder and generative adversarial network.
Models
There are different models including open source models. CogVideo presented their code in GitHub.[3] Meta Platforms uses text-to-video with makeavideo.studio.[4][5][6]Google used Imagen Video for converting text-to-video.[7][8][9][10][11]
Antonia Antonova presented another model.[12]
References
- ↑ "Leading India". https://www.leadingindia.ai/downloads/projects/VP/vp_16.pdf.
- ↑ Narain, Rohit (2021-12-29). "Smart Video Generation from Text Using Deep Neural Networks" (in en-US). https://www.datatobiz.com/blog/smart-video-generation-from-text/.
- ↑ CogVideo, THUDM, 2022-10-12, https://github.com/THUDM/CogVideo, retrieved 2022-10-12
- ↑ Davies, Teli (2022-09-29). "Make-A-Video: Meta AI's New Model For Text-To-Video Generation" (in en). https://wandb.ai/telidavies/ml-news/reports/Make-A-Video-Meta-AI-s-New-Model-For-Text-To-Video-Generation--VmlldzoyNzE4Nzcx.
- ↑ Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt" (in en). https://betterprogramming.pub/this-ai-can-create-video-from-text-prompt-6904439d7aba.
- ↑ "Meta’s Make-A-Video AI creates videos from text". https://www.fonearena.com/blog/375627/meta-make-a-video-ai-create-videos-from-text.html.
- ↑ "google: Google takes on Meta, introduces own video-generating AI - The Economic Times". https://m.economictimes.com/tech/technology/google-takes-on-meta-introduces-own-video-generating-ai/amp_articleshow/94681128.cms?amp_gsa=1&_js_v=a9&usqp=mq331AQKKAFQArABIIACAw==#amp_tf=From%20%251$s&aoh=16655942495197&referrer=https://www.google.com&share=https://m.economictimes.com/tech/technology/google-takes-on-meta-introduces-own-video-generating-ai/articleshow/94681128.cms.
- ↑ Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt" (in en). https://betterprogramming.pub/this-ai-can-create-video-from-text-prompt-6904439d7aba.
- ↑ "Nuh-uh, Meta, we can do text-to-video AI, too, says Google". https://www.theregister.com/AMP/2022/10/06/google_ai_imagen_video/.
- ↑ "Papers with Code - See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction" (in en). https://paperswithcode.com/paper/see-plan-predict-language-guided-cognitive.
- ↑ "Papers with Code - Text-driven Video Prediction" (in en). https://paperswithcode.com/paper/text-driven-video-prediction.
- ↑ "Text to Video Generation" (in en-US). https://antonia.space/text-to-video-generation.