ControlVideo | Free AI tool
Training-free Controllable Text-to-Video Generation
Introduction
ControlVideo
Official PyTorch implementation of “ControlVideo: Training-free Controllable Text-to-Video Generation”
ControlVideo adapts ControlNet to the video counterpart without any finetuning, aiming to directly inherit its high-quality and consistent generation
Citation
If you make use of our work, please cite our paper.
@article{zhang2023controlvideo,
title={ControlVideo: Training-free Controllable Text-to-Video Generation},
author={Zhang, Yabo and Wei, Yuxiang and Jiang, Dongsheng and Zhang, Xiaopeng and Zuo, Wangmeng and Tian, Qi},
journal={arXiv preprint arXiv:2305.13077},
year={2023}
}
Acknowledgement
This work repository borrows heavily from Diffusers, ControlNet, Tune-A-Video, and RIFE.
There are also many interesting works on video generation: Tune-A-Video, Text2Video-Zero, Follow-Your-Pose, Control-A-Video, et al.
Recommendation
Gemini
Gemini, a groundbreaking AI model series developed by Google, contains Gemini 1.5 Flash, Gemini 1.5 Pro and Gemini Pro, seamlessly operates across various modalities including text, images and code.
Claude
You can experience Claude-3-Opus, Claude-3.5-Sonnet, Claude-2.1 and Claude-Instant in this application. Claude is an intelligent conversational assistant based on large-scale language models. It can handle context with up to tens of thousands of words in a single conversation.
It is committed to providing instant, accurate and comprehensive answers to all kinds of user questions. Claude is a professional AI assistant.
Mixtral
Supports Mixtral 7B and 8x7B.
Mixtral AI's next-generation conversational AI uses intelligent Q&A capabilities to solve your tough questions.
MythoMist
MythoMist 7B, a cutting-edge and free Mistral-AI model, offering dynamic performance tuning for user-defined objectives. Using a sophisticated algorithm, it discreetly minimizes overused terms in ChatGPT roleplay, all at no cost.