ControlVideo | Free AI tool
Training-free Controllable Text-to-Video Generation
Introduction
ControlVideo
Official PyTorch implementation of “ControlVideo: Training-free Controllable Text-to-Video Generation”
ControlVideo adapts ControlNet to the video counterpart without any finetuning, aiming to directly inherit its high-quality and consistent generation
Citation
If you make use of our work, please cite our paper.
@article{zhang2023controlvideo,
title={ControlVideo: Training-free Controllable Text-to-Video Generation},
author={Zhang, Yabo and Wei, Yuxiang and Jiang, Dongsheng and Zhang, Xiaopeng and Zuo, Wangmeng and Tian, Qi},
journal={arXiv preprint arXiv:2305.13077},
year={2023}
}
Acknowledgement
This work repository borrows heavily from Diffusers, ControlNet, Tune-A-Video, and RIFE.
There are also many interesting works on video generation: Tune-A-Video, Text2Video-Zero, Follow-Your-Pose, Control-A-Video, et al.
Recommendation
Gemini
Gemini is now free to all users.
Gemini, a groundbreaking AI model created by Google, seamlessly operates across various modalities including text, images, video, audio, and code.
Claude
You can experience Claude-3-Opus, Claude-3-Sonnet, Claude-2.1 and Claude-Instant in this application. Claude is an intelligent conversational assistant based on large-scale language models. It can handle context with up to tens of thousands of words in a single conversation.
It is committed to providing instant, accurate and comprehensive answers to all kinds of user questions. Claude is a professional AI assistant.
Mixtral
Supports Mixtral 7B and 8x7B.
Mixtral AI's next-generation conversational AI uses intelligent Q&A capabilities to solve your tough questions.
MythoMist
MythoMist 7B, a cutting-edge and free Mistral-AI model, offering dynamic performance tuning for user-defined objectives. Using a sophisticated algorithm, it discreetly minimizes overused terms in ChatGPT roleplay, all at no cost.