2024-3-11 03:35 /
今日工作总结

1. 了解视频格式基础知识,压制原理和常见动画画面瑕疵。链接更新在了AI动画技术指南中。

2. 阅读论文
(1) AudioCLIP: Extending CLIP to Image, Text and Audio
(2) Learning Transferable Visual Models From Natural Language Supervision
(3) InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

看论文细节和代码实现

3. 收集数据集