海棠直播

Contextual AD narration with interleaved multimodal sequence

2025年1月1日·

Hanlin Wang

,

Zhan Tong

,

Kecheng Zheng

,

Yujun Shen

Limin Wang

Limin Wang

· 0 分钟阅读时长

引用 URL

类型

出版物

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

最近更新于 2025年1月1日

Limin Wang

Authors

← CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding 2025年1月1日

LeviTor: 3D trajectory oriented image-to-video synthesis 2025年1月1日 →