InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
- InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
  Paper • 2512.01342 • Published • 14
- revliter/internvideo_next_base_p14_res224_f16
  91M • Updated • 156 • 3
- revliter/internvideo_next_large_p14_res224_f16
  0.3B • Updated • 309 • 4
- revliter/internvideo_next_large_p14_res224_f16_stage1
  Updated • 10 • 1