CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image Paper • 2502.12894 • Published 5 days ago • 1
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published 5 days ago • 33
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published 13 days ago • 32
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance Paper • 2502.06145 • Published 14 days ago • 16
Pippo: High-Resolution Multi-View Humans from a Single Image Paper • 2502.07785 • Published 12 days ago • 11
VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation Paper • 2502.07531 • Published 12 days ago • 13
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Paper • 2502.05179 • Published 16 days ago • 22
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 17 days ago • 50
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 21 days ago • 182
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 19 days ago • 56
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published about 1 month ago • 30
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text Paper • 2501.15654 • Published 28 days ago • 12
Histoires Morales: A French Dataset for Assessing Moral Alignment Paper • 2501.17117 • Published 26 days ago • 3