MoDA: Multi-modal Diffusion Architecture for Talking Head Generation Paper • 2507.03256 • Published Jul 4 • 2
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation Paper • 2508.11255 • Published 6 days ago • 9
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 16 days ago • 467
view article Article Why We Built the OpenMDW License: A Comprehensive License for ML Models By linuxfoundation • Jul 2 • 23
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 23 days ago • 156
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 10 days ago • 218
view article Article Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other • 29 days ago • 44
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published Jul 11 • 79
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 158
Kontext Dev LoRAs Collection Collection of Kontext Dev LoRAs by fal • 30 items • Updated 24 days ago • 26
ARIG: Autoregressive Interactive Head Generation for Real-time Conversations Paper • 2507.00472 • Published Jul 1 • 11
Audio-Sync Video Generation with Multi-Stream Temporal Control Paper • 2506.08003 • Published Jun 9 • 3
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper • 2412.15322 • Published Dec 19, 2024 • 20