UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published Jan 21 • 51
CosyVoice2-0.5B 🥳 Generate realistic voice audio from text and audio prompts • Running on L4 • 179