Audio Conditioned LipSync with Latent Diffusion Models
Generate audio from text with voice customization