--- license: apache-2.0 pipeline_tag: audio-text-to-text language: - en - zh base_model: - Yi3852/MuFun-Base datasets: - Yi3852/ACEStep-Songs --- a prompt generator for the [ACE-Step](https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B) music generation model, fintuned from the MuFun model proposed in [Advancing the Foundation Model for Music Understanding](https://arxiv.org/abs/2508.01178) more info see https://github.com/ace-step/ACE-Step/issues/313 gradio demo: http://47.121.209.64/mufun_demo_acestep demo code: https://github.com/laitselec/MuFun/blob/main/demo/mufun_acestep/gr_app.py train code: https://github.com/laitselec/MuFun ## Usage some audio processing packages like mutagen, torchaudio are needed to be installed ```python from transformers import AutoTokenizer, AutoModelForCausalLM hf_path = 'Yi3852/MuFun-ACEStep' tokenizer = AutoTokenizer.from_pretrained(hf_path, use_fast=False) device='cuda' model = AutoModelForCausalLM.from_pretrained(hf_path, trust_remote_code=True, torch_dtype="bfloat16") model.to(device) aud="/path/to/your/song.wav" inp='