metadata
license: cc-by-4.0
datasets:
- Salesforce/FinTrain
- Salesforce/FinEval
language:
- en
base_model:
- meta-llama/Meta-Llama-3-8B-Instruct
💰 Demystifying Domain-adaptive Post-training for Financial LLMs
This is the finance-specific large language model trained using the recipe described in our paper:
📄 Demystifying Domain-adaptive Post-training for Financial LLMs
For more details, please check the following resources:
- 🌐 Project Page: https://vincent950129.github.io/adapt-llm/
- 📚 Training Data: https://huggingface.co/datasets/Salesforce/FinTrain
- 🧠 Evaluation Data: https://huggingface.co/datasets/Salesforce/FinEval
- 💻 Code Repository: https://github.com/SalesforceAIResearch/FinDAP
Ethical Considerations
Users need to make their own assessment regarding any obligations or responsibilities under the corresponding licenses or terms and conditions pertaining to the original datasets and data. This release is for research purposes only in support of an academic paper.
Citation
If you find our project helpful, please consider citing our paper 😊
@misc{ke2025demystifyingdomainadaptiveposttrainingfinancial,
title={Demystifying Domain-adaptive Post-training for Financial LLMs},
author={Zixuan Ke and Yifei Ming and Xuan-Phi Nguyen and Caiming Xiong and Shafiq Joty},
year={2025},
eprint={2501.04961},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2501.04961},
}