Llama-Fin-8b / README.md

ZixuanKe

Update README.md

491d4c7 verified 16 days ago

preview code

raw

history blame contribute delete

1.77 kB

metadata

license: cc-by-4.0
datasets:
  - Salesforce/FinTrain
  - Salesforce/FinEval
language:
  - en
base_model:
  - meta-llama/Meta-Llama-3-8B-Instruct

💰 Demystifying Domain-adaptive Post-training for Financial LLMs

This is the finance-specific large language model trained using the recipe described in our paper:
📄 Demystifying Domain-adaptive Post-training for Financial LLMs

For more details, please check the following resources:

🌐 Project Page: https://vincent950129.github.io/adapt-llm/
📚 Training Data: https://huggingface.co/datasets/Salesforce/FinTrain
🧠 Evaluation Data: https://huggingface.co/datasets/Salesforce/FinEval
💻 Code Repository: https://github.com/SalesforceAIResearch/FinDAP

Ethical Considerations

Users need to make their own assessment regarding any obligations or responsibilities under the corresponding licenses or terms and conditions pertaining to the original datasets and data. This release is for research purposes only in support of an academic paper.

Citation

If you find our project helpful, please consider citing our paper 😊

@misc{ke2025demystifyingdomainadaptiveposttrainingfinancial,
      title={Demystifying Domain-adaptive Post-training for Financial LLMs}, 
      author={Zixuan Ke and Yifei Ming and Xuan-Phi Nguyen and Caiming Xiong and Shafiq Joty},
      year={2025},
      eprint={2501.04961},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2501.04961}, 
}