Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model Paper โข 2502.13449 โข Published 5 days ago โข 24
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models Paper โข 2502.12464 โข Published 6 days ago โข 26
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models Paper โข 2502.12464 โข Published 6 days ago โข 26
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models Paper โข 2502.12464 โข Published 6 days ago โข 26 โข 2
Continuous Diffusion Model for Language Modeling Paper โข 2502.11564 โข Published 7 days ago โข 48
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper โข 2502.08910 โข Published 11 days ago โข 140
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models Paper โข 2410.01524 โข Published Oct 2, 2024 โข 3
view post Post 1194 ๐จ๐ฅ New Release Alert! ๐ฅ๐จIntroducing the 435M model that outperforms Llama-Guard-3-8B while slashing 75% of the computation cost! ๐ป๐ฅ๐ Check it out: hbseong/HarmAug-Guard (Yes, INFERENCE CODE INCLUDED! ๐ก)More details in our paper: https://arxiv.org/abs/2410.01524 ๐#HarmAug #LLM # Safety #EfficiencyBoost #Research #AI #MachineLearning ๐ 5 5 โค๏ธ 4 4 + Reply
Self-Supervised Dataset Distillation for Transfer Learning Paper โข 2310.06511 โข Published Oct 10, 2023
DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models Paper โข 2305.16943 โข Published May 26, 2023
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks Paper โข 2305.18395 โข Published May 28, 2023 โข 1
Contrastive Learning with Adversarial Perturbations for Conditional Text Generation Paper โข 2012.07280 โข Published Dec 14, 2020
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries Paper โข 2402.13043 โข Published Feb 20, 2024 โข 2
Self-Distillation for Further Pre-training of Transformers Paper โข 2210.02871 โข Published Sep 30, 2022 โข 1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models Paper โข 2410.01524 โข Published Oct 2, 2024 โข 3