view article Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies By prithivMLmods • 6 days ago • 16
PolyLM: An Open Source Polyglot Large Language Model Paper • 2307.06018 • Published Jul 12, 2023 • 26
SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Task Planning Paper • 2307.06135 • Published Jul 12, 2023 • 14