tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 10.5k • 22
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation