PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 4 days ago • 52
Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 8 days ago • 46
HuBERT Collection A collection of checkpoints from the HuBERT release, a speech encoder that learns powerful representations from unlabelled audio data. • 6 items • Updated Jan 16, 2024 • 9
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR Paper • 2601.14251 • Published 14 days ago • 23
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 13 days ago • 20
A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification Paper • 2601.13288 • Published 15 days ago • 12
Beyond Cosine Similarity: Taming Semantic Drift and Antonym Intrusion in a 15-Million Node Turkish Synonym Graph Paper • 2601.13251 • Published 15 days ago • 4
A Hybrid Protocol for Large-Scale Semantic Dataset Generation in Low-Resource Languages: The Turkish Semantic Relations Corpus Paper • 2601.13253 • Published 15 days ago • 4