Spanish PII & De-Identification Collection 33 models for Spanish PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated 4 days ago • 4
French PII & De-Identification Collection 33 models for French PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated 4 days ago • 3
Italian PII & De-Identification Collection 33 models for Italian PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated 2 days ago • 2
German PII & De-Identification Collection 33 models for German PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated 4 days ago • 3
Multilingual PII & De-Identification Collection Multilingual models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 155 items • Updated 4 days ago • 20
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output 14 days ago • 20
PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 188 items • Updated 4 days ago • 32
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 17 days ago • 97
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 26 days ago • 56
gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provid • 4 items • Updated 11 days ago • 28
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published Jan 15 • 12
view article Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11, 2025 • 105