Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2409.07314

Comprehensive Evaluations

Model evaluation framework for Clinical Application

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 54
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks

Paper • 2407.21072 • Published Jul 29, 2024 • 2
Named Clinical Entity Recognition Benchmark

Paper • 2410.05046 • Published Oct 7, 2024 • 17
Running

5

5

MEDIC Benchmark

📊

Explore LLM performance through benchmark evaluations

Industry models

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 54

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 54

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1, 2024 • 13
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Paper • 2407.21646 • Published Jul 31, 2024 • 18
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection

Paper • 2408.04284 • Published Aug 8, 2024 • 26
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Paper • 2408.07852 • Published Aug 14, 2024 • 16

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 140
Elucidating the Design Space of Diffusion-Based Generative Models

Paper • 2206.00364 • Published Jun 1, 2022 • 15
GLU Variants Improve Transformer

Paper • 2002.05202 • Published Feb 12, 2020 • 2
StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 138

Biomedical NLP papers

Papers posted on @[email protected] (Clinical, Healthcare & Biomedical NLP)

about 1 month ago

MedS^3: Towards Medical Small Language Models with Self-Evolved Slow Thinking

Paper • 2501.12051 • Published Jan 21
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs

Paper • 2501.09825 • Published Jan 16 • 14
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators

Paper • 2501.09484 • Published Jan 16 • 19
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13 • 50

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs