deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 5 hours ago • 1.04M • • 1.16k
view post Post 2111 🔦 What? The Hub as a vector search backend!code: https://gist.github.com/davidberenstein1957/f0157a471ec59d9dd44ae6957f1d52ecbuild on DuckDB: https://huggingface.co/docs/hub/en/datasets-duckdb See translation 👀 3 3 👍 1 1 + Reply
Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception Paper • 2410.12788 • Published Oct 16, 2024 • 24
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Paper • 2410.12628 • Published Oct 16, 2024 • 35
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance Paper • 2410.18889 • Published Oct 24, 2024 • 15
VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images Paper • 2408.16176 • Published Aug 28, 2024 • 8
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28, 2024 • 35