‼️ Sentence Transformers v3.0 is out! You can now train and finetune embedding models with multi-GPU training, bf16 support, loss logging, callbacks & much more. I'm also releasing 50+ datasets to train on.
1️⃣ Training Refactor
Embedding models can now be trained using an extensive trainer with a lot of powerful features:
- Multi-GPU training (Data Parallelism (DP) and Distributed Data Parallelism (DDP))
- bf16 training support and loss logging
- Evaluation datasets and evaluation loss
- Improved callback support, including an excellent Weights & Biases integration
- Gradient checkpointing and gradient accumulation
- Improved model card generation
- Resuming from a training checkpoint without performance loss
- Hyperparameter optimization
and much more! A minimal training sketch follows below; read my detailed blogpost to learn about the components that make up this new training approach: https://huggingface.co/blog/train-sentence-transformers
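Here's a minimal sketch of what a v3.0 training run can look like. The base model, dataset, and hyperparameter values are illustrative placeholders; see the blogpost above for complete recipes.

```python
from datasets import load_dataset
from sentence_transformers import (
    SentenceTransformer,
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from sentence_transformers.losses import MultipleNegativesRankingLoss

# 1. A base model to finetune into an embedding model.
model = SentenceTransformer("microsoft/mpnet-base")

# 2. A training/evaluation dataset with (anchor, positive, negative) triplets.
dataset = load_dataset("sentence-transformers/all-nli", "triplet")
train_dataset = dataset["train"].select(range(10_000))
eval_dataset = dataset["dev"]

# 3. A loss function that matches the dataset format.
loss = MultipleNegativesRankingLoss(model)

# 4. Training arguments: bf16, loss logging, checkpointing, etc.
args = SentenceTransformerTrainingArguments(
    output_dir="models/mpnet-base-all-nli",
    num_train_epochs=1,
    per_device_train_batch_size=32,
    bf16=True,              # requires a bf16-capable GPU
    logging_steps=100,
)

# 5. Train and save.
trainer = SentenceTransformerTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    loss=loss,
)
trainer.train()
model.save("models/mpnet-base-all-nli/final")
```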
2️⃣ Similarity Score
Not sure how to compare embeddings? Don't worry: you can now call model.similarity(embeddings1, embeddings2) and get your similarity scores immediately. Model authors can specify their preferred similarity function, so you don't have to worry about picking one!
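A quick sketch (the model name is just an example):

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]
embeddings = model.encode(sentences)

# Pairwise similarity matrix (3 x 3), computed with the similarity function
# configured by the model author (cosine similarity by default).
similarities = model.similarity(embeddings, embeddings)
print(similarities)
```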
3️⃣ Additional Kwargs
Sentence Transformers relies on various Transformers classes (AutoModel, AutoTokenizer, AutoConfig), but it used to be hard to pass useful keyword arguments to them (like 'torch_dtype=torch.bfloat16' to load a model at lower precision for a roughly 2x inference speedup). This is now easy!
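A short sketch using the new model_kwargs argument, which forwards keyword arguments to the underlying AutoModel (tokenizer_kwargs and config_kwargs work analogously for AutoTokenizer and AutoConfig); the model name is illustrative:

```python
import torch
from sentence_transformers import SentenceTransformer

# Load the underlying transformer weights in bfloat16 for faster inference.
model = SentenceTransformer(
    "all-MiniLM-L6-v2",
    model_kwargs={"torch_dtype": torch.bfloat16},
)
embeddings = model.encode(["Loaded at lower precision for faster inference."])
```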
4️⃣ Hyperparameter Optimization
Sentence Transformers now ships with HPO, helping you find effective hyperparameters for your data and task.
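A hedged sketch of how that can look via the trainer's hyperparameter_search method (inherited from the transformers Trainer), assuming the Optuna backend is installed (pip install optuna); the search space, dataset, and per-trial loss factory are illustrative choices, so check the HPO training docs for the exact recommended setup:

```python
from datasets import load_dataset
from sentence_transformers import (
    SentenceTransformer,
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from sentence_transformers.losses import MultipleNegativesRankingLoss

dataset = load_dataset("sentence-transformers/all-nli", "triplet")
train_dataset = dataset["train"].select(range(10_000))
eval_dataset = dataset["dev"].select(range(1_000))

def hpo_model_init(trial):
    # A fresh base model per trial, so trials are comparable.
    return SentenceTransformer("microsoft/mpnet-base")

def hpo_loss_init(model):
    # Loss is rebuilt per trial around that trial's model.
    return MultipleNegativesRankingLoss(model)

def hpo_search_space(trial):
    # The hyperparameters to search over (names follow the training arguments).
    return {
        "learning_rate": trial.suggest_float("learning_rate", 1e-6, 1e-4, log=True),
        "per_device_train_batch_size": trial.suggest_categorical(
            "per_device_train_batch_size", [16, 32, 64]
        ),
    }

args = SentenceTransformerTrainingArguments(output_dir="hpo-checkpoints", num_train_epochs=1)

trainer = SentenceTransformerTrainer(
    model=None,                 # supplied per trial by model_init
    args=args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    model_init=hpo_model_init,
    loss=hpo_loss_init,
)

best_trial = trainer.hyperparameter_search(
    hp_space=hpo_search_space,
    n_trials=10,
    direction="minimize",       # minimize the evaluation loss
    backend="optuna",
)
print(best_trial)
```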