Mathias Nielsen's picture

Mathias Nielsen

mathiasn1

·

https://grandaiwizard.com/

AI & ML interests

🏢 Senior Machine Learning Engineer @ https://mediacatch.io/

Recent Activity

upvoted a collection 1 day ago

liked a model 5 days ago

smirki/UIGEN-T1.1-Qwen-14B-GGUF

liked a model 5 days ago

smirki/UIGEN-T1-Qwen-7b

View all activity

Organizations

mathiasn1's activity

upvoted a collection 1 day ago

GemmaX2

GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated 17 days ago • 16

upvoted 2 collections 7 days ago

Finnish Whisper speech recognition

Whisper models finetuned for Finnish in various formats • 7 items • Updated Dec 31, 2024 • 1

Ahma models

12 items • Updated 20 days ago • 2

upvoted a collection 9 days ago

NV-Embed

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks. • 3 items • Updated Jan 17 • 12

upvoted an article 19 days ago

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

upvoted an article 25 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago

• 770

upvoted an article 29 days ago

Article

We now support VLMs in smolagents!

about 1 month ago

• 84

upvoted an article about 1 month ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

Jan 23

• 63

upvoted 2 collections about 1 month ago

Nordic embedding training data

This is a collection of synthetic datasets for embedding model training in Danish, Swedish and Norwegian (bokmål). • 15 items • Updated 28 days ago • 4

NB-Whisper

Models based on Whisper from OpenAI, and trained on data from Språkbanken and the digital collection at the National Library of Norway. • 7 items • Updated Nov 30, 2024 • 12

upvoted an article about 2 months ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

By

•

Jan 3

• 34

upvoted 2 collections about 2 months ago

DeepSeek-V3

3 items • Updated Jan 6 • 187

NB-Llama 3.x

NOTE: CURRENTLY THERE ARE CONVERTION-ERRORS IN THIS MODELS. TEMPORARY PUT OFFLINE. LLama 3.x models in various sizes. • 9 items • Updated 17 days ago • 2

upvoted 2 collections 2 months ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 10 days ago • 81

Danish Text Datasets

These include high-quality Danish text datasets for pre-training, fine-tuning, etc. • 16 items • Updated Dec 15, 2024 • 3

upvoted a paper 2 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 107

upvoted 2 collections 3 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 525

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 135

upvoted an article 3 months ago

Article

EuroLLM-9B

By

and 5 others •

Dec 2, 2024

• 111

upvoted a collection 3 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Jan 17 • 153