66 3 287

Adam

adamo1139

AI & ML interests

Local training and inference.

Recent Activity

upvoted an article 7 days ago

Exploring Name Diversity in Modern LLMs: A Grimdark Trilogy Experiment

liked a model 9 days ago

haykgrigorian/TimeCapsuleLLM

updated a model 10 days ago

adamo1139/GPT-OSS-20B-HESOYAM-1108-WIP-CHATML-GGUF

View all activity

Organizations

None yet

New activity in adamo1139/DeepSeek-R1-0528-AWQ 3 months ago

running in vllm gives error

#1 opened 3 months ago by

GrigoriiA

New activity in deepseek-ai/DeepSeek-R1-0528 3 months ago

Do you have deepseek-r1-0528-awq plan?

#68 opened 3 months ago by

oliver0102

New activity in unsloth/Qwen3-32B 4 months ago

Base Model?

#2 opened 4 months ago by

Downtown-Case

New activity in adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF 7 months ago

Failed to regenerate message

#1 opened 9 months ago by

PeterCastler

New activity in rhymes-ai/Aria 8 months ago

Base model not released

👍 3

#2 opened 10 months ago by

adamo1139

New activity in adamo1139/Yi-1.5-34B-32K-rebased-1406 9 months ago

Still active?

#1 opened 10 months ago by

DazzlingXeno

New activity in adamo1139/magpie-ultra-v0.1-shareGPT-Conversations 9 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 9 months ago by

librarian-bot

New activity in RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic 10 months ago

Can you please add Nemotron 70B static?

#1 opened 10 months ago by

nickandbro

New activity in allenai/Molmo-7B-D-0924 10 months ago

batch inference supported?

👍 1

#7 opened 11 months ago by

chenkq

commented a paper 11 months ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 53 •

New activity in adamo1139/Yi-34B-200K-AEZAKMI-v2 about 1 year ago

Adding Evaluation Results

#4 opened about 1 year ago by

leaderboard-pr-bot

New activity in teknium/OpenHermes-2.5-Mistral-7B about 1 year ago

How to do batch inference?

#34 opened about 1 year ago by

abhijeet-ta

New activity in LoneStriker/DeepSeek-Coder-V2-Instruct-GGUF about 1 year ago

How good is the gguf?

#3 opened about 1 year ago by

Tom-Neverwinter

New activity in deepseek-ai/DeepSeek-V2-Lite about 1 year ago

mixtral format?

#1 opened over 1 year ago by

KnutJaegersberg

New activity in LLM360/K2 about 1 year ago

huggyllama/llama-65b

👀 1

#1 opened about 1 year ago by

KnutJaegersberg

New activity in adamo1139/Lumina-T2I-quantized over 1 year ago

can you provide generation examples? is the quantized version coherent?

#1 opened over 1 year ago by

MayensGuds

New activity in mightbe/Qwen1.5-32B-llamafied over 1 year ago

It seems to be a Chat finetune

#1 opened over 1 year ago by

adamo1139

New activity in adamo1139/toxic-dpo-natural-v1 over 1 year ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by

parquet-converter

New activity in 01-ai/Yi-9B-200K over 1 year ago

GPU Memory Constraints for 01-ai/Yi-9B-200K Model

#3 opened over 1 year ago by

microcn

Problem with finetuning with Axolotl (qlora, lora, and FFT)

#1 opened over 1 year ago by

Undi95

Adam

AI & ML interests

Recent Activity

Organizations

adamo1139's activity

running in vllm gives error

Do you have deepseek-r1-0528-awq plan?

Base Model?

Failed to regenerate message

Base model not released

Still active?

Librarian Bot: Add language metadata for dataset

Can you please add Nemotron 70B static?

batch inference supported?

Adding Evaluation Results

How to do batch inference?

How good is the gguf?

mixtral format?

huggyllama/llama-65b

can you provide generation examples? is the quantized version coherent?

It seems to be a Chat finetune

[bot] Conversion to Parquet

GPU Memory Constraints for 01-ai/Yi-9B-200K Model

Problem with finetuning with Axolotl (qlora, lora, and FFT)