Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
63
3
270
Adam
adamo1139
Follow
zappa2005's profile picture
MarinaraSpaghetti's profile picture
Kha37lid's profile picture
53 followers
·
36 following
AI & ML interests
Local training and inference.
Recent Activity
liked
a model
13 days ago
tomg-group-umd/huginn-0125
reacted
to
v2ray
's
post
with 👍
19 days ago
GPT4chan Series Release GPT4chan is a series of models I trained on https://huggingface.co/datasets/v2ray/4chan dataset, which is based on https://huggingface.co/datasets/lesserfield/4chan-datasets. The dataset contains mostly posts from 2023. Not every board is included, for example, /pol/ is NOT included. To see which boards are included, visit https://huggingface.co/datasets/v2ray/4chan/tree/main/boards. This release contains 2 models sizes, 8B and 24B. The 8B model is based on https://huggingface.co/meta-llama/Llama-3.1-8B and the 24B model is based on https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501. Why I made these models? Because for a long time after the original gpt-4chan model, there aren't any serious fine-tunes on 4chan datasets. 4chan is a good data source since it contains coherent replies and nice topics. It's fun to talk to an AI generated version of 4chan and get instant replies, and without the need to actually visit 4chan. You can also sort of analyze the content and behavior of 4chan posts by probing the model's outputs. Disclaimer: The GPT4chan models should only be used for research purposes, the outputs they generated do not represent the view of me on the subjects. Moderate the responses before sending it online. Model links: Full model: - https://huggingface.co/v2ray/GPT4chan-8B - https://huggingface.co/v2ray/GPT4chan-24B Adapter: - https://huggingface.co/v2ray/GPT4chan-8B-QLoRA - https://huggingface.co/v2ray/GPT4chan-24B-QLoRA AWQ: - https://huggingface.co/v2ray/GPT4chan-8B-AWQ - https://huggingface.co/v2ray/GPT4chan-24B-AWQ FP8: - https://huggingface.co/v2ray/GPT4chan-8B-FP8
updated
a model
22 days ago
adamo1139/Qwen2-VL-7B-Sydney
View all activity
Organizations
adamo1139
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF
29 days ago
Failed to regenerate message
1
#1 opened 3 months ago by
PeterCastler
New activity in
rhymes-ai/Aria
2 months ago
Base model not released
11
#2 opened 5 months ago by
adamo1139
New activity in
AI-Safeguard/Ivy-VL-llava
3 months ago
Wrong licensing
#1 opened 3 months ago by
adamo1139
New activity in
adamo1139/Yi-1.5-34B-32K-rebased-1406
3 months ago
Still active?
9
#1 opened 4 months ago by
DazzlingXeno
New activity in
adamo1139/magpie-ultra-v0.1-shareGPT-Conversations
3 months ago
Librarian Bot: Add language metadata for dataset
#1 opened 3 months ago by
librarian-bot
New activity in
rhymes-ai/Allegro
4 months ago
Expected speed
1
#3 opened 4 months ago by
adamo1139
New activity in
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
4 months ago
Can you please add Nemotron 70B static?
3
#1 opened 4 months ago by
nickandbro
New activity in
allenai/Molmo-7B-D-0924
4 months ago
batch inference supported?
6
#7 opened 5 months ago by
chenkq
commented
a paper
5 months ago
Hermes 3 Technical Report
Paper
•
2408.11857
•
Published
Aug 15, 2024
•
47
•
8
New activity in
adamo1139/Yi-34B-200K-AEZAKMI-v2
7 months ago
Adding Evaluation Results
#4 opened 7 months ago by
leaderboard-pr-bot
New activity in
teknium/OpenHermes-2.5-Mistral-7B
8 months ago
How to do batch inference?
1
#34 opened 9 months ago by
abhijeet-ta
New activity in
LoneStriker/DeepSeek-Coder-V2-Instruct-GGUF
8 months ago
How good is the gguf?
3
#3 opened 8 months ago by
Tom-Neverwinter
New activity in
deepseek-ai/DeepSeek-V2-Lite
9 months ago
mixtral format?
5
#1 opened 9 months ago by
KnutJaegersberg
New activity in
LLM360/K2
9 months ago
huggyllama/llama-65b
4
#1 opened 9 months ago by
KnutJaegersberg
New activity in
szymonrucinski/Curie-7B-v1
9 months ago
Share the dataset used?
#2 opened 9 months ago by
adamo1139
New activity in
01-ai/Yi-1.5-34B-32K
9 months ago
Plans for 200K?
5
#1 opened 9 months ago by
adamo1139
New activity in
adamo1139/Lumina-T2I-quantized
9 months ago
can you provide generation examples? is the quantized version coherent?
2
#1 opened 9 months ago by
MayensGuds
New activity in
mightbe/Qwen1.5-32B-llamafied
10 months ago
It seems to be a Chat finetune
2
#1 opened 10 months ago by
adamo1139
New activity in
adamo1139/toxic-dpo-natural-v1
11 months ago
[bot] Conversion to Parquet
#1 opened 11 months ago by
parquet-converter
New activity in
01-ai/Yi-9B-200K
11 months ago
GPU Memory Constraints for 01-ai/Yi-9B-200K Model
2
#3 opened 11 months ago by
microcn
Load more