FINAL_Bench

community

AI & ML interests

None defined yet.

Recent Activity

SeaWolf-AI new activity 4 days ago

FINAL-Bench/all-bench-leaderboard:A new model has been listed on the All Bench leaderboard.

SeaWolf-AI updated a Space 4 days ago

FINAL-Bench/all-bench-leaderboard

SeaWolf-AI published an article 6 days ago

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

View all activity

Articles

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

View all articles

Collections 1

spaces 3

ALL Bench Leaderboard

ALL Bench Leaderboard

Leaderboard - FINAL Bench 'Metacognitive'

Metacognitive

Invisible Watermark Against Unauthorized AI Training — Text, Image & Video Protection

One embed. Four invisible layers. 34 attacks defeated.

models 0

None public yet

datasets 2

FINAL-Bench/ALL-Bench-Leaderboard

Viewer • Updated 6 days ago • 90 • 1.85k • 19

FINAL-Bench/Metacognitive

Viewer • Updated 17 days ago • 100 • 10.8k • 74