Raman Kumar

Imgonnahugyou

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

upvoted a paper 4 months ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

upvoted a paper 6 months ago

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Organizations

None yet

Imgonnahugyou's activity

upvoted a paper 4 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 5 days ago • 72

upvoted a paper 4 months ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published Nov 5, 2024 • 68

upvoted 4 papers 6 months ago

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Paper • 2402.05935 • Published Feb 8, 2024 • 17

liked a dataset 7 months ago

tatsu-lab/alpaca

Viewer • Updated May 22, 2023 • 52k • 31.4k • 732

liked a model 8 months ago

AI-MO/NuminaMath-7B-TIR

Text Generation • Updated Aug 14, 2024 • 17.1k • 337

upvoted a collection 8 months ago

AIMO Progress Prize

Collection

Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19, 2024 • 12

liked a dataset 8 months ago

proj-persona/PersonaHub

Viewer • Updated 9 days ago • 375k • 5.46k • 522

upvoted a paper 8 months ago

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

Paper • 2407.01284 • Published Jul 1, 2024 • 77

upvoted a collection 8 months ago

4M Models

Collection

Multimodal models from https://4m.epfl.ch/ • 14 items • Updated Jun 14, 2024 • 31

liked a dataset 8 months ago

google/spiqa

Viewer • Updated Jan 8 • 666 • 457 • 35

updated a collection 8 months ago

vision-2-audio-hugs

Collection

2 items • Updated Jun 19, 2024

upvoted 2 papers 8 months ago

Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation

Paper • 2406.08860 • Published Jun 13, 2024 • 1

CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification

Paper • 2405.16591 • Published May 26, 2024 • 1
