Nurmukhamed's Collections
good-papers
updated
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
Paper • 2310.19909 • Published • 21
Memory Augmented Language Models through Mixture of Word Experts
Paper • 2311.10768 • Published • 18
FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper • 2311.01282 • Published • 37
Prompt Cache: Modular Attention Reuse for Low-Latency Inference
Paper • 2311.04934 • Published • 34
Exponentially Faster Language Modelling
Paper • 2311.10770 • Published • 119
Weight subcloning: direct initialization of transformers using larger pretrained ones
Paper • 2312.09299 • Published • 19
MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices
Paper • 2312.16886 • Published • 21
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper • 2312.15166 • Published • 59
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 24
Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models
Paper • 2401.06102 • Published • 23
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Paper • 2401.10774 • Published • 59
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 179
StepWiser: Stepwise Generative Judges for Wiser Reasoning
Paper • 2508.19229 • Published • 19