-
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Paper • 2312.10835 • Published • 7 -
LIME: Localized Image Editing via Attention Regularization in Diffusion Models
Paper • 2312.09256 • Published • 12 -
PromptBench: A Unified Library for Evaluation of Large Language Models
Paper • 2312.07910 • Published • 19 -
Prompt Expansion for Adaptive Text-to-Image Generation
Paper • 2312.16720 • Published • 6
Collections
Discover the best community collections!
Collections including paper arxiv:2403.13447
-
DocGraphLM: Documental Graph Language Model for Information Extraction
Paper • 2401.02823 • Published • 36 -
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 63 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 181 -
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Paper • 2309.01131 • Published • 1
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 63 -
TinyLlama: An Open-Source Small Language Model
Paper • 2401.02385 • Published • 92 -
Simple and Scalable Strategies to Continually Pre-train Large Language Models
Paper • 2403.08763 • Published • 50 -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 58
-
Magnitude Invariant Parametrizations Improve Hypernetwork Learning
Paper • 2304.07645 • Published • 1 -
HyperShot: Few-Shot Learning by Kernel HyperNetworks
Paper • 2203.11378 • Published • 1 -
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
Paper • 2211.15457 • Published • 1 -
Continual Learning with Dependency Preserving Hypernetworks
Paper • 2209.07712 • Published • 1
-
Woodpecker: Hallucination Correction for Multimodal Large Language Models
Paper • 2310.16045 • Published • 16 -
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Paper • 2310.14566 • Published • 27 -
SILC: Improving Vision Language Pretraining with Self-Distillation
Paper • 2310.13355 • Published • 9 -
Conditional Diffusion Distillation
Paper • 2310.01407 • Published • 20