MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers Paper • 2002.10957 • Published Feb 25, 2020
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers Paper • 2012.15828 • Published Dec 31, 2020
s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning Paper • 2110.13640 • Published Oct 26, 2021
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts Paper • 2111.02358 • Published Nov 3, 2021
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers Paper • 2208.06366 • Published Aug 12, 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks Paper • 2208.10442 • Published Aug 22, 2022
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024