XiaNanWang98's picture

32 5

XiaNanWang98

XiaNanWang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

upvoted a paper 8 days ago

RadEdit: stress-testing biomedical vision models via diffusion image editing

upvoted a paper 8 days ago

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

View all activity

Organizations

None yet

XiaNanWang's activity

upvoted a paper 7 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 11 days ago • 181

upvoted 19 papers 8 days ago

RadEdit: stress-testing biomedical vision models via diffusion image editing

Paper • 2312.12865 • Published Dec 20, 2023 • 5

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Paper • 2312.13271 • Published Dec 20, 2023 • 6

SpecNeRF: Gaussian Directional Encoding for Specular Reflections

Paper • 2312.13102 • Published Dec 20, 2023 • 7

UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections

Paper • 2312.13285 • Published Dec 20, 2023 • 7

Model-Based Control with Sparse Neural Dynamics

Paper • 2312.12791 • Published Dec 20, 2023 • 7

Mini-GPTs: Efficient Large Language Models through Contextual Pruning

Paper • 2312.12682 • Published Dec 20, 2023 • 10

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

Paper • 2312.12487 • Published Dec 19, 2023 • 10

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Paper • 2312.12468 • Published Dec 19, 2023 • 11

Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models

Paper • 2312.13763 • Published Dec 21, 2023 • 11

Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

Paper • 2312.13469 • Published Dec 20, 2023 • 12

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Paper • 2312.12742 • Published Dec 20, 2023 • 14

TinySAM: Pushing the Envelope for Efficient Segment Anything Model

Paper • 2312.13789 • Published Dec 21, 2023 • 15

Splatter Image: Ultra-Fast Single-View 3D Reconstruction

Paper • 2312.13150 • Published Dec 20, 2023 • 16

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Paper • 2312.12490 • Published Dec 19, 2023 • 18

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis

Paper • 2312.13834 • Published Dec 20, 2023 • 27

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 28

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 36

DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation

Paper • 2312.13578 • Published Dec 21, 2023 • 29

DreamTuner: Single Image is Enough for Subject-Driven Generation

Paper • 2312.13691 • Published Dec 21, 2023 • 28