6 15 3

Xiaoke Huang

xk-huang

https://xk-huang.github.io/

xk-huang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

upvoted a paper 1 day ago

Scaling Zero-Shot Reference-to-Video Generation

upvoted a paper 9 days ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

View all activity

Organizations

upvoted a paper about 7 hours ago

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Paper • 2512.07802 • Published 2 days ago • 31

upvoted a paper 1 day ago

Scaling Zero-Shot Reference-to-Video Generation

Paper • 2512.06905 • Published 3 days ago • 28

upvoted a paper 9 days ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 9 days ago • 60

upvoted 3 papers about 1 month ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 80

MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

Paper • 2510.25867 • Published Oct 29 • 6

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28 • 39

upvoted a collection about 2 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 647

upvoted a paper 8 months ago

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models

Paper • 2504.00869 • Published Apr 1 • 10

upvoted a paper about 1 year ago

Story-Adapter: A Training-free Iterative Framework for Long Story Visualization

Paper • 2410.06244 • Published Oct 8, 2024 • 19

upvoted a paper over 1 year ago

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18, 2024 • 31

upvoted a paper almost 2 years ago

SILC: Improving Vision Language Pretraining with Self-Distillation

Paper • 2310.13355 • Published Oct 20, 2023 • 9

upvoted 2 papers about 2 years ago

Segment and Caption Anything

Paper • 2312.00869 • Published Dec 1, 2023 • 21

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 39

upvoted 2 papers over 2 years ago

Planting a SEED of Vision in Large Language Model

Paper • 2307.08041 • Published Jul 16, 2023 • 11

Improving Multimodal Datasets with Image Captioning

Paper • 2307.10350 • Published Jul 19, 2023 • 11

Xiaoke Huang

AI & ML interests

Recent Activity

Organizations

xk-huang's activity