Contact

Education: UCL(2020-2021) ---> Hunan University(2016-2020)
Work Email: mayuhang.26942 (at) bytedance.com
Personal Email: yuhang_ma0307 (at) 163.com / astronaut0307 (at) gmail.com

Research Interests

AIGC, Multimodule Pretraining including Text-to-image, LLM and MLLM pretrained model and Multiagent System

About me

Yuhang Ma (马宇航) serves as an AI researcher at Bytedance, focusing on Image Generation and Video Understanding. Previously, she worked at Fuxi AI Lab, NetEase Inc.(2022-2025), responsible for Danqing text-to-image generation model pretraining (Wechat mini app "丹青约"), Danqing VLM and LLM model fine-tuning and IP Consistency research. She obtained her master degree from University College London and bachelor degree from Hunan University. She studied at National University of Singapore as a visit student in 2019 Winnter semester.

Publications

* indicates equal contribution, † indicates project leader, ✉ indicates advising

HPSv3: Towards Wide-Spectrum Human Preference Score
Yuhang Ma*, Yunhao Shui*, Xiaoshi Wu, Keqiang Sun, Hongsheng Li
ICCV 2025
[ProjectPage] / [Paper] / [Code] / [Model] / [Dataset]

ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
Oucheng Huang*, Yuhang Ma*, Zeng Zhao, Mingrui Wu, Jiayi Ji, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun, Rongrong Ji.
arXiv 2025
[ProjectPage] / [Paper] / [Code]

Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection
Yuhang Ma*, Wenting Xu*, Chaoyi Zhao*, Keqiang Sun, Qinfeng Jin, Zeng Zhao, Changjie Fan, Zhipeng Hu.
AAAI 2025
[ProjectPage] / [Paper] / [Code] /

Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma*, Wenting Xu*, Jiji Tang*, Qinfeng Jin, Rongsheng Zhang, Zeng Zhao, Changjie Fan, Zhipeng Hu.
arXiv 2024
[ProjectPage] / [Paper] / [Code]

LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
Mushui Liu*, Yuhang Ma*, Xinfeng Zhang, Zhen Yang, Zeng Zhao, Bai Liu, Changjie Fan, Zhipeng Hu.
AAAI 2025
[ProjectPage] / [Paper] / [Code]

You Can even Annotate Text with Voice: Transcription-only-Supervised Text Spotting
Jingqun Tang*, Qiao Su*, Benlei Cui*, Yuhang Ma, Sheng Zhang, Dimitrios Kanoulas.
ACM MM 2022
[Paper]

Career

[05/2022-03/2025] Fuxi AI Lab, NetEase Inc.
[03/2025-Present] Bytedance

Hobbies

Werewolves(狼人杀) fanatics, LOL, Piano, Vlogger, a cat person owning 4 cats.