Contact
Research Interests
AIGC, Multimodule Pretraining including Text-to-image, LLM and MLLM pretrained model and Multiagent System
About me
Yuhang Ma (马宇航) serves as an AI researcher at Bytedance, focusing on Image Generation and Video Understanding. Previously, she worked at Fuxi AI Lab, NetEase Inc.(2022-2025), responsible for Danqing text-to-image generation model pretraining (Wechat mini app "丹青约"), Danqing VLM and LLM model fine-tuning and IP Consistency research. She obtained her master degree from University College London and bachelor degree from Hunan University. She studied at National University of Singapore as a visit student in 2019 Winnter semester.
Publications
* indicates equal contribution, † indicates project leader, ✉ indicates advising
-
- HPSv3: Towards Wide-Spectrum Human Preference Score
- Yuhang Ma*, Yunhao Shui*, Xiaoshi Wu, Keqiang Sun✉, Hongsheng Li✉
- ICCV 2025
- [ProjectPage] / [Paper] / [Code] / [Model] / [Dataset]
-
- ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
- Oucheng Huang*, Yuhang Ma*†, Zeng Zhao✉, Mingrui Wu, Jiayi Ji, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun, Rongrong Ji✉.
- arXiv 2025
- [ProjectPage] / [Paper] / [Code]
-
- Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection
- Yuhang Ma*†, Wenting Xu*, Chaoyi Zhao*, Keqiang Sun, Qinfeng Jin, Zeng Zhao✉, Changjie Fan, Zhipeng Hu.
- AAAI 2025
- [ProjectPage] / [Paper] / [Code] /
-
- Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
- Yuhang Ma*†, Wenting Xu*, Jiji Tang*, Qinfeng Jin, Rongsheng Zhang, Zeng Zhao✉, Changjie Fan, Zhipeng Hu.
- arXiv 2024
- [ProjectPage] / [Paper] / [Code]
-
- LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
- Mushui Liu*, Yuhang Ma*†, Xinfeng Zhang, Zhen Yang, Zeng Zhao✉, Bai Liu, Changjie Fan, Zhipeng Hu.
- AAAI 2025
- [ProjectPage] / [Paper] / [Code]
-
- You Can even Annotate Text with Voice: Transcription-only-Supervised Text Spotting
- Jingqun Tang*, Qiao Su*, Benlei Cui*, Yuhang Ma, Sheng Zhang, Dimitrios Kanoulas.
- ACM MM 2022
- [Paper]
Career
[05/2022-03/2025] Fuxi AI Lab, NetEase Inc.
[03/2025-Present] Bytedance
Hobbies
Werewolves(狼人杀) fanatics, LOL, Piano, Vlogger, a cat person owning 4 cats.