Yuhang Ma
AI researcher at Fuxi AI Lab, NetEase Inc.
Work Email: mayuhang (at) corp.netease.com
Personal Email: yuhang_ma0307 (at) 163.com / astronaut0307 (at) gmail.com
Research Interests
AIGC; multimodal pretraining, including text-to-image, LLM, and MLLM pretrained models; conditioned text-to-image generation, with a focus on IP consistency
Biography
Yuhang Ma (马宇航) is an AI researcher at Fuxi AI Lab, NetEase Inc., responsible for pretraining the Danqing text-to-image generation model, fine-tuning the Danqing VLM and LLM models, and IP consistency research. She obtained her master's degree from University College London and her bachelor's degree from Hunan University. She studied at the National University of Singapore as a visiting student in the 2019 winter semester.
Publications
(* indicates equal contribution, † indicates project leader, highlight indicates representative papers)
Yuhang Ma*†, Wenting Xu*, Chaoyi Zhao*, Keqiang Sun, Qinfeng Jin, Zeng Zhao, Changjie Fan, Zhipeng Hu
@misc{ma2024storynizorconsistentstorygeneration,
  title={Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection}, 
  author={Yuhang Ma and Wenting Xu and Chaoyi Zhao and Keqiang Sun and Qinfeng Jin and Zeng Zhao and Changjie Fan and Zhipeng Hu},
  year={2024},
  eprint={2409.19624},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2409.19624}, 
}
Yuhang Ma*†, Wenting Xu*, Jiji Tang*, Qinfeng Jin, Rongsheng Zhang, Zeng Zhao, Changjie Fan, Zhipeng Hu
@misc{ma2024characteradapterpromptguidedregioncontrol,
  title={Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization}, 
  author={Yuhang Ma and Wenting Xu and Jiji Tang and Qinfeng Jin and Rongsheng Zhang and Zeng Zhao and Changjie Fan and Zhipeng Hu},
  year={2024},
  eprint={2406.16537},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2406.16537}, 
}
Mushui Liu*, Yuhang Ma*†, Xinfeng Zhang, Zhen Yang, Zeng Zhao, Bai Liu, Changjie Fan, Zhipeng Hu
@misc{liu2024llm4genleveragingsemanticrepresentation,
  title={LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation}, 
  author={Mushui Liu and Yuhang Ma and Xinfeng Zhang and Yang Zhen and Zeng Zhao and Zhipeng Hu and Bai Liu and Changjie Fan},
  year={2024},
  eprint={2407.00737},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2407.00737}, 
}
Jingqun Tang*, Qiao Su*, Benlei Cui*, Yuhang Ma, Sheng Zhang, Dimitrios Kanoulas
@inproceedings{tang2022you,
  title={You can even annotate text with voice: Transcription-only-supervised text spotting},
  author={Tang, Jingqun and Qiao, Su and Cui, Benlei and Ma, Yuhang and Zhang, Sheng and Kanoulas, Dimitrios},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  pages={4154--4163},
  year={2022}
}
Career
[05/2022-Present] Fuxi AI Lab, NetEase Inc.
Hobbies
Werewolf (狼人杀) fanatic, LoL, piano, vlogging, and a cat person who owns 4 cats.