University of Washington, Seattle, WAPhD, March 2020
Advisors: Prof. Luis Ceze and Prof. Arvind Krishnamurthy
University of Pennsylvania, Philadelphia, PAMSE, May 2015
Advisor: Prof. Joseph Devietti
Nankai University, Tianjin, ChinaBE, June 2013
National Scholarship 2011
Work Experience
Facebook — Senior Staff Research Scientist2020 – Present
Research and development in system/model co-design for AI workloads.
University of Washington — Research Assistant2015 – 2020
Conducted research in the SAMPA, SAMPL and Systems lab, focusing on building systems for more efficient communication in a shared, dynamic environment.
University of Pennsylvania — Research Assistant2014 – 2015
Research on lightweight sharing detection on x86 processors.
Microsoft2012 – 2017
Researcher Intern, Systems Research(Redmond, 2017) Parameter Hub: Balancing Compute and Communication for Efficient Parameter Servers.
Researcher Intern, Mobility and Networking Research(Redmond, 2016) AUDIT: Troubleshooting Production Systems with Blame-proportional Logging.
Software Engineering Intern, Kernel Platform Group(Redmond, 2015) X86 Emulation on ARM-based Windows 10 Devices.
Software Engineering Intern, Operating Systems Group(Redmond, 2014) Always-on configuration monitor for XBOX Key Management Service (XKMS).
Contractor, Heterogeneous Application Solutions Accelerating multimedia apps with OpenCL.
Publications
*: equal contribution
Journal Publications
[TMLR'25 J2C Award] Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models.
Weixin Liang, Lili Yu*, Liang Luo*, Srinivasan Iyer, Ning Dong, Chunting Zhou, Gargi Ghosh, Mike Lewis, Wen-tau Yih, Luke Zettlemoyer, Xi Victoria Lin.
Powering Meta Segment Anything 3, ByteDance Mogao and Bagel, Xiaomi-Robotics-0, Tencent HY-Embodied-0.5, and Alibaba WAN Weaver
[IEEE TPDS 2023] P4SGD.
Hongjing Huang, Yingtao Li, Jie Sun, Xueying Zhu, Jie Zhang, Liang Luo, Jialin Li, Zeke Wang.
[IEEE Micro 2017] IncBricks: Enables in-network computation with a programmable network middlebox.
Ming Liu, Liang Luo, Jacob Nelson, Arvind Krishnamurthy, Luis Ceze.
Selected as MICRO Top Picks 2018 Honorable Mention
Conference Publications
[ICML'26] Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction: Haoyu Peter Wang, Yuxin Chen, Liang Luo, Buyun Zhang, Ellie Dingqiao Wen, Pan Li.
[ACL'26 Findings] ReasonRec: A Reasoning-Augmented Multimodal Agent for Unified Recommendation. Yihua Zhang, Mingfu Liang, Jiyan Yang, Rong Jin, Wen-Yen Chen, Yiping Han, Huayu Li, Buyun Zhang, Liang Luo, Luke Simon, Sijia Liu, Tianlong Chen, Xi Liu.
[VLDB 2023] PyTorch FSDP. Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Shen Li.
Runner-up Best Paper Award (Industry Track)
[MLSys 2023] NeuroShard: Learning to shard embedding tables with a pre-trained cost model. Daochen Zha, Louis Feng, Liang Luo, Bhargav Bhushanam, Zirui Liu, Yusuo Hu, Jade Nie, Yuzhen Huang, Yuandong Tian, Arun Kejariwal, Xia Hu.
[MLSys 2022] SRIFTY: A throughput and cost Optimizer for public cloud-based distributed training. Liang Luo, Peter West, Pratyush Patel, Arvind Krishnamurthy, Luis Ceze.
[ISCA 2022] NEO: Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models. Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie (Amy) Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Dimitry Melts, Krishna Dhulipala, KR Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Guoqiang Jerry Chen, Manoj Krishnan, Avinash Nayak, Krishnakumar Nair, Bharath Muthiah, Mahmoud Khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Ajit Mathews, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao.
[NSDI 2022] NetHint: White-Box Networking for Multi-Tenant Data Centers. Jingrong Chen, Hong Zhang, Wei Zhang, Liang Luo, Jeffery Chase, Ion Stoica, Danyang Zhuo.
[IISWC 2021] Characterizing and Taming Resolution in Convolutional Neural Networks. Eddie Yan, Liang Luo, Luis Ceze.
[MLSys 2020] PLINK: Discovers and exploits datacenter network locality for efficient communication in cloud-based systems. Liang Luo, Peter West, Arvind Krishnamurthy, Luis Ceze, Jacob Nelson.
[USENIX ATC 2018] AUDIT: Troubleshoots transiently recurring errors in production systems with blame-proportional logging. Liang Luo, Lenin Ravindranath Sivalingam, Suman Nath, Madan Musuvathi, Luis Ceze.
[ACM SoCC 2018] Parameter Hub: Efficient software stack implementation for rack-level parameter servers. Liang Luo, Jacob Nelson, Luis Ceze, Amar Phanishayee, Arvind Krishnamurthy.
[MLSys 2018] Parameter Box: High performance parameter servers with balanced resource allocation. Liang Luo, Jacob Nelson, Luis Ceze, Amar Phanishayee, Arvind Krishnamurthy.
[ASPLOS 2017] IncBricks: Enables in-network computation with a programmable network middlebox. Ming Liu, Liang Luo, Jacob Nelson, Arvind Krishnamurthy, Luis Ceze.
[HPCA 2016] LASER: Uses PEBs events for sharing detection and online repair of false sharing with low overhead. Liang Luo, Akshitha Sriraman, Brooke Fugate, Shiliang Hu, Gilles Pokam, Chris Newburn, Joseph Devietti.
Workshop Publications
[KDD DLP 2022] DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction. Buyun Zhang*, Liang Luo*, Xi Liu, Jay Li, Zeliang Chen, Weilin Zhang, Xiaohan Wei, Yuchen Hao, Michael Tsang, Wenjun Wang, Yang Liu, Huayu Li, Yasmine Badr, Jongsoo Park, Jiyan Yang, Dheevatsa Mudigere, Ellie Wen.
[ASPLOS WAX 2017] INA: Motivates in-network aggregation for accelerating data-intensive applications. Liang Luo, Ming Liu, Jacob Nelson, Amar Phanishayee, Arvind Krishnamurthy, Luis Ceze.
Preprints
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts. Xi Victoria Lin, Akshat Shrivastava, Liang Luo, Srinivasan Iyer, Mike Lewis, Gargi Ghosh, Luke Zettlemoyer, Armen Aghajanyan.
Accelerating SpMM Kernel with Cache-First Edge Sampling for GNN Inference. Chien-Yu Lin, Liang Luo, Luis Ceze.
Cloud Collectives: Towards Cloud-aware Collectives for ML Workloads with Rank Reordering. Liang Luo, Jacob Nelson, Arvind Krishnamurthy, Luis Ceze.
Community Service
Program Committee, MLSys, 2023–2026
Program Committee, IISWC 2024
Reviewer, Journal of Future Generation Computer Systems, 2023