Publications
All Learning Algorithms & Systems Model Serving & Inference Edge & On-Device ML Agentic Systems AI for Systems Miscellaneous
2026
- SenSys'26 WildFiT: Autonomous In-situ Model Adaptation for Resource-Constrained IoT Systems.
Mohammad Mehdi Rastikerdar, Jin Huang, Hui Guan, Deepak Ganesan
ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems, 2026 - MLSys'26 ProTrain: Efficient LLM Training via Automatic Memory Management. [PDF]
Hanmei Yang, Jin Zhou, Yao Fu, Xiaoquan Wang, Ramine Roane, Hui Guan, Tongping Liu
The 9th Annual Conference on Machine Learning and Systems (MLSys 2026), Bellevue, WA, May 18-22, 2026 - PacificVis'26 A Four-Stage Framework of Visual Complexity and Trust as Mediated by Effort.
Kylie Lin, Hui Guan, David N Rapp, Cindy Xiong Bearfield
IEEE Pacific Visualization Conference, 2026 - MMSys'26 Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse.
Kunjal Panchal, Saayan Mitra, Somdeb Sarkhel, Haoliang Wang, Ishita Dasgupta, Gang Wu, Hui Guan
ACM Multimedia Systems Conference, 2026
2025
- arXiv'25 Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents. [PDF]
Zhenning Yang, Hui Guan, Victor Nicolet, Brandon Paulsen, Joey Dodds, Daniel Kroening, Ang Chen
arXiv, 2025 - VLDB'25 Graph neural network training systems: A performance comparison of full-graph and mini-batch. [PDF]
Saurabh Bajaj, Hojae Son, Juelin Liu, Hui Guan, and Marco Serafini
Proceedings of the VLDB Endowment, 2025 - EXAIT@ICML'25 Reimagining Parameter Space Exploration with Diffusion Models. [PDF]
Lijun Zhang, Xiao Liu, Hui Guan
First Exploration in AI Today Workshop at ICML (EXAIT at ICML 2025) - TKDD'25 Recurrent Neural Networks Meet Context-Free Grammar: Two Birds with One Stone. [PDF]
Hui Guan, Umang Chaudhary, Yuanchao Xu, Lin Ning, Lijun Zhang, Xipeng Shen
ACM Transactions on Knowledge Discovery from Data, Volume 20, Issue 2, 2025 - Neurocomputing@25 Attacking all tasks at once using adversarial examples in multi-task learning. [PDF]
Lijun Zhang, Xiao Liu, Kaleel Mahmood, Caiwen Ding, and Hui Guan
Neurocomputing, 2025 - ARITH'25 An Empirical Study of Microscaling Formats for Low-Precision LLM Training. [PDF]
Hanmei Yang, Summer Deng, Amit Nagpal, Maxim Naumov, Mohammad Janani, Tongping Liu, Hui Guan
32nd IEEE Symposium on Computer Arithmetic, Jun 23-25, 2025 - MLSys'25 SPA: Scaling Graph Neural Network Training on Large Graphs via Probablistic Splitting. [PDF]
Sandeep Polisetty, Juelin Liu, Yi Fung, Seung-Hwan Lim, Hui Guan, Marco Serafini
The Eighth Annual Conference on Machine Learning and Systems, Santa Clara, May 12-15, 2025 (Acceptance Rate = 22% (61/271)) - MLSys'25 DiffServe: Efficiently Serving Text-to-Image Diffusion Models with Query-Aware Model Scaling. [PDF]
Sohaib Ahmad (co-first author), Qizheng Yang (co-first author), Haoliang Wang, Ramesh K. Sitaraman, and Hui Guan
The Eighth Annual Conference on Machine Learning and Systems, Santa Clara, May 12-15, 2025. (Acceptance Rate = 22% (61/271)) - CHI EA'25 What Makes a Visualization Visually Complex?. [PDF]
Kylie Lin, Sean Sheng-tse Ru, David N Rapp, Hui Guan, Cindy Xiong Bearfield
Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025
2024
- NeurIPS'24 Thinking Forward: Memory-Efficient Federated Finetuning of Language Models. [PDF] [Code]
Kunjal Panchal, Nisarg Parikh, Sunav Choudhary, Lijun Zhang, Yuriy Brun, Hui Guan
NeurIPS '24, Mon, Dec 9, 2024 – Sun, Dec 15, 2024, Vancouver - NeurIPS'24 Attack-Resilient Image Watermarking Using Stable Diffusion. [PDF] [Code]
Lijun Zhang, Xiao Liu, Antoni Viros i Martin, Cindy Xiong Bearfield, Yuriy Brun, Hui Guan
NeurIPS '24, Mon, Dec 9, 2024 – Sun, Dec 15, 2024, Vancouver - MLforSys@NeurIPS'24 Understanding and Alleviating Memory Issue in RLHF for LLMs. [PDF]
Jin Zhou, Hanmei Yang, Steven Jiaxun Tang, Mingcan Xiang, Hui Guan, Tongping Liu
NeurIPS'24 Workshop MLforSys, Dec 14, 2024, Vancouver - AI4Mat@NeurIPS'24 Integrating Graph Neural Networks and Many-Body Expansion Theory for Potential Energy Surfaces. [PDF]
Siqi Chen, Zhiqiang Wang, Xianqi Deng, Yili Shen, Cheng-Wei Ju, Jun Yi, Lin Xiong, Guo Ling, Dieaa Alhmoud, Hui Guan, Zhou Lin
NeurIPS'24 Workshop AI4Mat, Dec 15, 2024, Vancouver - ACM MM'24 AdapMTL: Adaptive Pruning Framework for Multitask Learning Model. [PDF]
Mingcan Xiang, Jiaxun Tang, Qizheng Yang, Hui Guan, Tongping Liu
ACM MM '24, October 28-November 1, 2024, Melbourne, VIC, Australia
https://doi.org/10.1145/3664647.3681426 - MIPR'24 Structured Pruning for Multi-Task Deep Neural Networks. [PDF]
Siddhant Garg, Lijun Zhang, Hui Guan
International Conference on Multimedia Information Processing and Retrieval, August 07, 2024 - MobiSys'24 CACTUS: Dynamically Switchable Context-aware micro-Classifiers for Efficient IoT Inference. [PDF] [Code]
Mohammad Mehdi Rastikerdar, Jin Huang, Shiwei Fang, Hui Guan, Deepak Ganesan
The 22nd ACM International Conference on Mobile Systems, Applications, and Services (MobiSys), Tokyo, Japan, June 3-7, 2024
https://doi.org/10.1145/3643832.3661888 - IEEE Access'24 Information-Enhanced Graph Neural Network for Transcending Homophily Barriers. [PDF]
Xiao Liu, Lijun Zhang, Hui Guan
In IEEE Access, 2024 - HPDC'24 Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling. [PDF]
Sohaib Ahmad, Hui Guan, Ramesh K. Sitaraman
The 33rd International Symposium on - EuroSys'24 GMorph: Accelerating Multi-DNN Inference via Model Fusion. [PDF] [Code]
Qizheng Yang, Tianyi Yang, Mingcan Xiang, Lijun Zhang, Haoliang Wang, Marco Serafini, Hui Guan
The 2024 European Conference on Computer Systems (EuroSys), April 22-25, 2024
https://doi.org/10.1145/3627703.3650074 - ASPLOS'24 Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling. [PDF] [Code]
Sohaib Ahmad, Hui Guan, Brain D. Friedman, Thomas Williams, Ramesh K. Sitaraman, Thomas Woo
The 2024 ACM Conference on Architectural Support for Programming Languages and Operating Systems, April 27-May 1, 2024
https://doi.org/10.1145/3617232.3624849
2023
- NeurIPS'23 Flow: Per-instance Personalized Federated Learning. [PDF] [Code]
Kunjal Panchal, Sunav Choudhary, Nisarg Parikh, Lijun Zhang, Hui Guan
The 2023 Conference on Neural Information Processing Systems, Dec. 10-16, 2023 - PACT'23 GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs. [PDF] [Code]
Juelin Liu, Sandeep Polisetty, Hui Guan, Marco Serafini
The 32nd International Conference on Parallel Architectures and Compilation Techniques, Oct. 21-25, 2023 - MobiCom'23 Re-thinking computation offload for efficient inference on IoT devices with duty-cycled radios. [PDF]
Jin Huang, Hui Guan, Deepak Ganesan
The 29th International Conference on Mobile Computing and Networking, Madrid, Spain, Oct. 2-6, 2023 - ICML'23 Flash: Concept Drift Adaptation in Federated Learning. [PDF]
Kunjal Panchal, Sunav Choudhary, Subrata Mitra, Koyel Mukherjee, Somdeb Sarkhel, Saayan Mitra, Hui Guan
40th International Conference on Machine Learning, Jul. 23-29, 2023 - ICML'23 Automatically marginalized MCMC in probabilistic programming. [PDF]
Jinlin Lai, Javier Burroni, Hui Guan, Daniel Sheldon
40th International Conference on Machine Learning, Jul. 23-29, 2023 - TNNLS'23 A Tree-Structured Multi-Task Model Architectures Recommendation System. [PDF] [Code]
Lijun Zhang, Xiao Liu, Hui Guan
IEEE Transactions on Neural Networks and Learning Systems, 2023 - ISMM'23 NUMAlloc: A Faster NUMA Memory Allocator. [PDF]
Hanmei Yang, Xin Zhao, Jin Zhou, Wei Wang, Sandip Kundu, Bo Wu, Hui Guan, and Tongping Liu
ACM SIGPLAN International Symposium on Memory Management, 2023 - IEEE Access'23 An Alternative Hard-Parameter Sharing Paradigm for Multi-Domain Learning. [PDF]
Lijun Zhang, Qizheng Yang, Xiao Liu, Hui Guan
In IEEE Access, 2023
2022
- NeurIPS'22 AutoMTL: A Programming Framework for Automating Efficient Multi-Task Learning. [PDF] [Code]
Lijun Zhang, Xiao Liu, Hui Guan
36th Conference on Neural Information Processing Systems (NeurIPS 2022), November 28, 2022. (Acceptance rate: 25.6%) - VLDB'22 COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression. [PDF]
Sian Jin, Chengming Zhang, Xintong Jiang, Yunhe Feng, Hui Guan, Guanpeng Li, Shuaiwen Leon Song, and Dingwen Tao
In International Conference on Very Large Data Bases, 2022 - ICME'22 Rethinking Hard-Parameter Sharing in Multi-Domain Learning. [PDF]
Lijun Zhang, Qizheng Yang, Xiao Liu, Hui Guan
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), Taipei, Taiwan, July 18-22, 2022. (Acceptance rate: 29%) - AutoML'22 A Tree-Structured Multi-Task Model Recommender. [PDF] [Code] [Teaser] [Video]
Lijun Zhang, Xiao Liu, Hui Guan
1st International Conference on Automated Machine Learning, July 25-27, 2022. (Acceptance rate: 19.2%) - AI4Science@ICML'22 Improving Subgraph Representation Learning via Multi-View Augmentation. [PDF] [talk]
Yili Shen, Jiaxu Yan, Cheng-Wei Ju, Jun Yi, Zhou Lin, Hui Guan
[ICML 2022 AI4Science Workshop](http://ai4science.net/icml22/) - CrossFL'22 Flow: Fine-grained Personalized Federated Learning through Dynamic Routing. [PDF] [Poster]
Kunjal Panchal, Hui Guan
[CrossFL 2022 Workshop @ MLSys'22](https://crossfl2022.github.io/program/) - CGO'22 Enabling Near Real-Time NLU-Driven Natural Language Programming through Dynamic Grammar Graph-Based Translation. [PDF]
Zifan Nan, Xipeng Shen, Hui Guan
The 2022 International Symposium on Code Generation and Optimization (CGO), Seoul, South Korea, 2022
2021
- ICDM'21 Recurrent Neural Networks Meet Context-Free Grammar: Two Birds with One Stone. [PDF]
Hui Guan, Umang Chaudhary, Yuanchao Xu, Lin Ning, Lijun Zhang, and Xipeng Shen
In IEEE International Conference on Data Mining, 2021 (short paper). (Acceptance rate: 20% (198/990)) - OSR'21 Scalable Graph Neural Network Training: The Case for Sampling. [PDF]
Marco Serafini, Hui Guan
In ACM SIGOPS Operating Systems Review, 2021 - MCHPC'21 FreeLunch: Compression-based GPU Memory Management for Convolutional Neural Networks. [PDF]
Shaurya Patel, Tongping Liu, Hui Guan
In MCHPC'21 Workshop - ICS'21 NumaPerf: Predictive and Comprehensive NUMA Profiling. [PDF]
Xin Zhao, Jin Zhou, Hui Guan, Wei Wang, Xu Liu, Tongping Liu
In Proceedings of International Conference on Supercomputing, 2021. (Acceptance rate: 25% (39/157)) - CACM'21 CoCoPIE: Enabling Real-Time AI on Off-the-Shelf Mobile Devices via Compression-Compilation Co-Design. [PDF]
Hui Guan, Shaoshan Liu, Xiaolong Ma, Wei Niu, Bin Ren, Xipeng Shen, Yanzhi Wang, Pu Zhao. (Authors in Alphabetical Order)
In Communications of the ACM, 2021 - InformationSystems'21 Reuse-Centric K-Means Configuration.
Lijun Zhang, Hui Guan, Yufei Ding, Xipeng Shen, Hamid Krim
Information Systems, 2021 - CC'21 Deep NLP-Based Co-Evolvement for Synthesizing Code Analysis from Natural Language. [PDF]
Zifan Nan, Hui Guan, Xipeng Shen, and Chunhua Liao
In The ACM SIGPLAN 2021 International Conference on Compiler Construction, 2021
2020
- TPDS'20 An Automatic Synthesizer of Advising Tools for High Performance Computing. [PDF]
Hui Guan, Xipeng Shen, and Hamid Krim
In IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020 - FSE'20 HISyn: Human Learning-Inspired Natural Language Programming. [PDF]
Zifan Nan, Hui Guan, Xipeng Shen
In The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Sacramento, California, United States, November 2020. (Acceptance rate: 101/360=28%) - MLSys'20 FLEET: Flexible Efficient Ensemble Training for Heterogeneous Deep Neural Networks. [PDF]
Hui Guan, Laxmikant Kishor Mokadam, Xipeng Shen, Robert Patton
MLSys'20. (Acceptance rate: 20.0% (34/170))
2019
- NeurIPS'19 In-Place Zero-Space Memory Protection for CNN. [PDF]
Hui Guan, Lin Ning, Zhen Lin, Xipeng Shen, Huiyang Zhou, and Seung-Hwan Lim
In Advances in Neural Information Processing Systems, pp. 5735-5744. 2019. (Acceptance rate: 21.2% (1428/6743)) - MLSys@NeurIPS'19 Post-Training 4-bit Quantization on Embedding Tables. [PDF]
Hui Guan, Andrey Malevich, Jiyan Yang, Jongsoo Park, and Hector Yuen
MLSys Workshop on Systems for ML @ NeurIPS, 2019 - PLDI'19 Wootz: a Compiler-based Framework for Fast CNN Pruning via Composability. [PDF]
Hui Guan, Xipeng Shen, and Seung-Hwan Lim
In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 717-730. ACM, 2019. (Acceptance rate: 27.7% (76/274)) - ICDE'19 Adaptive Deep Reuse: Accelerating CNN Training on the Fly. [PDF]
Lin Ning, Hui Guan, and Xipeng Shen
In 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 1538-1549. IEEE, 2019. (Acceptance rate: 18%)
2018
- SC'18 Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines. [PDF]
Randall Pittman, Hui Guan, Xipeng Shen, Seung-Hwan Lim, and Robert M. Patton
In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, p. 64. IEEE, 2018. (Acceptance rate: 23%) - ICDE'18 Reuse-Centric K-Means Configuration. [PDF]
Hui Guan, Yufei Ding, Xipeng Shen, and Hamid Krim
In 2018 IEEE 34th International Conference on Data Engineering (ICDE), pp. 1224-1227. IEEE, 2018. (short paper) (Acceptance rate: 23%) - SysML'18 TOP: A Compiler-Based Framework for Optimizing Machine Learning Algorithms through Generalized Triangle Inequality.
Yufei Ding, Lin Ning, Hui Guan, Xipeng Shen, Madanlal Musuvathi, Todd Mytkowicz
SysML, Feb 16th, 2018, Stanford University, 2018
2017
- SC'17 Egeria: a Framework for Automatic Synthesis of HPC Advising Tools through Multi-Layered Natural Language Processing. [PDF]
Hui Guan, Xipeng Shen, and Hamid Krim
In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, p. 10. ACM, 2017. (Acceptance rate: 18% (61/327)) - PLDI'17 Generalizations of the Theory and Deployment of Triangular Inequality for Compiler-Based Strength Reduction. [PDF]
Yufei Ding, Lin Ning, Hui Guan, and Xipeng Shen
In Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 33-48. ACM, 2017. (Acceptance rate: 15% (47/322))
2016
- SPAWC'16 A topological collapse for document summarization. [PDF]
Hui Guan, Wen Tang, Hamid Krim, James Keiser, Andrew Rindos, and Radmila Sazdanovic
In 2016 IEEE 17th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), pp. 1-5. IEEE, 2016