DiffServe: Efficiently Serving Text-to-Image Diffusion Models with Query-Aware Model Scaling

Published in the Eighth Annual Conference on Machine Learning and Systems, Santa Clara, May 12-15, 2025. Acceptance rate: 22% (61/271).