Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Published in the 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC'24), Pisa, Italy, June 3-7, 2024. Acceptance rate: 17% (26/152).