Proteus: A High-Throughput Inference-Serving System with Accuracy ScalingPublished in The 2024 ACM Conference on Architectural Support for Programming Languages and Operating Systems, April 27-May 1, 2024, 2024Share on Twitter Facebook LinkedIn Previous Next