Apparate: Early-Exit Models for ML Latency and Throughput Optimization - Latency-Focused Adjustments
:::info
Authors:
(1) Yinwei Dai, Princeton University (Equal contributions);
(2) Rui Pan, Princeton University (Equal co...
All Rights Reserved. Copyright , Central Coast Communications, Inc.