This is a feature request for prediction caching for hashable data types, as implemented in academic serving systems such as Clipper (see "Clipper: A Low-Latency Online Prediction Serving System", section 4.2) and in similar frameworks such as Triton Inference Server. This could be very useful for non-custom runtimes such as the HuggingFace runtime.
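To make the request concrete, here is a minimal sketch of what caching on hashable inputs could look like. It is only an illustration under stated assumptions: the `PredictionCache` class, its `get_or_compute` method, and the `predict` callable are hypothetical names and are not the API of this project, Clipper, or Triton. It also assumes the request payload is JSON-serializable and that an LRU eviction policy is acceptable.

```python
import hashlib
import json
from collections import OrderedDict
from typing import Any, Callable


class PredictionCache:
    """Hypothetical LRU cache keyed by a digest of (model name, payload)."""

    def __init__(self, max_size: int = 128) -> None:
        self.max_size = max_size
        self._store: "OrderedDict[str, Any]" = OrderedDict()
        self.hits = 0
        self.misses = 0

    @staticmethod
    def make_key(model_name: str, payload: Any) -> str:
        # Assumption: the payload is JSON-serializable. sort_keys makes the
        # key independent of dictionary ordering.
        raw = json.dumps({"model": model_name, "input": payload}, sort_keys=True)
        return hashlib.sha256(raw.encode("utf-8")).hexdigest()

    def get_or_compute(
        self, model_name: str, payload: Any, predict: Callable[[Any], Any]
    ) -> Any:
        key = self.make_key(model_name, payload)
        if key in self._store:
            # Cache hit: mark the entry as most recently used.
            self._store.move_to_end(key)
            self.hits += 1
            return self._store[key]
        # Cache miss: run the model and remember the result.
        self.misses += 1
        result = predict(payload)
        self._store[key] = result
        if len(self._store) > self.max_size:
            # Evict the least recently used entry.
            self._store.popitem(last=False)
        return result
```

With a cache like this, a repeated request for the same model and input would call `predict` only once; subsequent identical requests would be served from memory. A real implementation would also need to decide how to invalidate entries when a model is reloaded or a new version is deployed.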