After hours: March 14 at 6:05:11 PM EDT ...
A Rust, Python and gRPC server for text generation inference. Used in production at Hugging Face to power Hugging Chat, the Inference API and Inference Endpoint.