
Add model card in response for ModelMetadata API #5750

Open
wants to merge 1 commit into main
Conversation

yeahdongcn
Contributor

@yeahdongcn yeahdongcn commented May 6, 2023

Model cards are a concept from Hugging Face: they accompany models and provide handy information. Under the hood, model cards are simple Markdown files with additional metadata.

Since the Triton server can integrate with various file-system providers (local, S3, GCS, etc.), it is straightforward to extend the model hierarchy to support model cards.

My proposal is to add a README.md to each model served by the Triton server, so that a ModelMetadata API call returns the model card in the response.

Use a minimal model repository for a TorchScript model as an example:

  <model-repository-path>/
    <model-name>/
      config.pbtxt
      README.md
      1/
        model.pt
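The proposed server-side behavior can be sketched in Python. This is only an illustration of the idea, not the actual Triton core change: the `build_model_metadata` function, the `model_card` response field name, and the `resnet50` model are all hypothetical.

```python
import json
import os
import tempfile

def build_model_metadata(repo_path, model_name):
    """Sketch of the proposal: if <model-repository-path>/<model-name>/README.md
    exists, include its contents in the ModelMetadata response."""
    metadata = {"name": model_name, "platform": "pytorch_libtorch"}
    card_path = os.path.join(repo_path, model_name, "README.md")
    if os.path.isfile(card_path):
        with open(card_path, encoding="utf-8") as f:
            metadata["model_card"] = f.read()  # hypothetical field name
    return metadata

# Build a throwaway model repository matching the layout above.
repo = tempfile.mkdtemp()
os.makedirs(os.path.join(repo, "resnet50", "1"))
with open(os.path.join(repo, "resnet50", "README.md"), "w", encoding="utf-8") as f:
    f.write("# resnet50\nAn image classification model.\n")

print(json.dumps(build_model_metadata(repo, "resnet50"), indent=2))
```

Models without a README.md would simply return metadata without the extra field, so the card stays optional.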

Other PRs for supporting this feature:

Testing done:

  • Build passes
  • Invoked the ModelMetadata API using the Golang client and the Python HTTP/gRPC SDKs; everything works for models with and without a README.md.

@yeahdongcn
Contributor Author

@GuanLuo Could you please also review this proposal? Thanks.

@yeahdongcn
Contributor Author

Any chance to get this discussed/reviewed? @dyastremsky

@dyastremsky
Contributor

Sure! Thank you for your contribution. Created a ticket; someone will look into this soon.

@yeahdongcn
Contributor Author

We are trying to turn the Triton server into an on-premises Hugging Face by building a web UI and a small aggregation server. This is still at an early stage, and the model card (README.md) is optional when adding a model.

By leveraging the well-structured API implementation, we can easily fetch the contents of README.md and process it further in the aggregation server.
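The aggregation-server side could look like the following sketch, which pulls the card out of a metadata response and extracts a display title. The `model_card` field name and the response shape are assumptions; the real field depends on how the protocol extension lands.

```python
def extract_card(metadata: dict) -> tuple[str, str]:
    """Return (title, body) parsed from the model card in a metadata response.
    The "model_card" key is a hypothetical field name."""
    card = metadata.get("model_card", "")
    title = ""
    for line in card.splitlines():
        if line.startswith("# "):  # first top-level Markdown heading
            title = line[2:].strip()
            break
    return title, card

response = {  # shape of a hypothetical ModelMetadata HTTP response
    "name": "resnet50",
    "platform": "pytorch_libtorch",
    "model_card": "# ResNet-50\n\nAn image classification model.\n",
}
title, body = extract_card(response)
print(title)  # -> ResNet-50
```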

Here are 2 screenshots of our hub:

[screenshots: "models" list page and "model" detail page]

@GuanLuo
Contributor

GuanLuo commented Jul 19, 2023

Hi @yeahdongcn, thanks for submitting the PR; it brought to the attention of the KServe Open Inference Protocol group that there is a need to extend the model metadata protocol (Triton's ModelMetadata API is an implementation of that protocol).

The extension proposal is still being worked on; it aims to be a generic solution for providing model properties not only as a Hugging Face model card but also in other formats. That said, I think the change required to this PR will be minimal, although we will need to wait until the protocol has been relaxed.

@yeahdongcn
Contributor Author

yeahdongcn commented Jul 19, 2023

@GuanLuo Thanks for letting me know. It would be great if they extend the protocol.

Just want to know your thoughts on how to store this metadata: do you prefer putting everything in config.pbtxt?

@GuanLuo
Contributor

GuanLuo commented Jul 20, 2023

I think in this case (the HF model card), it would be better to store it as a separate file in the model directory and reference its relative path in config.pbtxt (i.e. parameters [ {key: "HuggingFaceModelCard", value: "README.md"}]). When the model metadata is requested, Triton will read the file content and put it into the response. This is basically the change you made in the core PR, except that the file name will be read from the config instead of being a fixed string (README.md).
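Written out as a config.pbtxt fragment, the suggestion might look as follows. Note that in Triton's ModelConfig protobuf, parameter values are wrapped in a string_value field, and the "HuggingFaceModelCard" key name here is only a proposal, not an agreed-upon name:

  parameters: {
    key: "HuggingFaceModelCard"
    value: { string_value: "README.md" }
  }

Keeping the path in a parameter (rather than hardcoding README.md) would let each model opt in and choose its own card file name.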
