Add initial tests for repo subcommand #21
Conversation
…when no mode is specified, unify server/profile code to helper functions, add more argparse default info
Co-authored-by: Francesco Petrini <[email protected]>
…triton_cli into rmccormick-test
def test_repo_add_vllm(self, model, source):
    self.repo_clear()
    self.repo_add(model, source)
    # TODO: Parse repo to find model, with vllm backend in config
Sounds like we should re-tool this to accept a search target parameter so it can detect any arbitrary backend.
Some kind of repo validator would be great. For well-defined backends, it can validate more strictly:
all backends:
- config.pbtxt contains 'backend: "<backend>"'
- contains version folder
vllm:
- version folder contains model.json
# NOTE: would probably hold back on spending time on trt-llm validation right now,
# I think there are several competing simplification efforts of the backend going on.
# This validation logic may not hold true for long.
trt-llm:
- contains expected/necessary composing models
- sets gpt_model_path/required params
onnx:
- version folder contains some *.onnx
...
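The layout rules above could be sketched as a small validator. This is a minimal sketch, not the triton_cli implementation; the function name and the decision to return a list of problem strings are assumptions for illustration:

```python
from pathlib import Path


def validate_model_repo_entry(repo_dir: str, model: str, backend: str) -> list:
    """Check common repo-layout rules for one model; return a list of problems found."""
    problems = []
    model_dir = Path(repo_dir) / model
    config = model_dir / "config.pbtxt"

    # All backends: config.pbtxt names the expected backend.
    if not config.is_file():
        problems.append("missing config.pbtxt")
    elif f'backend: "{backend}"' not in config.read_text():
        problems.append(f'config.pbtxt does not contain backend: "{backend}"')

    # All backends: at least one numeric version folder exists.
    versions = [d for d in model_dir.glob("*") if d.is_dir() and d.name.isdigit()]
    if not versions:
        problems.append("no version folder found")

    # Backend-specific checks.
    if backend == "vllm" and not any((v / "model.json").is_file() for v in versions):
        problems.append("vllm: no version folder contains model.json")
    if backend in ("onnx", "onnxruntime") and not any(
        list(v.glob("*.onnx")) for v in versions
    ):
        problems.append("onnx: no version folder contains a *.onnx file")

    return problems
```

A stricter version could parse the config with the model config protobuf instead of string matching, but string matching matches the loose check described above.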
If you have some ideas, feel free to add a ticket. I probably won't implement this for initial testing.
This repo subcommand may actually be a great place to experiment with an "ensemble" generator/helper/tool in the future - I think @oandreeva-nv was interested in something like this and has a ticket somewhere.
For all outputs -> all inputs cases, I think it may be straightforward. For some inputs -> some outputs cases, it may be a cumbersome thing to define on a CLI. If there was some alternative way to define ensembles via a simple config (json, DAG, etc.) - then maybe the CLI would just consume that to generate the triton configs.
A repo validator would be nice! I think we can leverage our client's tritonclient.grpc.model_config_pb2
module to validate the major things in the config file (e.g., syntax is valid, backend field exists, etc.) and then take the nice JSON form it dumps to perform more complex validation with ease (e.g., specified backend is valid, dims are correctly formed, etc.).
I think this is the ticket you are referring to. For my own clarity, "all outputs -> all inputs" refers to checking the config.pbtxt file for the ensemble model and verifying the inputs/outputs specified in the ensemble_scheduling steps exist within the submodels, correct?
I think this is the ticket you are referring to.
Yep that's the one. There are actually two features here - validating an ensemble config is correct (and providing perhaps a more useful error when incorrect, compared to what core outputs), and generating or simplifying the generation of an ensemble config.
For my own clarity, "all outputs -> all inputs" refers to checking the config.pbtxt file for the ensemble model and verifying the inputs/outputs specified in the ensemble_scheduling steps exist within the submodels, correct?
For this statement, I was thinking in terms of generating ensemble configs, rather than validating them. I meant that if there was an unambiguous mapping that could be inferred when connecting Model A -> Model B (ex: Model A has 1 output, Model B has 1 input), we could just do it. If there is an ambiguous mapping (Model A has 3 outputs, Model B has 2 inputs - which get mapped?), we probably can't do much.
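The "only infer when unambiguous" rule could be sketched like this (a toy helper, not part of triton_cli; it also tries exact tensor-name matching as a second unambiguous case):

```python
def infer_tensor_map(a_outputs, b_inputs):
    """Connect Model A outputs to Model B inputs only when the mapping is unambiguous.

    Returns {b_input: a_output} if exactly one pairing is defensible, else None.
    """
    # Trivial 1:1 case: a single output feeding a single input.
    if len(a_outputs) == 1 and len(b_inputs) == 1:
        return {b_inputs[0]: a_outputs[0]}
    # Name-based match: every B input has an identically named A output.
    if set(b_inputs) <= set(a_outputs):
        return {name: name for name in b_inputs}
    # Ambiguous (e.g. 3 outputs -> 2 inputs with no name overlap): give up.
    return None
```

In the ambiguous case the CLI could fall back to the config-driven approach (JSON/DAG) described above instead of guessing.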
LGTM 🎊
Will follow up with some of the TODOs and other tests separately. I want to get some baseline automated tests running and keep PRs on the smaller side for easier reviews.