What API / spec is this serving / using? #84
cdrage
announced in
Announcements
Replies: 2 comments
-
The goal is to be a drop-in replacement for ollama, using llama.cpp in the same way. We eventually want to expand to other runtimes like vllm. We want to support pulling and pushing models from multiple image registries, which I am thinking of calling transports: huggingface, ollama, and OCI. There currently is not a lot of code here; it is mainly wrappers around other tools.
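The "transports" idea above could be sketched as a small dispatcher that splits a model reference into a registry type and a path, falling back to a default when no scheme is given. This is an illustrative sketch, not the project's actual API; the function name, the transport set, and the default are all assumptions.

```python
# Hypothetical sketch of "transports": a scheme prefix selects which
# registry a model is pulled from (huggingface, ollama, or OCI).
SUPPORTED_TRANSPORTS = {"huggingface", "ollama", "oci"}

def parse_model_ref(ref, default_transport="ollama"):
    """Split e.g. 'huggingface://org/model' into ('huggingface', 'org/model').

    A bare reference like 'tinyllama' falls back to the default transport.
    (Names and defaults here are assumptions for illustration.)
    """
    if "://" in ref:
        transport, _, path = ref.partition("://")
    else:
        transport, path = default_transport, ref
    if transport not in SUPPORTED_TRANSPORTS:
        raise ValueError(f"unknown transport: {transport}")
    return transport, path
```

A caller would then hand the `(transport, path)` pair to the matching wrapper tool for the actual pull or push.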
-
Ollama models can be pulled. For inferencing, it's all llama.cpp, although we will likely add vllm as a backend as well.
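Since inference goes through llama.cpp, clients would presumably talk to llama.cpp's built-in server, which exposes an OpenAI-compatible chat-completions endpoint. A minimal request body might look like the sketch below; the model name and the `stream` default are assumptions, not confirmed by this thread.

```python
import json

def chat_request(model, prompt):
    """Build a minimal OpenAI-style /v1/chat/completions request body,
    the kind llama.cpp's server accepts. (Illustrative sketch only.)"""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }

# Serialize for POSTing to the server's chat-completions endpoint.
body = json.dumps(chat_request("tinyllama", "Hello"))
```

Because the wire format is the OpenAI one, existing OpenAI-compatible clients should be able to point at such a server without changes.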
-
When using this, I'm confused about what API / spec it uses.
Is this a drop-in replacement for ollama? Is it OpenAI spec compatible? Is it the ollama spec? I'm unsure what spec this is offering, so I can't tell whether I can use it as a drop-in replacement container for ollama.