inference
: Examples for sending inference requestsllm/chat_completions.py
: Python example of sending chat completion requests to OpenAI compatible serverllm/completions.py
: Python example of sending completion requests to OpenAI compatible serverllm/completions.sh
: Bash example of sending completion requests to OpenAI compatible server, supports JSON modetext_embedding/embeddings.py
: Python example of sending text embedding requests to OpenAI compatible servervlm/vision_completions.py
: Python example of sending chat completion requests with image attached to prompt to OpenAI compatible server for vision language models
logits
: Example for logits generationlogits.py
: Python example of getting logits from hosted model.
api
: Examples for using the Python APIbasic_usage.py
: Basic Python example demonstrating the Vector Inference API
slurm_dependency
: Example of launching a model withvec-inf
and running a downstream SLURM job that waits for the server to be ready before sending a request.
Files
examples
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||