GPUStack

Open-source GPU cluster manager for running large language models (LLMs)

Pinned

  1. gpustack Public

    Manage GPU clusters for running AI models

    Python, 2.7k stars, 275 forks

  2. gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go, 169 stars, 16 forks

  3. llama-box Public

    LM inference server implementation based on *.cpp.

    C++, 196 stars, 16 forks

  4. vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python, 115 stars, 14 forks

Repositories

Showing 9 of 9 repositories
