Skip to content
Change the repository type filter

All

    Repositories list

    • llmaz

      Public
      ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
      Go
      442693910Updated Nov 24, 2025Nov 24, 2025
    • 🎉 An awesome & curated list of best LLMOps tools.
      Python
      3117110Updated Nov 24, 2025Nov 24, 2025
    • ⚒️ AlphaTrion is an open-source framework to help build GenAI applications, including experiment tracking, adaptive model routing, prompt optimization and performance evaluation.
      Python
      31060Updated Nov 22, 2025Nov 22, 2025
    • website

      Public
      Website and Blogs.
      HTML
      1120Updated Nov 20, 2025Nov 20, 2025
    • omnistore

      Public
      🎯 An unified python client to communicate with various kinds of object-store providers.
      Python
      3230Updated Nov 18, 2025Nov 18, 2025
    • AMRS

      Public
      The adaptive model routing system for exploration and exploitation.
      Makefile
      41910Updated Nov 14, 2025Nov 14, 2025
    • .github

      Public
      1000Updated Nov 12, 2025Nov 12, 2025
    • template-repo

      Public template
      A template repo.
      1000Updated Sep 19, 2025Sep 19, 2025
    • karpenter

      Public
      This is a fork of Karpenter focused on GPU scaling with llmaz.
      Go
      388010Updated Jun 18, 2025Jun 18, 2025
    • Scheduler-Plugins maintains a list of kubernetes scheduler plugins used in InftyAI community.
      Go
      31110Updated Jun 16, 2025Jun 16, 2025
    • community

      Public
      The InftyAI community.
      1010Updated Apr 18, 2025Apr 18, 2025
    • Manta

      Public archive
      💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIX promise 🎯
      Go
      324141Updated Dec 6, 2024Dec 6, 2024
    • PR-Copilot

      Public archive
      Your AI pair programmer 🤖️ specialized in code review, code summary and even code completion. 🧑‍💻🐛
      Python
      3320Updated Jun 29, 2024Jun 29, 2024