Skip to content
Change the repository type filter

All

    Repositories list

    • vllm-fork

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k851065Updated Nov 22, 2025Nov 22, 2025
    • Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
      Python
      268604Updated Nov 21, 2025Nov 21, 2025
    • slurm

      Public
      Slurm: A Highly Scalable Workload Manager
      C
      747301Updated Nov 20, 2025Nov 20, 2025
    • Python
      4815130Updated Nov 19, 2025Nov 19, 2025
    • gohlml

      Public
      HABANA Management Library bindings for Go
      Go
      4302Updated Oct 29, 2025Oct 29, 2025
    • vllm-tutorials

      Public
      Shell
      3100Updated Oct 22, 2025Oct 22, 2025
    • hccl_demo

      Public
      C++
      202402Updated Oct 9, 2025Oct 9, 2025
    • drivers.accel.habanalabs.kernel

      Public
      C
      0210Updated Sep 27, 2025Sep 27, 2025
    • NIC drivers (Ethernet, IBverbs and common) for the NIC IP that is inside Intel's data-center GPU
      C
      2108Updated Sep 24, 2025Sep 24, 2025
    • Model-References

      Public
      Reference models for Intel(R) Gaudi(R) AI Accelerator
      Python
      9016813Updated Sep 23, 2025Sep 23, 2025
    • Gaudi-tutorials

      Public archive
      Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/
      Jupyter Notebook
      526266Updated Sep 18, 2025Sep 18, 2025
    • Setup and Installation Instructions for Habana binaries, docker image creation
      Python
      192767Updated Sep 17, 2025Sep 17, 2025
    • perftest

      Public
      Gaudi RDMA Performance Test
      Python
      2000Updated Sep 16, 2025Sep 16, 2025
    • C++
      51710Updated Sep 5, 2025Sep 5, 2025
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      26k405Updated Sep 4, 2025Sep 4, 2025
    • C++
      1200Updated Sep 4, 2025Sep 4, 2025
    • AutoGPTQ

      Public
      An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
      Python
      528002Updated Sep 4, 2025Sep 4, 2025
    • SGLang is a fast serving framework for large language models and vision language models.
      Python
      3.5k000Updated Sep 4, 2025Sep 4, 2025
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      4.6k1403Updated Sep 4, 2025Sep 4, 2025
    • Ongoing research training transformer models at scale
      Python
      3.3k500Updated Sep 4, 2025Sep 4, 2025
    • HCL

      Public
      C++
      71000Updated Jul 31, 2025Jul 31, 2025
    • Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3
      C++
      71011Updated Jun 17, 2025Jun 17, 2025
    • Apptainer: Application containers for Linux
      Go
      162000Updated Jun 13, 2025Jun 13, 2025
    • Habana_Custom_Kernel

      Public
      Provides the examples to write and build Habana custom kernels using the HabanaTools
      C++
      252433Updated Apr 15, 2025Apr 15, 2025
    • The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
      Python
      3.6k1015Updated Apr 11, 2025Apr 11, 2025
    • OxM-Specific-Solutions

      Public
      0000Updated Apr 7, 2025Apr 7, 2025
    • SynapseAI_Core

      Public archive
      SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
      C
      64220Updated Feb 3, 2025Feb 3, 2025
    • pyhlml

      Public archive
      Python
      0100Updated Feb 3, 2025Feb 3, 2025
    • DL1-Workshop

      Public archive
      Jupyter Notebook
      0200Updated Feb 3, 2025Feb 3, 2025
    • TOWL

      Public
      HTML
      3300Updated Jan 16, 2025Jan 16, 2025