The official backend system powering the LLM-perf Leaderboard. This repository contains the infrastructure and tools needed to run standardized benchmarks for Large Language Models (LLMs) across different hardware configurations and optimization backends.
LLM-perf Backend is designed to:
- Run automated benchmarks for the LLM-perf leaderboard
- Ensure consistent and reproducible performance measurements
- Support multiple hardware configurations and optimization backends
- Generate standardized performance metrics for latency, throughput, memory usage, and energy consumption
- Standardized benchmarking pipeline using Optimum-Benchmark (see the sketch after this list)
- Support for multiple hardware configurations (CPU, GPU)
- Multiple backend implementations (PyTorch, ONNX Runtime, etc.)
- Automated metric collection:
  - Latency and throughput measurements
  - Memory usage tracking
  - Energy consumption monitoring
- Quality metrics integration with the Open LLM Leaderboard
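Under the hood, each benchmark run is expressed as an Optimum-Benchmark configuration that pairs a launcher, a scenario, and a backend. The snippet below is a minimal sketch based on Optimum-Benchmark's Python API; the model, device, and scenario flags are illustrative placeholders and may not match the exact settings llm-perf-backend uses.

```python
from optimum_benchmark import (
    Benchmark,
    BenchmarkConfig,
    InferenceConfig,
    ProcessConfig,
    PyTorchConfig,
)

if __name__ == "__main__":
    # Launcher: run the benchmark in an isolated process.
    launcher_config = ProcessConfig()
    # Scenario: inference benchmark tracking latency, memory, and energy.
    scenario_config = InferenceConfig(latency=True, memory=True, energy=True)
    # Backend: PyTorch on CPU; no_weights uses randomly initialized weights
    # so the example does not need to download a checkpoint.
    backend_config = PyTorchConfig(model="gpt2", device="cpu", no_weights=True)

    benchmark_config = BenchmarkConfig(
        name="pytorch_gpt2",
        launcher=launcher_config,
        scenario=scenario_config,
        backend=backend_config,
    )
    benchmark_report = Benchmark.launch(benchmark_config)
    benchmark_report.log()  # print the latency/memory/energy sections of the report
```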
- Clone the repository:

```bash
git clone https://github.com/huggingface/llm-perf-backend
cd llm-perf-backend
```

- Create and activate a Python virtual environment:

```bash
python -m venv .venv
source .venv/bin/activate
```

- Install the package with the required dependencies:

```bash
pip install -e "."
# or
pip install -e ".[all]"  # to install optional dependencies like ONNX Runtime
```
Run benchmarks using the CLI tool:

```bash
llm-perf run-benchmark --hardware cpu --backend pytorch
```

View all the options with:

```bash
llm-perf run-benchmark --help
```
- `--hardware`: Target hardware platform (cpu, cuda)
- `--backend`: Backend framework to use (pytorch, onnxruntime, etc.)
Results are published to the official dataset: optimum-benchmark/llm-perf-leaderboard
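To inspect published results locally, they can typically be pulled with the `datasets` library. This is only a sketch: it assumes the dataset id above resolves directly on the Hugging Face Hub, and the actual repository may split results into several per-hardware/per-backend configurations with a different layout.

```python
from datasets import load_dataset

# Hypothetical usage: the dataset id comes from the section above, but the
# available configurations and splits may differ from this sketch.
results = load_dataset("optimum-benchmark/llm-perf-leaderboard", split="train")
print(results.column_names)
print(results[0])
```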
All benchmarks follow these standardized settings:
- Single GPU usage to avoid communication-dependent results
- Energy monitoring via CodeCarbon
- Memory tracking (see the sketch after this list):
  - Maximum allocated memory
  - Maximum reserved memory
  - Maximum used memory (via PyNVML for GPU)
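As a point of reference, the snippet below is an illustrative sketch of how energy and GPU memory readings of this kind can be collected with CodeCarbon, PyTorch, and PyNVML. It is not the repository's actual tracking code (Optimum-Benchmark handles this internally), and `run_workload` is a placeholder for the benchmarked inference.

```python
import pynvml
import torch
from codecarbon import EmissionsTracker


def run_workload() -> None:
    # Placeholder for the benchmarked inference workload.
    pass


# Energy monitoring with CodeCarbon.
tracker = EmissionsTracker(measure_power_secs=1)
tracker.start()
run_workload()
emissions_kg = tracker.stop()  # estimated emissions in kg CO2-equivalent
print(f"Estimated emissions: {emissions_kg} kg CO2eq")

if torch.cuda.is_available():
    # Maximum allocated / reserved memory as seen by the PyTorch allocator.
    print(f"Max allocated: {torch.cuda.max_memory_allocated() / 1024**2:.0f} MiB")
    print(f"Max reserved:  {torch.cuda.max_memory_reserved() / 1024**2:.0f} MiB")

    # Device-level used memory via PyNVML (a single reading; in practice it is
    # sampled repeatedly during the run to obtain the maximum).
    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)
    mem_info = pynvml.nvmlDeviceGetMemoryInfo(handle)
    print(f"Device memory used: {mem_info.used / 1024**2:.0f} MiB")
    pynvml.nvmlShutdown()
```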