HAMi DRA Webhook

A Kubernetes mutating webhook that converts GPU device resources to Dynamic Resource Allocation (DRA) ResourceClaims.

Overview

This webhook automatically transforms Pod specifications that request GPU resources (e.g., nvidia.com/gpu) into DRA ResourceClaims, enabling dynamic resource allocation for GPU workloads in Kubernetes.

Features

Automatic Resource Conversion: Converts GPU resource requests to ResourceClaims
Resource Cleanup: Automatically removes GPU resources from Pod specs and creates corresponding ResourceClaims
Annotation Support: Supports device selection via Pod annotations (UUID, device type)
Metrics Monitoring: Optional monitor component that collects and exposes GPU resource metrics via Prometheus

Installation

Prerequisites

Kubernetes 1.34 or later with the DRA Consumable Capacity feature gate (DRAConsumableCapacity) enabled
CDI enabled in the underlying container runtime (for example, containerd or CRI-O)
NVIDIA GPU driver 440 or later
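
The Kubernetes version requirement can be checked before installing. The following is a minimal sketch and not part of the project: `k8s_version_ok` is a hypothetical helper name, and the commented usage assumes `kubectl` and `jq` are available. It checks only the version number, not the feature gate or CDI.

```shell
#!/bin/sh
# Succeed when a Kubernetes version string (e.g. "v1.34.1") is 1.34 or later.
k8s_version_ok() {
  ver=${1#v}                 # strip a leading "v"
  major=${ver%%.*}           # text before the first dot
  rest=${ver#*.}             # text after the first dot
  minor=${rest%%.*}          # text before the next dot
  minor=${minor%%[!0-9]*}    # drop vendor suffixes such as "+"
  [ "$major" -gt 1 ] || { [ "$major" -eq 1 ] && [ "$minor" -ge 34 ]; }
}

# Example against a live cluster (assumes kubectl and jq are installed):
#   k8s_version_ok "$(kubectl version -o json | jq -r '.serverVersion.gitVersion')" \
#     && echo "Kubernetes version OK"
```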

Configure and install with Helm

Make sure cert-manager is installed before installing the webhook.

Then install the webhook with the following command:

helm install hami-dra ./charts/hami-dra

If you are not using the container drivers provided by gpu-operator, install the webhook with the following command instead:

helm install hami-dra ./charts/hami-dra \
--set drivers.nvidia.containerDriver=false

To disable the monitor component:

helm install hami-dra ./charts/hami-dra \
--set monitor.enabled=false
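
The `--set` overrides above can also be kept in a values file and passed with Helm's `-f` flag. This is a small sketch, assuming an arbitrary file name (`my-values.yaml`) and that both overrides are wanted together; drop whichever key does not apply.

```shell
# Write the overrides to a values file; the keys mirror the --set paths above.
cat > my-values.yaml <<'EOF'
drivers:
  nvidia:
    containerDriver: false
monitor:
  enabled: false
EOF

# Then install with the file (run from the repository root):
#   helm install hami-dra ./charts/hami-dra -f my-values.yaml
```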

After installation, request GPU resources in your Pod specs the same way you would with HAMi.

Configuration

Device Resources

Configure device resources in charts/hami-dra/values.yaml:

resourceName: "nvidia.com/gpu"
resourceMem: "nvidia.com/gpumem"
resourceCores: "nvidia.com/gpucores"
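
With these resource names, a workload requests GPUs through ordinary container resource limits and the webhook handles the conversion. The manifest below is an illustrative sketch: the Pod name, image, file location, and quantities are placeholders, and reading `nvidia.com/gpumem` as megabytes and `nvidia.com/gpucores` as a percentage follows HAMi's usual convention, which is assumed to apply here.

```shell
# Write an example Pod manifest (placeholder name, image, and quantities).
manifest="${TMPDIR:-/tmp}/gpu-pod.yaml"
cat > "$manifest" <<'EOF'
apiVersion: v1
kind: Pod
metadata:
  name: gpu-pod
spec:
  containers:
    - name: cuda
      image: nvidia/cuda:12.4.1-base-ubuntu22.04
      command: ["sleep", "infinity"]
      resources:
        limits:
          nvidia.com/gpu: 1         # number of GPUs
          nvidia.com/gpumem: 3000   # device memory (assumed to be in MB)
          nvidia.com/gpucores: 30   # share of GPU cores (assumed to be a percentage)
EOF

# Then create the Pod in a cluster where the webhook is installed:
#   kubectl apply -f "$manifest"
```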

Monitor Component

The monitor component is an optional feature that collects and exposes GPU resource metrics via Prometheus. It is enabled by default.

Quick Start:

Set the monitor service type to NodePort so that it can be reached from outside the cluster:

monitor:
  enabled: true
  service:
    type: NodePort
    nodePort:
      metrics: 31995

Access metrics:

# With NodePort
curl http://<node-ip>:31995/metrics

The response contains the GPU resource metrics exposed by the monitor.

For detailed configuration, metrics documentation, and Prometheus integration, see MONITOR.md.
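
To get a quick overview of what the endpoint exposes, the response can be reduced to a list of metric names. This is a minimal sketch, assuming the endpoint returns the standard Prometheus text format; `metric_names` is a hypothetical helper, not something shipped with the project.

```shell
# Print the distinct metric names found in Prometheus text-format input on stdin.
metric_names() {
  grep -v '^#' |        # drop HELP/TYPE comment lines
    grep -v '^$' |      # drop blank lines
    sed 's/[{ ].*//' |  # keep only the name before labels or the value
    sort -u
}

# Usage with the NodePort configured above:
#   curl -s "http://<node-ip>:31995/metrics" | metric_names
```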
