GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow

Simon Boeder¹,², Fabian Gigengack¹, Benjamin Risse²
¹Bosch Research, ²University of Münster

ICCV 2025

Visualization videos can be found at assets/.

Installation

1. Create virtual env

conda create -n gaussianflowocc python=3.8 -y
conda activate gaussianflowocc

2. Install Repository

Please make sure to have CUDA 11.3 installed and in your PATH.

# install pytorch
pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0+cu113 -f https://download.pytorch.org/whl/torch_stable.html

# install openmim, used for installing mmcv
pip install -U openmim

# install mmcv
mim install mmcv-full==1.6.0 -f https://download.openmmlab.com/mmcv/dist/cu113/torch1.11.0/index.html

# install mmdet and ninja
pip install mmdet==2.25.1 ninja==1.11.1

# Install GaussianFlowOcc (as mmdet3d fork)
pip install -v -e .

# Install gsplat
pip install git+https://github.com/nerfstudio-project/[email protected]

# install GroundedSAM (for pseudo labels)
python -m pip install -e groundedsam/segment_anything
python -m pip install -e groundedsam/GroundingDINO
pip install diffusers transformers accelerate scipy safetensors
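
After the installation, it can help to confirm that the pinned versions are the ones that ended up in the environment. The following is a minimal sketch, not part of the repository: it reads the installed versions through the standard-library importlib.metadata module and compares them with the pins copied from the commands above. The helper names installed_version and find_mismatches are hypothetical, and the check assumes the installed version strings match the pins exactly.

```python
from importlib import metadata

# Version pins copied from the install commands above.
PINS = {
    "torch": "1.11.0+cu113",
    "torchvision": "0.12.0+cu113",
    "torchaudio": "0.11.0+cu113",
    "mmcv-full": "1.6.0",
    "mmdet": "2.25.1",
    "ninja": "1.11.1",
}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Map each package whose installed version differs from its pin to (pinned, found)."""
    mismatches = {}
    for package, pinned in pins.items():
        found = lookup(package)
        if found != pinned:
            mismatches[package] = (pinned, found)
    return mismatches


if __name__ == "__main__":
    for package, (pinned, found) in find_mismatches(PINS).items():
        print(f"{package}: expected {pinned}, found {found or 'not installed'}")
```

Run inside the activated gaussianflowocc environment, the script prints nothing when every pin matches.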

Data Preparation

1. Please create the directories ./data and ./ckpts in the root directory of the repository.
2. Download nuScenes, including the nuScenes-panoptic files [https://www.nuscenes.org/download].
3. Download the Occ3D-nuScenes dataset from [https://github.com/Tsinghua-MARS-Lab/Occ3D]. The download link can be found in their README.md.
4. Generate the annotation files. By default, this writes the annotation files into the ./data directory. The process can take up to about 1 hour.

python tools/create_data_bevdet.py

5. Copy or softlink the files into the ./data directory. The structure of the data directory should be as follows:

gaussianflowocc
    ├──data
    │   ├── nuscenes
    │   │  ├── v1.0-trainval (Step 2, nuScenes+nuScenes-panoptic files)
    │   │  ├── sweeps (Step 2, nuScenes files)
    │   │  ├── samples (Step 2, nuScenes files)
    │   │  └── panoptic (Step 2, nuScenes-panoptic files)
    │   ├── gts (Step 3)
    │   ├── bevdetv2-nuscenes_infos_train.pkl (Step 4)
    │   ├── bevdetv2-nuscenes_infos_val.pkl (Step 4)
    │   ├── bevdetv2-nuscenes_infos_test.pkl (Step 4)
    │   ├── metric_3d_nusc (see "Generate Pseudo-Labels")
    │   └── groundedsam (see "Generate Pseudo-Labels")
    ├──ckpts
    └──...
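
Before starting a long run, the layout above can be checked programmatically. The sketch below is an illustration under assumptions, not a repository tool: the entry list is transcribed from the tree, and missing_entries is a hypothetical helper. The metric_3d_nusc and groundedsam entries will only be present once the pseudo-label directories have been created.

```python
from pathlib import Path

# Entries taken from the directory tree above, relative to ./data.
EXPECTED = [
    "nuscenes/v1.0-trainval",
    "nuscenes/sweeps",
    "nuscenes/samples",
    "nuscenes/panoptic",
    "gts",
    "bevdetv2-nuscenes_infos_train.pkl",
    "bevdetv2-nuscenes_infos_val.pkl",
    "bevdetv2-nuscenes_infos_test.pkl",
    "metric_3d_nusc",
    "groundedsam",
]


def missing_entries(data_root, expected=EXPECTED):
    """Return the expected entries that do not exist under data_root.

    Path.exists() follows symlinks, so softlinked directories count as
    present as long as their targets exist.
    """
    root = Path(data_root)
    return [entry for entry in expected if not (root / entry).exists()]


if __name__ == "__main__":
    for entry in missing_entries("data"):
        print(f"missing: data/{entry}")
```

Run from the repository root; no output means every listed entry was found.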

Generate Pseudo-Labels

Please create two directories, metric_3d_nusc and groundedsam, in a location with enough disk space and softlink them into ./data, because the following scripts write their output to these locations. Afterwards, the ./data directory should match the tree above.
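
One possible way to set this up is sketched below. It assumes a Linux-like system where symlinks are available. The helper link_label_dirs and its storage_root argument are hypothetical. The size figures are the approximate values stated in the two steps that follow, and the free-space check only prints a warning.

```python
import os
import shutil
from pathlib import Path

# Approximate sizes stated in the pseudo-label steps of this README, in GB.
REQUIRED_GB = {"metric_3d_nusc": 550, "groundedsam": 276}


def link_label_dirs(storage_root, data_root="data", required_gb=REQUIRED_GB):
    """Create the pseudo-label directories under storage_root and softlink them into data_root."""
    storage = Path(storage_root).resolve()
    storage.mkdir(parents=True, exist_ok=True)
    data = Path(data_root)
    data.mkdir(parents=True, exist_ok=True)

    # Warn (but do not abort) if the target disk looks too small.
    free_gb = shutil.disk_usage(storage).free / 1e9
    needed_gb = sum(required_gb.values())
    if free_gb < needed_gb:
        print(f"warning: {free_gb:.0f} GB free on {storage}, about {needed_gb} GB needed")

    for name in required_gb:
        target = storage / name
        target.mkdir(parents=True, exist_ok=True)
        link = data / name
        # Leave existing directories or links untouched.
        if not link.exists() and not link.is_symlink():
            os.symlink(target, link, target_is_directory=True)
    return [data / name for name in required_gb]
```

For example, link_label_dirs("/path/to/large/disk") run from the repository root would create both directories on that disk and link them as ./data/metric_3d_nusc and ./data/groundedsam.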

1. Pseudo Depth

First, we generate the pseudo depth labels using Metric3D (~550 GB).

python tools/generate_m3d_nusc.py

You can parallelize this by starting multiple runs and specifying a scene range for each run with --scene-prefix. Example:

python tools/generate_m3d_nusc.py --scene-prefix scene-00 scene-01 scene-02
python tools/generate_m3d_nusc.py --scene-prefix scene-03 scene-04 scene-05
python tools/generate_m3d_nusc.py --scene-prefix scene-06 scene-07 scene-08
python tools/generate_m3d_nusc.py --scene-prefix scene-09 scene-10 scene-11
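
The command lines above can also be generated for any number of parallel runs. The sketch below is illustrative only: split_prefixes and build_commands are hypothetical helpers, and the prefix list scene-00 to scene-11 simply mirrors the example above. Only the script path and the --scene-prefix flag are taken from this README.

```python
def split_prefixes(prefixes, num_runs):
    """Split prefixes into at most num_runs contiguous chunks whose sizes differ by at most one."""
    base, extra = divmod(len(prefixes), num_runs)
    chunks, start = [], 0
    for i in range(num_runs):
        size = base + (1 if i < extra else 0)
        if size:  # skip empty chunks when there are more runs than prefixes
            chunks.append(prefixes[start:start + size])
        start += size
    return chunks


def build_commands(prefixes, num_runs, script="tools/generate_m3d_nusc.py"):
    """Return one command string per run, using the --scene-prefix flag from the example above."""
    return [
        f"python {script} --scene-prefix {' '.join(chunk)}"
        for chunk in split_prefixes(prefixes, num_runs)
    ]


if __name__ == "__main__":
    # The prefixes scene-00 ... scene-11 mirror the example above.
    prefixes = [f"scene-{i:02d}" for i in range(12)]
    for command in build_commands(prefixes, num_runs=4):
        print(command)
```

With 12 prefixes and 4 runs this reproduces the four example commands. Each printed line can then be started in its own shell or job.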

2. Pseudo Semantics

Next, we generate the pseudo semantic labels using GroundedSAM (~276 GB).

Download the SAM checkpoint

# Download SAM checkpoint into ./ckpts
cd ckpts
wget https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth 
cd ..

Run the generation script:

# single GPU
python3 groundedsam/generate_grounded_sam.py --single-gpu
# multi GPU with 4 GPUs
python -m torch.distributed.launch --nproc_per_node 4 --master_port 29582 groundedsam/generate_grounded_sam.py

As with the pseudo depth labels, you can run multiple generation scripts simultaneously and restrict each run to a certain range of scenes by using the --scene-prefixes argument.
If you would also like to generate the masks for the validation set, use the --split val argument.

Train model

We provide configuration files for training our model with or without pseudo depth labels.

# In the root directory of the repository:
# single gpu
python tools/train.py configs/gaussianflowocc.py
# multiple gpu (replace "num_gpu" with the number of available GPUs) - 4 GPUs are recommended.
./tools/dist_train.sh configs/gaussianflowocc.py num_gpu

In our experiments, we use 4 GPUs. Due to some non-deterministic operations, the results may deviate slightly (up or down) from the results presented in the paper.

Evaluate model

After training, you can evaluate the model on Occ3D-nuScenes.

# mIoU & IoU on Occ3D-nuScenes
python tools/test.py configs/gaussianflowocc.py work_dirs/gaussianflowocc/epoch_18_ema.pth --eval mIoU
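
For readers unfamiliar with the metric, the sketch below shows how a mean IoU is commonly computed from per-voxel class labels. It is a generic illustration and not the repository's evaluation code. The function names and the optional mask argument are assumptions, and the exact Occ3D-nuScenes protocol (which classes are averaged, how visibility masks are applied) is not described in this README and may differ.

```python
def per_class_iou(pred, gt, num_classes, mask=None):
    """Return one IoU per class for flat label sequences; None where a class never occurs."""
    tp = [0] * num_classes
    fp = [0] * num_classes
    fn = [0] * num_classes
    for i, (p, g) in enumerate(zip(pred, gt)):
        if mask is not None and not mask[i]:
            continue  # voxel excluded from evaluation
        if p == g:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted class p where it is not present
            fn[g] += 1  # missed ground-truth class g
    ious = []
    for c in range(num_classes):
        union = tp[c] + fp[c] + fn[c]
        ious.append(tp[c] / union if union else None)
    return ious


def mean_iou(ious):
    """Average the IoU values of all classes that occur at least once."""
    valid = [value for value in ious if value is not None]
    return sum(valid) / len(valid) if valid else float("nan")


if __name__ == "__main__":
    pred = [0, 0, 1, 1, 2]
    gt = [0, 1, 1, 1, 2]
    print(mean_iou(per_class_iou(pred, gt, num_classes=3)))
```

In the toy example, the per-class IoUs are 1/2, 2/3 and 1, so the mean is 13/18.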

To evaluate the RayIoU metric, first run the standard evaluation with the extra flag --save-occ-path to store the predictions. Afterwards, run the RayIoU evaluation script.

# Store predictions
python tools/test.py configs/gaussianflowocc.py work_dirs/gaussianflowocc/epoch_18_ema.pth --eval mIoU --save-occ-path ./occ/gaussianflowocc

# Run RayIoU eval
python tools/eval_ray_mIoU.py --pred-dir ./occ/gaussianflowocc

You can also increase the influence range of each Gaussian during voxelization with the --nbh parameter. This can improve accuracy at the cost of runtime performance. By default, --nbh is set to 4.

# mIoU & IoU on Occ3D-nuScenes with max_neighborhood of 5
python tools/test.py configs/gaussianflowocc.py work_dirs/gaussianflowocc/epoch_18_ema.pth --eval mIoU --nbh 5

Resume Runs

If training is interrupted at any point and you want to resume from a checkpoint, use the --resume-from argument as follows:

./tools/dist_train.sh configs/gaussianflowocc.py num_gpu --resume-from /path/to/checkpoint/latest.pth

The checkpoints are usually saved under the work_dirs directory. By default, a checkpoint is created every 6 epochs.
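
If latest.pth is not available, the most recent epoch checkpoint can be located with a short script. The sketch below assumes that checkpoints are named epoch_N.pth, which is the usual MMCV convention but is not stated in this README. The helper newest_epoch_checkpoint is hypothetical. Epoch numbers are compared numerically, because sorting by file name would rank epoch_6.pth after epoch_18.pth.

```python
import re
from pathlib import Path

# Matches files such as epoch_6.pth; EMA files like epoch_18_ema.pth are skipped.
EPOCH_PATTERN = re.compile(r"^epoch_(\d+)\.pth$")


def newest_epoch_checkpoint(work_dir):
    """Return the epoch_N.pth file with the largest N in work_dir, or None if there is none."""
    best_epoch, best_path = -1, None
    for path in Path(work_dir).glob("epoch_*.pth"):
        match = EPOCH_PATTERN.match(path.name)
        if match and int(match.group(1)) > best_epoch:
            best_epoch, best_path = int(match.group(1)), path
    return best_path


if __name__ == "__main__":
    print(newest_epoch_checkpoint("work_dirs/gaussianflowocc"))
```

The printed path can then be passed to --resume-from.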
