gnss_gpu

GPU-accelerated GNSS positioning for the urban canyon — particle filters, ray-traced NLOS, and factor-graph experiments in real cities.

Live results snapshot · Benchmarks · Examples · GSDC2023 solution · Experiment log · Decisions · How it's built

What is this?

gnss_gpu is a research workspace for pushing smartphone- and survey-grade GNSS positioning in dense cities, where buildings block and reflect satellite signals and classic EKF/RTK pipelines fall apart. It pairs CUDA/C++ kernels with Python tooling to run GPU particle filters, double-difference carrier tracking, ray-traced line-of-sight checks against 3D city meshes, and factor-graph optimization — then scores them honestly against RTKLIB and EKF baselines on real public datasets (UrbanNav, PLATEAU, and the GSDC2023 Kaggle smartphone-decimeter challenge).

Why you might care

🛰️ It beats the classic baseline where it hurts most. On UrbanNav Tokyo Odaiba, the PF 100K (DD + smoother + stop-detect) filter reaches 1.36 m P50 / 4.11 m RMS versus RTKLIB demo5 at 2.67 m / 13.08 m over 12,228 aligned epochs — a 49% better median and 69% better RMS.
⚡ It's genuinely fast. A full 1,000,000-particle filter step (predict → weight → resample → estimate) runs in 81 ms (≈12 Hz) on a consumer Ada GPU; a 10,000-epoch batch WLS solve takes ~1 ms. See benchmarks/RESULTS.md.
🏙️ City-aware NLOS handling. Ray tracing against PLATEAU 3D building meshes does line-of-sight / non-line-of-sight classification with a 57.8× BVH speedup, so urban multipath can be rejected instead of trusted.
📈 Honest, reproducible scoring. Every headline number comes from a fixed same-input/same-metric comparison, and the live snapshot is regenerated straight from the committed result CSVs.

Results at a glance

Method	Dataset	P50	RMS 2D
PF 100K (DD + smoother + stop-detect)	UrbanNav Tokyo Odaiba	1.36 m	4.11 m
RTKLIB demo5	UrbanNav Tokyo Odaiba	2.67 m	13.08 m
PF + RobustClear-10K (external mainline)	UrbanNav, 5 seq / 2 cities	—	66.6 m
EKF baseline	UrbanNav, 5 seq / 2 cities	—	93.25 m

Particle-count scaling: PF crosses EKF near 1K particles

The external-validation RMS is high in absolute terms because it averages the hardest deep-urban sequences (including failure stretches). The point is the relative gap: the GPU PF stack consistently wins against EKF and RTKLIB on the same epochs. Full tables, figures, and limitations live on the results snapshot.

Particle-filter localization on OpenStreetMap

The README headline is not just a table: the sampled particle cloud is localized on the real street network, with the posterior contracting around the driven UrbanNav route while the full-view trail is drawn from the continuous trajectory.

Open the Odaiba particle-cloud video

For the zero-data terminal demo behind this visual:

PYTHONPATH=python:. python3 examples/demo_pf_localization_improvement.py

It reads checked-in artifacts and prints the UrbanNav Odaiba PF-vs-RTKLIB improvement plus the PLATEAU LOS/NLOS mask replay gain for PF.

Ray-traced NLOS diffraction on real city data

Beyond rejecting blocked satellites, the package models why an urban pseudorange is biased — knife-edge (ITU-R P.526) and UTD (Kouyoumjian–Pathak) diffraction plus specular reflection over PLATEAU 3D building meshes — and scores the physics against real UrbanNav residuals.

Open the full LOS/NLOS deck.gl sweep

A subtle but decisive step is correcting each satellite to signal-transmission time (with the Sagnac rotation). Without it a per-satellite tens-of-metres range error swamps the multipath signal; with it the residual becomes a clean NLOS ground truth (LOS median 1.0 m, AUC 0.92). On that clean reference, UTD reproduces the measured multipath-bias distribution better than knife-edge — reproducing the literature (Zhang & Hsu, 2021) on properly corrected real data.

$Ray-traced NLOS diffraction (UTD vs knife-edge) vs real UrbanNav Odaiba residuals$

Diffraction model	Wasserstein-1 ↓	KS ↓
knife-edge (ITU-R P.526)	1.84	0.46
UTD (Kouyoumjian–Pathak)	1.70	0.29

UrbanNav Tokyo Odaiba, 60 epochs over a 249k-triangle PLATEAU mesh. Reproduce with PYTHONPATH=examples python examples/plot_nlos_diffraction_figure.py Odaiba 60 (uses the installed package's CUDA ray-tracing for line-of-sight checks).

Quick start

Zero install: run the urban-canyon demo — with sky plot and trajectory figures — straight in your browser:

Or locally:

git clone --recurse-submodules https://github.com/rsasaki0109/gnss_gpu.git
cd gnss_gpu

python3 -m venv .venv && source .venv/bin/activate
python3 -m pip install --upgrade pip
python3 -m pip install -r requirements.txt
python3 -m pip install pytest pandas scipy requests matplotlib plotly

Run the demo (no GPU, no data, ~1 second)

The fastest way to see what this repo is about. It simulates a car driving through an urban canyon where buildings block some satellites (NLOS multipath), then solves each epoch with plain least squares vs. the package's robust SPP solver:

PYTHONPATH=python python3 examples/demo_urban_canyon_sim.py

method                         P50 err     RMS err
--------------------------------------------------
naive WLS (L2)                 10.30 m     10.21 m
robust SPP (Cauchy)             2.00 m      2.39 m
--------------------------------------------------
robust vs naive: 81% better P50, 77% better RMS

Robust down-weighting of NLOS-biased measurements is the same idea the GPU particle-filter stack scales up to beat RTKLIB demo5 on real UrbanNav data.

Use the robust SPP solver from Python

For library code, the same CPU-only solver is available from the package top level:

import numpy as np
from gnss_gpu import robust_spp

sat_ecef = np.asarray(...)       # shape: (n_sat, 3), metres
pseudoranges = np.asarray(...)   # shape: (n_sat,), metres
weights = np.ones(len(pseudoranges))
coarse_ecef = np.asarray(...)    # shape: (3,), metres

position_ecef = robust_spp(
    sat_ecef,
    pseudoranges,
    weights=weights,
    init_pos=coarse_ecef,
    weight_func="cauchy",
    threshold=15.0,
)
if position_ecef is None:
    raise RuntimeError("SPP failed; check satellite count and geometry")

Bad input shapes, non-finite values, negative weights, and invalid solver options raise ValueError with messages that name the offending argument.

For a measurement-level NLOS simulator with explicit ray-cast building blockage, C/N0 attenuation, excess delay, and a geometry-aware SPP comparison:

PYTHONPATH=python python3 examples/demo_nlos_simulation.py
PYTHONPATH=python python3 examples/demo_plateau_nlos_simulation.py
PYTHONPATH=python python3 examples/demo_plateau_nlos_visualization.py
PYTHONPATH=python:. python3 experiments/run_plateau_nlos_demo_suite.py

The suite command exports the mask, replays SPP/PF/FGO, and writes combined JSON/Markdown/CSV summaries. The individual replay commands are:

Replay consumer	Baseline RMS	Mask-soft RMS	RMS gain
SPP	11.85 m	4.07 m	65.6%
PF	11.18 m	1.40 m	87.4%
local-FGO	8.10 m	0.38 m	95.4%

PYTHONPATH=python:. python3 experiments/export_plateau_nlos_demo_mask.py \
  --out-csv experiments/results/plateau_nlos_demo_mask.csv \
  --summary-json experiments/results/plateau_nlos_demo_mask_summary.json
PYTHONPATH=python:. python3 experiments/replay_plateau_nlos_demo_spp.py \
  --mask-csv experiments/results/plateau_nlos_demo_mask.csv \
  --summary-json experiments/results/plateau_nlos_demo_spp_replay_summary.json
PYTHONPATH=python:. python3 experiments/replay_plateau_nlos_demo_pf.py \
  --mask-csv experiments/results/plateau_nlos_demo_mask.csv \
  --summary-json experiments/results/plateau_nlos_demo_pf_replay_summary.json
PYTHONPATH=python:. python3 experiments/replay_plateau_nlos_demo_fgo.py \
  --mask-csv experiments/results/plateau_nlos_demo_mask.csv \
  --summary-json experiments/results/plateau_nlos_demo_fgo_replay_summary.json

The PLATEAU visualization is also checked into the Pages assets at docs/assets/media/demos/plateau_nlos_visualization.html. The exported mask CSV uses the existing experiment contract tow,epoch_idx,prn,is_los; the SPP, particle-filter, and local-FGO replays consume only that mask path and show mask-soft downstream estimators recovering the simulated NLOS error.

Run the test suite

The pure-Python helpers and experiment logic run without a GPU; tests that exercise the native CUDA kernels are skipped or fail until you build them (see below):

PYTHONPATH=python python3 -m pytest tests/ -q

Browse examples/ for runnable demos (acquisition, full pipeline, interference, urban PLATEAU, real-data replay, visualization). The GPU-accelerated demos import native modules, so build the kernels first.

Building the CUDA/C++ kernels

The native kernels back the signal-sim, particle-filter, ray-tracing, and multi-GNSS solver paths:

mkdir -p build && cd build
cmake .. -DCMAKE_CUDA_ARCHITECTURES=native
make -j"$(nproc)"
# then copy the generated .so files into python/gnss_gpu/

Once built, try a demo, e.g. signal simulation → acquisition round-trip:

PYTHONPATH=python python3 examples/demo_signal_sim.py

ROS 2 node

For outdoor robots, ros2/gnss_gpu_ros packages the trajectory-filtering ideas validated on GSDC2023 as a ROS 2 node: it gates multipath/NLOS spikes in sensor_msgs/NavSatFix streams (Hampel + CV Kalman) before they reach your fusion stack, and publishes an RViz-friendly path.

ros2 run gnss_gpu_ros robust_navsat_filter --ros-args -r fix:=/your_gnss_driver/fix

Repository layout

python/gnss_gpu/              Reusable Python package code
src/                          CUDA/C++ kernels and native bindings
examples/                     Runnable demos (start here)
benchmarks/                   GPU throughput benchmarks (+ RESULTS.md)
experiments/                  Experiment runners, sweeps, reports, one-off probes
experiments/results/          Generated CSV/HTML/plot outputs
docs/                         Generated visual snapshot site (the live demo)
ros2/gnss_gpu_ros/            ROS 2 robust NavSatFix filter node
internal_docs/                Working notes, decisions, handoffs, current state
third_party/gnssplusplus/     C++ GNSS/RTK/PPP/CLAS solver subproject
tests/                        Python tests for stable helpers and experiment logic

flowchart LR
    Data["PPC / UrbanNav / GSDC data"] --> Lib["libgnss++\nSPP/RTK/diagnostics"]
    Lib --> Floor[".pos / diagnostics\nhybrid floor and candidates"]
    Data --> GPU["gnss_gpu\nPF/RBPF/DD/FGO experiments"]
    Floor --> GPU
    GPU --> Score["honest scoring\nCSV/HTML reports\nKaggle/PPC artifacts"]

Where to look next

Goal	First place to look
See the live, regenerated results	Results snapshot site
Run a demo	`examples/`
Check GPU throughput	`benchmarks/RESULTS.md`
Continue current GSDC2023 Kaggle work	`internal_docs/plan.md`
Understand current PPC production state	`internal_docs/ppc_current_status.md`
Find durable decisions and negative results	`internal_docs/decisions.md`
Work on reusable Python code	`python/gnss_gpu/`
Work on native CUDA/C++ code	`src/`
Work on the C++ GNSS solver baseline	`third_party/gnssplusplus/README.md`

A note on scope

This is not a single polished application — it is intentionally experiment-first. Stable code lives in the library/native directories (python/gnss_gpu/, src/), while fast-moving runs, sweeps, generated reports, and Kaggle/PPC handoffs live in experiments/ and internal_docs/. Many CSV/HTML files are generated or local-only; before trusting one, check that it is listed in experiments/results/README.md and that its build command is recorded in internal_docs/plan.md.

Development policy

Keep stable reusable code in python/gnss_gpu/ or src/; keep variant-heavy logic in experiments/ until it survives fixed evaluation.
Do not promote a method because it wins one pilot split. Prefer same-input, same-metric comparisons over new abstractions.
Record durable decisions in internal_docs/decisions.md.
Do not vendor, link, or derive production code/config from GPL-3.0 reference sources such as gici-open.

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 469 Commits
.github		.github
benchmarks		benchmarks
data		data
docs		docs
examples		examples
experiments		experiments
include/gnss_gpu		include/gnss_gpu
internal_docs		internal_docs
output		output
python/gnss_gpu		python/gnss_gpu
ros2/gnss_gpu_ros		ros2/gnss_gpu_ros
scripts/ci		scripts/ci
src		src
tests		tests
third_party		third_party
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gnss_gpu

What is this?

Why you might care

Results at a glance

Particle-filter localization on OpenStreetMap

Ray-traced NLOS diffraction on real city data

Quick start

Run the demo (no GPU, no data, ~1 second)

Use the robust SPP solver from Python

Run the test suite

Building the CUDA/C++ kernels

ROS 2 node

Repository layout

Where to look next

A note on scope

Development policy

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

gnss_gpu

What is this?

Why you might care

Results at a glance

Particle-filter localization on OpenStreetMap

Ray-traced NLOS diffraction on real city data

Quick start

Run the demo (no GPU, no data, ~1 second)

Use the robust SPP solver from Python

Run the test suite

Building the CUDA/C++ kernels

ROS 2 node

Repository layout

Where to look next

A note on scope

Development policy

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages