
chore: avoid some unnecessary hash lookups #7576

Open

fengjiachun wants to merge 1 commit into GreptimeTeam:main from fengjiachun:feat/avoid_hash_lookups

Conversation

@fengjiachun
Collaborator

I hereby agree to the terms of the GreptimeDB CLA.

Refer to a related PR or issue link (optional)

What's changed and what's your intention?

As @copilot said

PR Checklist

Please convert it to a draft if some of the following conditions are not met.

  • I have written the necessary rustdoc comments.
  • I have added the necessary unit tests and integration tests.
  • This PR requires documentation updates.
  • API changes are backward compatible.
  • Schema or data changes are backward compatible.

Signed-off-by: jeremyhi <fengjiachun@gmail.com>
@gemini-code-assist
Contributor

Important

Installation incomplete: to start using Gemini Code Assist, please ask the organization owner(s) to visit the Gemini Code Assist Admin Console and sign the Terms of Service.

@fengjiachun fengjiachun requested a review from Copilot January 15, 2026 08:24
@github-actions github-actions bot added the docs-not-required This change does not impact docs. label Jan 15, 2026
@fengjiachun
Collaborator Author

@codex review

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. Nice work!


Contributor

Copilot AI left a comment


Pull request overview

This pull request optimizes the convert_bulk_part function by pre-computing column indices to avoid repeated hash map lookups during batch processing. The changes improve performance without altering the function's behavior.

Changes:

  • Pre-compute column indices for primary key, field, timestamp, and sparse primary key dictionary columns before the main processing loops
  • Use SmallVec instead of Vec for primary key values to avoid heap allocations for typical small primary key sizes (≤16 columns)
  • Pre-compute column ID and vector pairs to avoid repeated zip operations in the row processing loop
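The index pre-computation described above can be sketched as follows. This is a minimal, self-contained illustration of the technique, not GreptimeDB's actual `convert_bulk_part` code; the names `precompute_indices`, `schema`, and `wanted` are hypothetical.

```rust
use std::collections::HashMap;

/// Hypothetical sketch: resolve each wanted column name to its index once,
/// so the hot per-row loop uses plain integer indexing instead of hashing
/// the column name for every row.
fn precompute_indices(schema: &[&str], wanted: &[&str]) -> Vec<usize> {
    // Build the name -> index map once.
    let by_name: HashMap<&str, usize> = schema
        .iter()
        .enumerate()
        .map(|(i, name)| (*name, i))
        .collect();
    // One hash lookup per wanted column, outside the per-row loop.
    wanted.iter().map(|name| by_name[name]).collect()
}

fn main() {
    let schema = ["ts", "host", "cpu", "mem"];
    let pk_col_indices = precompute_indices(&schema, &["host"]);
    let field_col_indices = precompute_indices(&schema, &["cpu", "mem"]);

    // The per-row loop now does no hashing at all.
    for row in [["1", "a", "0.5", "0.7"], ["2", "b", "0.6", "0.8"]] {
        let _pk: Vec<&str> = pk_col_indices.iter().map(|&i| row[i]).collect();
        let _fields: Vec<&str> = field_col_indices.iter().map(|&i| row[i]).collect();
    }
    assert_eq!(pk_col_indices, vec![1]);
    println!("pk indices: {:?}", pk_col_indices);
}
```

With N rows and K referenced columns, this turns N×K hash lookups into K lookups up front plus N×K array indexes.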


@fengjiachun
Collaborator Author

@v0y4g3r @evenyag PTAL

Comment on lines +817 to +819
```rust
// Pre-compute column indices.
// For sparse encoding, primary key columns are not in the input schema (already encoded).
let pk_col_indices = if !is_sparse {
```
Contributor


Can we add a benchmark for it to see how much the improvement is?
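A criterion benchmark against the real `convert_bulk_part` would be the idiomatic way to answer this in the repo. As a quick std-only illustration of what such a measurement compares (all names here are hypothetical, not GreptimeDB code):

```rust
use std::collections::HashMap;
use std::hint::black_box;
use std::time::Instant;

fn main() {
    let schema: Vec<String> = (0..16).map(|i| format!("col{i}")).collect();
    let by_name: HashMap<&str, usize> =
        schema.iter().enumerate().map(|(i, n)| (n.as_str(), i)).collect();
    let wanted = ["col3", "col7", "col11"];
    let rows = 1_000_000;

    // Baseline: hash every wanted column name on every row.
    let t = Instant::now();
    let mut acc = 0usize;
    for _ in 0..rows {
        for name in wanted {
            acc += by_name[black_box(name)];
        }
    }
    let per_row = t.elapsed();

    // Optimized: resolve indices once, then use plain integer reads.
    let idx: Vec<usize> = wanted.iter().map(|n| by_name[n]).collect();
    let t = Instant::now();
    let mut acc2 = 0usize;
    for _ in 0..rows {
        for &i in &idx {
            acc2 += black_box(i);
        }
    }
    let precomputed = t.elapsed();

    // Both paths must compute the same result.
    assert_eq!(acc, acc2);
    println!("per-row lookups: {per_row:?}, precomputed: {precomputed:?}");
}
```

`black_box` keeps the optimizer from eliding the lookups; absolute numbers depend on the machine and build flags, which is why a criterion benchmark with proper warm-up would be the better evidence for the PR.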

```rust
        .as_str(),
)
.context(ColumnNotFoundSnafu {
    column: &region_metadata.time_index_column().column_schema.name,
```
Member


nit: with_context

time_index_column has one hash lookup inside it, so building the context eagerly pays that cost even on the success path.
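The eager-vs-lazy distinction behind this nit (snafu's `.context(...)` evaluates its selector fields up front, while `.with_context(|_| ...)` defers them to the error path) has a direct stdlib analogue in `Option::ok_or` vs `Option::ok_or_else`, sketched here with hypothetical `lookup_*` helpers:

```rust
use std::collections::HashMap;

// Eager: the error string (and any lookups needed to build it) is
// constructed even when the key is found.
fn lookup_eager(map: &HashMap<String, usize>, key: &str) -> Result<usize, String> {
    map.get(key).copied().ok_or(format!("column not found: {key}"))
}

// Lazy: the closure runs only when the key is missing, so the success
// path skips the cost of building the error context.
fn lookup_lazy(map: &HashMap<String, usize>, key: &str) -> Result<usize, String> {
    map.get(key)
        .copied()
        .ok_or_else(|| format!("column not found: {key}"))
}

fn main() {
    let mut map = HashMap::new();
    map.insert("ts".to_string(), 0usize);
    assert_eq!(lookup_eager(&map, "ts"), Ok(0));
    assert_eq!(
        lookup_lazy(&map, "host"),
        Err("column not found: host".to_string())
    );
    println!("ok");
}
```

In the snippet under review, switching to `with_context` would move the `time_index_column()` hash lookup off the success path in the same way.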

@killme2008
Member

What's the status of this PR? @fengjiachun

@fengjiachun
Collaborator Author

What's the status of this PR? @fengjiachun

Will work on it (will add benchmark).


Labels

docs-not-required This change does not impact docs. size/S


5 participants