Fix typos #1240

kianmeng · 2025-11-12T15:53:13Z

Found via codespell -S *.json,*.lock,tests_data -L hve,afterall and typos --hidden --format brief

Found via `codespe -S *.json,*.lock,tests_data -L hve,afterall` and `typos --hidden --format brief`

gemini-code-assist

Code Review

This pull request fixes a number of typos across the codebase. The changes are mostly correct. I've found a couple of minor issues:

In python/scripts/test_magika_model.py, a typo was fixed but replaced with another incorrect word. I've suggested a fix.
In python/CHANGELOG.md, the numbering of a list is inconsistent, which I've pointed out for better readability.

Overall, good catch on the typos!

gemini-code-assist · 2025-11-12T15:54:12Z

python/CHANGELOG.md

  ```

-1. **CLI Output Format Change (v0.6.0):** The JSON output format of the CLI has changed. Those changes are analogous to the changes to the Python APIs. The `score` field is now at the top level, alongside `dl` and `output`, and is no longer nested within `dl` or `output`. The output also includes `is_text` and `extensions` fields. The `magic` metadata has been removed as it was often either incorrect or reduntant; use `description` instead. Moreover, similarly to what happens under the hood with the `StatusOr` pattern, `result.status` indicates whether the prediction was successful, and the prediction results are available under the `result.value` key.
+1. **CLI Output Format Change (v0.6.0):** The JSON output format of the CLI has changed. Those changes are analogous to the changes to the Python APIs. The `score` field is now at the top level, alongside `dl` and `output`, and is no longer nested within `dl` or `output`. The output also includes `is_text` and `extensions` fields. The `magic` metadata has been removed as it was often either incorrect or redundant; use `description` instead. Moreover, similarly to what happens under the hood with the `StatusOr` pattern, `result.status` indicates whether the prediction was successful, and the prediction results are available under the `result.value` key.


While this PR fixes a typo, I noticed that the numbered list items in the 'Breaking Changes and Migration Guide' section all start with 1.. For better readability of the raw markdown file, it would be clearer to number them sequentially. This item should be 2., and subsequent items in this list should be numbered 3., 4., and so on.

Suggested change

1. **CLI Output Format Change (v0.6.0):** The JSON output format of the CLI has changed. Those changes are analogous to the changes to the Python APIs. The `score` field is now at the top level, alongside `dl` and `output`, and is no longer nested within `dl` or `output`. The output also includes `is_text` and `extensions` fields. The `magic` metadata has been removed as it was often either incorrect or redundant; use `description` instead. Moreover, similarly to what happens under the hood with the `StatusOr` pattern, `result.status` indicates whether the prediction was successful, and the prediction results are available under the `result.value` key.

2. **CLI Output Format Change (v0.6.0):** The JSON output format of the CLI has changed. Those changes are analogous to the changes to the Python APIs. The `score` field is now at the top level, alongside `dl` and `output`, and is no longer nested within `dl` or `output`. The output also includes `is_text` and `extensions` fields. The `magic` metadata has been removed as it was often either incorrect or redundant; use `description` instead. Moreover, similarly to what happens under the hood with the `StatusOr` pattern, `result.status` indicates whether the prediction was successful, and the prediction results are available under the `result.value` key.

gemini-code-assist · 2025-11-12T15:54:13Z

python/scripts/test_magika_model.py

+            predicated_content_type = mr.prediction.output.label
            expected_content_type = test_path.parent.name
-            if prediceted_content_type != expected_content_type:
+            if predicated_content_type != expected_content_type:
                with_error = True
                log_error(
-                    f'{test_path} predicted as "{prediceted_content_type}" (score: {mr.prediction.score:.4f}), expected "{expected_content_type}".'
+                    f'{test_path} predicted as "{predicated_content_type}" (score: {mr.prediction.score:.4f}), expected "{expected_content_type}".'
                )


This fixes one typo (prediceted), but introduces another. The correct spelling is predicted, not predicated. 'Predicated' means 'based on', while 'predicted' refers to a future outcome, which is what the model does.

Suggested change

predicated_content_type = mr.prediction.output.label

expected_content_type = test_path.parent.name

if prediceted_content_type != expected_content_type:

if predicated_content_type != expected_content_type:

with_error = True

log_error(

f'{test_path} predicted as "{prediceted_content_type}" (score: {mr.prediction.score:.4f}), expected "{expected_content_type}".'

f'{test_path} predicted as "{predicated_content_type}" (score: {mr.prediction.score:.4f}), expected "{expected_content_type}".'

)

predicted_content_type = mr.prediction.output.label

expected_content_type = test_path.parent.name

if predicted_content_type != expected_content_type:

with_error = True

log_error(

f'{test_path} predicted as "{predicted_content_type}" (score: {mr.prediction.score:.4f}), expected "{expected_content_type}".'

)

reyammer · 2025-11-13T20:37:09Z

Thanks! This touches many files, and I'll first need to update the versioning of the packages as well. I'll get to it.

BTW, is the git diff coming from just using those tools above automatically, or you used the tools and then you manually checked what were reasonable findings? I'm wondering whether we could add these commands as part of our CI...

kianmeng · 2025-11-17T13:59:01Z

Thanks! This touches many files, and I'll first need to update the versioning of the packages as well. I'll get to it.

Noted.

BTW, is the git diff coming from just using those tools above automatically, or you used the tools and then you manually checked what were reasonable findings? I'm wondering whether we could add these commands as part of our CI...

Manually check one by one, as suggested by the tools. Corrections were applied based on my understanding of the context or intended usage of the doc/code.

Fix typos

43ff590

Found via `codespe -S *.json,*.lock,tests_data -L hve,afterall` and `typos --hidden --format brief`

kianmeng requested review from invernizzi and reyammer as code owners November 12, 2025 15:53

gemini-code-assist bot reviewed Nov 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix typos #1240

Fix typos #1240

Uh oh!

kianmeng commented Nov 12, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 12, 2025

Uh oh!

gemini-code-assist bot Nov 12, 2025

Uh oh!

reyammer commented Nov 13, 2025

Uh oh!

kianmeng commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix typos #1240

Are you sure you want to change the base?

Fix typos #1240

Uh oh!

Conversation

kianmeng commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

reyammer commented Nov 13, 2025

Uh oh!

kianmeng commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kianmeng commented Nov 12, 2025 •

edited

Loading