Add support for ministral3 and mistral3 model types #860

sealad886 wants to merge 8 commits into ml-explore:main

Conversation
Co-authored-by: sealad886 <155285242+sealad886@users.noreply.github.com>
…prove tests, fix GGUF export
Co-authored-by: sealad886 <155285242+sealad886@users.noreply.github.com>
…tization and GGUF export
Pull request overview
This PR adds recognition and handling for the ministral3 and mistral3 model types across quantization, GGUF export gating, and unit tests so these model variants can be instantiated and processed consistently within mlx_lm.
Changes:
- Added ministral3/mistral3 entries to the AWQ model configuration mapping.
- Updated GGUF export gating to apply MODEL_REMAPPING and allow ministral3.
- Added unit tests for ministral3 and mistral3 model construction and prompt-cache creation.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| tests/test_models.py | Adds coverage for ministral3 and mistral3 instantiation plus prompt-cache construction. |
| mlx_lm/quant/awq.py | Extends the AWQ configuration mapping to support the new model types (including language_model nesting for mistral3). |
| mlx_lm/fuse.py | Uses MODEL_REMAPPING when checking GGUF export support and expands the allowlist. |
Comments suppressed due to low confidence (1)
mlx_lm/fuse.py:103

convert_to_gguf() derives RoPE metadata from the top-level config["rope_theta"] / config["rope_scaling"], but ministral3 uses rope_parameters (see mlx_lm/models/ministral3.py) with rope_theta nested under that dict. As-is, GGUF export for ministral3 will ignore a non-default rope_parameters["rope_theta"] and any associated scaling params, producing incorrect metadata for some checkpoints. Consider normalizing the config before calling convert_to_gguf() (e.g., copy rope_parameters["rope_theta"] into rope_theta, and map any scaling fields as needed).
```python
# Apply MODEL_REMAPPING to match load() behavior
remapped_type = MODEL_REMAPPING.get(model_type, model_type)
if remapped_type not in ["llama", "mixtral", "mistral", "ministral3"]:
    raise ValueError(
        f"Model type {model_type} not supported for GGUF conversion."
    )
weights = dict(tree_flatten(model.parameters()))
convert_to_gguf(save_path, weights, config, str(save_path / args.gguf_path))
```
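The normalization suggested in the comment above could be sketched as follows. This is a hypothetical helper, not code from this repo; the name `normalize_rope_config` and the assumption that every non-theta key under `rope_parameters` is a scaling field are mine:

```python
def normalize_rope_config(config: dict) -> dict:
    """Hoist nested rope_parameters up to the top-level keys that
    convert_to_gguf() reads (hypothetical helper, not part of mlx_lm)."""
    config = dict(config)  # shallow copy; don't mutate the caller's dict
    rope = config.get("rope_parameters") or {}
    if "rope_theta" in rope:
        config["rope_theta"] = rope["rope_theta"]
    # Treat the remaining keys as scaling parameters.
    scaling = {k: v for k, v in rope.items() if k != "rope_theta"}
    if scaling:
        config["rope_scaling"] = scaling
    return config
```

Calling this on the config just before `convert_to_gguf()` would keep the export path unchanged for models that already store `rope_theta` at the top level.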
```diff
 model_type = config["model_type"]
-if model_type not in ["llama", "mixtral", "mistral"]:
+# Apply MODEL_REMAPPING to match load() behavior
+remapped_type = MODEL_REMAPPING.get(model_type, model_type)
+if remapped_type not in ["llama", "mixtral", "mistral", "ministral3"]:
     raise ValueError(
         f"Model type {model_type} not supported for GGUF conversion."
```
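The `dict.get` fallback in this hunk means unknown model types pass through unchanged, so only explicitly remapped types are rewritten before the allowlist check. A minimal illustration (the mapping contents here are illustrative, not the actual `MODEL_REMAPPING` from `mlx_lm`):

```python
# Illustrative remapping table; the real MODEL_REMAPPING lives in mlx_lm.
MODEL_REMAPPING = {"mistral": "llama"}

def remap(model_type: str) -> str:
    # Unknown types fall through unchanged, matching load() behavior.
    return MODEL_REMAPPING.get(model_type, model_type)
```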
The PR description says GGUF export supports both ministral3 and mistral3, but this allowlist only adds ministral3. If mistral3 is intended to be supported, GGUF conversion likely also needs special handling because mistral3 configs are nested under text_config and parameter names are prefixed with language_model., which convert_to_gguf()/translate_weight_names() won’t match. Either add the missing mistral3 support end-to-end (flatten config + strip/translate weight prefixes) or update the PR description to reflect that only ministral3 is supported for GGUF export.
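If mistral3 support is added end-to-end, the flattening this comment describes could be sketched like so. The helper name `flatten_mistral3` is hypothetical, and this assumes the nesting is exactly a `text_config` dict plus a `language_model.` weight prefix:

```python
def flatten_mistral3(config: dict, weights: dict) -> tuple[dict, dict]:
    """Hypothetical pre-processing for mistral3 GGUF export: hoist
    text_config to the top level and drop the language_model. prefix."""
    flat_config = {**config, **config.get("text_config", {})}
    flat_config.pop("text_config", None)
    prefix = "language_model."
    flat_weights = {
        (k[len(prefix):] if k.startswith(prefix) else k): v
        for k, v in weights.items()
    }
    return flat_config, flat_weights
```

With something like this applied first, the existing `convert_to_gguf()`/`translate_weight_names()` paths would see the same shapes they see for plain mistral checkpoints.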
```python
self.model_test_runner(
    model, args.model_type, text_config["vocab_size"], text_config["num_hidden_layers"]
)
```
This model_test_runner(...) call exceeds Black’s default line length and will be reformatted by the pre-commit hook (which runs in CI). Please run pre-commit/Black so the arguments are wrapped consistently with the rest of the file.
This pull request adds support for the new ministral3 and mistral3 model types across the codebase, ensuring they are properly recognized, handled in quantization, and thoroughly tested. The changes also improve consistency in model type remapping and GGUF export logic.

Model support and integration:

- Added the ministral3 and mistral3 model types to the AWQ quantization configuration, ensuring these models are supported during quantization and mapped to the correct configuration (mlx_lm/quant/awq.py).
- Applied MODEL_REMAPPING for consistent model type handling and included ministral3 and mistral3 as supported types (mlx_lm/fuse.py). [1] [2]

Testing:

- Added unit tests for the ministral3 and mistral3 models, covering model instantiation and prompt cache construction, to ensure correct behavior and compatibility (tests/test_models.py).