Add EngGPT MoE model support #1199
Open
robertobissanti wants to merge 1 commit into ml-explore:main from
Conversation
Summary
This PR adds initial support for the `enggpt_moe` architecture used by `engineering-group/EngGPT2-16B-A3B`. The model is a decoder-only MoE language model with:

- `head_dim = 128`
- `rope_theta = 1000000.0`
- `lm_head`

The implementation is based on the existing Mixtral/SwitchGLU infrastructure, adapted to match the EngGPT MoE checkpoint structure and routing logic; a routing sketch is shown below.
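For reference, here is a minimal sketch of the Mixtral-style top-k routing that this kind of block typically uses. Class and parameter names (`EngGPTMoEBlock`, `ExpertMLP`, `num_experts`, `top_k`) are illustrative assumptions, not the PR's actual code; mlx-lm's `SwitchGLU` replaces the naive run-all-experts mixing below with fused, gathered matmuls.

```python
# Illustrative sketch only: a Mixtral-style top-k MoE block in MLX.
# Names (ExpertMLP, EngGPTMoEBlock, top_k) are assumptions, not the PR's code.
import mlx.core as mx
import mlx.nn as nn


class ExpertMLP(nn.Module):
    """A single SwiGLU expert (gate/up/down projections)."""

    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.gate_proj = nn.Linear(dim, hidden_dim, bias=False)
        self.up_proj = nn.Linear(dim, hidden_dim, bias=False)
        self.down_proj = nn.Linear(hidden_dim, dim, bias=False)

    def __call__(self, x):
        return self.down_proj(nn.silu(self.gate_proj(x)) * self.up_proj(x))


class EngGPTMoEBlock(nn.Module):
    """Route each token to its top-k experts and mix their outputs."""

    def __init__(self, dim: int, hidden_dim: int, num_experts: int, top_k: int):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(dim, num_experts, bias=False)
        self.experts = [ExpertMLP(dim, hidden_dim) for _ in range(num_experts)]

    def __call__(self, x):
        gates = self.gate(x)                                    # (..., num_experts)
        k = self.top_k
        # Indices of the k highest-scoring experts per token.
        inds = mx.argpartition(-gates, kth=k - 1, axis=-1)[..., :k]
        # Renormalize the selected router scores.
        scores = mx.softmax(mx.take_along_axis(gates, inds, axis=-1), axis=-1)
        # Spread the k scores back into a dense per-expert weight vector.
        num_experts = gates.shape[-1]
        onehot = (inds[..., None] == mx.arange(num_experts)).astype(gates.dtype)
        weights = (scores[..., None] * onehot).sum(axis=-2)     # (..., num_experts)
        # Naive mixing: run every expert and take the weighted sum.
        # (SwitchGLU in mlx-lm instead computes only the selected experts.)
        out = mx.stack([e(x) for e in self.experts], axis=-2)   # (..., E, dim)
        return (out * weights[..., None]).sum(axis=-2)


# Example: 8 experts, 2 active per token, as in Mixtral-style configs.
block = EngGPTMoEBlock(dim=64, hidden_dim=128, num_experts=8, top_k=2)
y = block(mx.random.normal((1, 4, 64)))
print(y.shape)  # (1, 4, 64)
```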
Motivation
The EngGPT2-16B-A3B model cannot currently be loaded or converted with
`mlx-lm` because its `config.json` declares: