Pull requests: ml-explore/mlx-lm
Improve OOM handling and add memory controls for long-running coding agent sessions
#948 opened Mar 4, 2026 by dmitryryabkov
Support KV cache quantization with continuous batching
#941 opened Mar 2, 2026 by ochafik
3 tasks done
feat: add --kv-bits CLI args for server KV cache quantization
#934 opened Feb 27, 2026 by lichengzhe
fix(types): improve type hints for mlx_lm.utils.load method
#919 opened Feb 22, 2026 by rahuliyer95
fix: handle tool call parse errors gracefully in server
#899 opened Feb 16, 2026 by shanemmattner
Fix dynamic_quant for MoE and VL models
#870 opened Feb 10, 2026 by Taderich73
3 tasks done
refactor: use time.perf_counter() for duration measurements
#848 opened Feb 6, 2026 by m92y
feat: enhance chat CLI with readline history, line editing, and distributed support
#841 opened Feb 3, 2026 by Vlor999
moved more activation functions to activations module
#774 opened Jan 19, 2026 by Goekdeniz-Guelmez