-
Notifications
You must be signed in to change notification settings - Fork 623
Pull requests: ml-explore/mlx-lm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(server): opt-in disk-backed L2 prompt cache (--prompt-cache-disk-dir)
#1218
opened Apr 27, 2026 by
freddyhaddad
Loading…
Add Metal VJP kernel for gated_delta_update (trainable Qwen3.5 / Qwen3-Next LoRA on Apple Silicon)
#1217
opened Apr 27, 2026 by
SudarkinV
Loading…
fix(utils): skip already-quantized layers in load_model._quantize predicate
#1216
opened Apr 27, 2026 by
adurham
Contributor
Loading…
Skip quantizing Gemma 4 per_layer_model_projection for Swift compatibility
#1209
opened Apr 27, 2026 by
kr1s0404
Loading…
3 tasks
feat(server): add OpenAI Responses API endpoint (/v1/responses)
#1207
opened Apr 27, 2026 by
cassiolpaixao90
Loading…
5 tasks
fix(gemma4): drop KV-shared layer projections in sanitize
#1205
opened Apr 26, 2026 by
Fox13
Loading…
minimax: validate head_dim against checkpoint, drop unused shared_intermediate_size
#1204
opened Apr 26, 2026 by
adurham
Contributor
Loading…
Add TurboQuantKVCache: 3-bit/4-bit KV cache compression for generation
#1202
opened Apr 26, 2026 by
dedalien
Loading…
Add DeepSeek-V4 (Flash) model support
#1201
opened Apr 26, 2026 by
akashgoswami
•
Draft
3 of 6 tasks
fix: prevent double-shift of norm weights for converted VLM checkpoints
#1198
opened Apr 25, 2026 by
Thump604
Loading…
feat: add thinking budget with early-stopping prompt injection
#1196
opened Apr 25, 2026 by
Thump604
Loading…
feat: add DeepSeek-V4 (Pro/Flash) model support
#1189
opened Apr 24, 2026 by
machiabeli
Loading…
5 of 7 tasks
Include context_length in /v1/models response (#1183)
#1184
opened Apr 23, 2026 by
seikixtc
Loading…
Auto-discover tool-call markers from tokenizer config fields
#1163
opened Apr 18, 2026 by
michaelstingl
Loading…
6 tasks done
feat(nemotron_h): add Multi-Token Prediction (MTP) module
#1161
opened Apr 16, 2026 by
Thump604
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-04-24.