File tree
11 files changed
+507 -38 lines changed
- recipes
- configs/llama4
- torchtune
- modules
- moe
- training
Lines changed: 20 additions & 7 deletions
Original file line numbers | Diff line numbers | Diff line change |
---|---|---|
18-24 | 18-24 | 1 addition, 1 deletion (line 21) |
73-83 | 73-83 | 5 additions, 5 deletions (lines 76-80) |
93-96 | 93-109 | 14 additions (lines 96-109), 1 deletion (line 96) |
Lines changed: 32 additions & 4 deletions
Original file line numbers | Diff line numbers | Diff line change |
---|---|---|
16-21 | 16-22 | 1 addition (line 19) |
147-153 | 148-157 | 4 additions (lines 151-154), 1 deletion (line 150) |
328-333 | 332-339 | 2 additions (lines 335-336) |
343-349 | 349-355 | 1 addition (line 352), 1 deletion (line 346) |
380-385 | 386-392 | 1 addition (line 389) |
413-419 | 420-426 | 1 addition (line 423), 1 deletion (line 416) |
586-592 | 593-599 | 1 addition (line 596), 1 deletion (line 589) |
1068-1073 | 1075-1100 | 20 additions (lines 1078-1097) |
1081-1086 | 1108-1114 | 1 addition (line 1111) |
Lines changed: 1 addition & 1 deletion
Original file line numbers | Diff line numbers | Diff line change |
---|---|---|
47-53 | 47-53 | 1 addition, 1 deletion (line 50) |
Lines changed: 55 additions & 19 deletions
Original file line numbers | Diff line numbers | Diff line change |
---|---|---|
11-18 | 11-23 | 5 additions (lines 14, 17-20) |
50-55 | 55-61 | 1 addition (line 58) |
64-90 | 70-126 | 49 additions (lines 73-94, 96-104, 106-123), 19 deletions (lines 68-80, 82-87) |