Commit fd5694d
Consolidate lm-eval scripts: merge AnyModel auto-detection into lm_eval_hf.py (#1206)
## Summary
- Merge `examples/puzzletron/evaluation/lm_eval_anymodel.py` into the existing `examples/llm_eval/lm_eval_hf.py` so there is a single evaluation entry point for both standard HF and AnyModel/Puzzletron checkpoints.
- AnyModel support is auto-detected at load time via `resolve_descriptor_from_pretrained`; the puzzletron extra is optional.
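
The optional-extra auto-detection described above can be pictured roughly as follows. This is a minimal sketch, not the code in `lm_eval_hf.py`: `load_optional_resolver`, `choose_loader`, and the `"anymodel"`/`"hf"` labels are hypothetical names, and the sketch assumes the resolver returns `None` for a checkpoint it does not recognize.

```python
import importlib
from typing import Callable, Optional


def load_optional_resolver(module_name: str, attr: str) -> Optional[Callable]:
    """Return `attr` from `module_name`, or None if the optional module is missing."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # The extra is not installed, so only the standard path is available.
        return None
    return getattr(module, attr, None)


def choose_loader(checkpoint: str, resolver: Optional[Callable]) -> str:
    """Pick a loading path for `checkpoint` (hypothetical labels)."""
    if resolver is None:
        return "hf"
    # Assumption: the resolver returns None for checkpoints it does not recognize.
    descriptor = resolver(checkpoint)
    return "anymodel" if descriptor is not None else "hf"
```

With this shape, a missing extra and an unrecognized checkpoint both fall through to the standard HF path, so one entry point can serve both kinds of checkpoint.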
## Notes
AnyModel auto-detection uses `resolve_descriptor_from_pretrained`, which currently relies on a hardcoded `_MODEL_TYPE_TO_DESCRIPTOR` dict that must be kept in sync manually with descriptor registrations. This should be addressed in the future.
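
Until that is addressed, one possible guard against drift is a small consistency check run in tests. The sketch below is an illustration only: `find_unsynced`, the sample model types, and the sample descriptor names are hypothetical, and the real shape of `_MODEL_TYPE_TO_DESCRIPTOR` and of the descriptor registry is not shown in this commit.

```python
from typing import Dict, Set, Tuple


def find_unsynced(
    model_type_to_descriptor: Dict[str, str],
    registered_descriptors: Set[str],
) -> Tuple[Set[str], Set[str]]:
    """Compare a hardcoded mapping against a set of registered descriptor names.

    Returns (mapped_but_unregistered, registered_but_unmapped).
    """
    mapped = set(model_type_to_descriptor.values())
    return mapped - registered_descriptors, registered_descriptors - mapped


# Hypothetical sample data, not taken from the repository.
mapping = {"type_a": "descriptor_a", "type_b": "descriptor_b"}
registry = {"descriptor_a", "descriptor_c"}
stale, missing = find_unsynced(mapping, registry)
```

Here `stale` holds descriptors the mapping points at that are no longer registered, and `missing` holds registered descriptors the mapping does not cover.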
<!-- This is an auto-generated comment: release notes by coderabbit.ai -->
## Summary by CodeRabbit
* **New Features**
  * Automated detection and correct loading of Puzzletron heterogeneous pruned checkpoints via the main evaluation entrypoint.
* **Documentation**
  * Added a “Heterogeneous Pruned Checkpoints (Puzzletron)” subsection with install notes, example evaluation commands, and smoke-test guidance.
* **Chores**
  * Removed the separate Puzzletron evaluation script and consolidated evaluation into the primary lm-eval workflow.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
---------
Signed-off-by: jrausch <jrausch@nvidia.com>

1 parent 25266b8 commit fd5694d
5 files changed
Lines changed: 65 additions & 119 deletions
File tree
- examples
  - llm_eval
  - puzzletron
    - evaluation

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 96-98 | 96-98 | |
| 99 | | - |
| 100-102 | 99-101 | |

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 40-42 | 40-42 | |
| | 43-58 | + |
| 43-45 | 59-61 | |

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 36-38 | 36-38 | |
| | 39 | + |
| 39-41 | 40-42 | |
| … | … | |
| 50-52 | 51-53 | |
| | 54-77 | + |
| 53-54 | 78-79 | |
| 55 | | - |
| | 80 | + |
| 56-58 | 81-83 | |
| … | … | |
| 72-74 | 97-99 | |
| 75 | | - |
| | 100-103 | + |
| 76-78 | 104-106 | |
| … | … | |
| 109-111 | 137-139 | |
| | 140-155 | + |
| 112 | 156 | |
| | 157 | + |
| 113-115 | 158-160 | |
| | 161 | + |
| 116-118 | 162-164 | |

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 235-237 | 235-237 | |
| 238 | | - |
| | 238 | + |
| 239-241 | 239-241 | |

This file was deleted.