Pull requests: EleutherAI/lm-evaluation-harness

Pull requests list

fix storycloze datanames
#2409 opened Oct 19, 2024 by t1101675
Fix Type Hints for vLLM CausalLM model
#2408 opened Oct 18, 2024 by qthequartermasterman
Support for IBM watsonx_llm
#2397 opened Oct 11, 2024 by Medokins
Update citation links to Zenodo and DOI to 0.4.5
#2391 opened Oct 9, 2024 by LSinev
add Russian mmlu
#2378 opened Oct 3, 2024 by tatiana-iazykova
Add the BlueBench benchmark
#2369 opened Oct 1, 2024 by shachardon
Remove unnecessary space prefix
#2368 opened Oct 1, 2024 by eldarkurtic
MMLU Pro Plus
#2366 opened Sep 30, 2024 by asgsaeid
fix cost_estimate script
#2359 opened Sep 26, 2024 by baberabb Draft
Add metabench task to LM Evaluation Harness
#2357 opened Sep 26, 2024 by kozzy97
Support pipeline parallel with OpenVINO models
#2349 opened Sep 25, 2024 by sstrehlk
Mathvista
#2321 opened Sep 18, 2024 by baberabb Draft
mmlu translated professionally by OpenAI
#2312 opened Sep 17, 2024 by giuliolovisotto
Scrolls branch
#2309 opened Sep 16, 2024 by blitzionic
add new truncation strategy
#2300 opened Sep 15, 2024 by artemorloff Draft
Gen Prefix
#2274 opened Sep 2, 2024 by baberabb
Nvidia TensorRT-LLM
#2271 opened Sep 1, 2024 by abhishekvijeev Draft
Add Yue-Benchmark and update tasks description
#2270 opened Aug 31, 2024 by cpa2001
Ifeval: Dowload punkt_tab on rank 0
#2267 opened Aug 30, 2024 by baberabb
[Draft] llm-as-judge
#2251 opened Aug 25, 2024 by baberabb Draft
Minor features
#2249 opened Aug 25, 2024 by artemorloff
Add MBPP
#2247 opened Aug 23, 2024 by hjlee1371
Add GPTQModel support for inferencing GPTQ models
#2217 opened Aug 16, 2024 by Qubitium