Researchclopedia
Research
Researchers
Institutions
Topics
Submit
About
Search...
⌘
K
Command Palette
Search for a command to run...
Back to research
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
2026
0 citations
Journal Article
diamond Open Access
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | Researchclopedia
Harbin Institute of Technology
Wei Bao
·
China Electronics Standardization Institute
Jian Dong
·
China Electronics Standardization Institute
Bing Xu
·
Harbin Institute of Technology
Conghui Zhu
·
Harbin Institute of Technology
Han Cao
·
Harbin Institute of Technology
Tiejun Zhao
·
Harbin Institute of Technology