Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation
2026
0 citations
Journal Article
Green Open Access
Field-Weighted Citation Impact: 0.00