DualSchool: Robust Quantitative Evaluation of Leading LLMs on Simple OR Tasks?
Under Review, 2025
Klamkin, M., Deza, A., Cheng, S., Zhao, H., & Van Hentenryck, P. (2025). "DualSchool: Robust Quantitative Evaluation of Leading LLMs on Simple OR Tasks?" (under review). arXiv:2505.21775.
