DualSchool: Robust Quantitative Evaluation of Leading LLMs on Simple OR Tasks?

Under Review, 2025

Klamkin, M., Deza, A., Cheng, S., Zhao, H., & Van Hentenryck, P. (2025). "DualSchool: Robust Quantitative Evaluation of Leading LLMs on Simple OR Tasks?" (under review). arXiv:2505.21775.

Download Paper