The similarities are way too excellent to ignore. They most likely experienced the product on a synthetic dataset created by GPT-4o. Notice: +MC represents the addition of twenty million Chinese many-alternative issues gathered through the Internet. It can be crucial to note that we conducted deduplication to the C-Eval validation https://x.com/kidtsang/status/1884008035535782292