Benchmark accuracy of Klear-Reasoner-8B on AIME 2024/2025 (avg@64), LiveCodeBench V5 (2024/08/01-2025/02/01, avg@8), and v6 (2025/02/01-2025/05/01, avg@8).
⭐ If SSDiff is helpful to your paper or project, please consider star this repo or cite our paper. Thanks! 🤗 2025.12.07: Codes are relased. 2025.12.03: Checkpoints and scripts are relased. 2025.12.02 ...