BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper β’ 2510.08697 β’ Published Oct 9 β’ 36