Ebisu: Benchmarking Large Language Models in Japanese Finance Paper โข 2602.01479 โข Published 15 days ago โข 17
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Paper โข 2506.14028 โข Published Jun 16, 2025 โข 93
Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance Paper โข 2502.18772 โข Published Feb 26, 2025 โข 32
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper โข 2502.08127 โข Published Feb 12, 2025 โข 59
Running on CPU Upgrade 13.9k Open LLM Leaderboard ๐ 13.9k Track, rank and evaluate open LLMs and chatbots