Polish Language Benchmarks
Evaluates linguistic and cultural understanding in Polish
Vision Language Model benchmark for Polish cultural understanding
A 15-taxonomy reasoning model benchmark for text tasks of varying difficulty
International Benchmarks (Bielik Evaluated)
Regional knowledge benchmark for European Languages
Reading comprehension benchmark for European languages
Multilingual translation benchmark for European languages
Multi-language European language model evaluation
European multilingual model evaluation platform
Original comprehensive LLM evaluation leaderboard
Updated version of the Open LLM Leaderboard
Mixed evaluation benchmark for language models
Evaluates function calling capabilities of LLMs
Large-scale multilingual translation evaluation
Czech language model benchmark suite
Portuguese language model evaluation platform