HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
8 months ago
Benchmarks
Preference Modeling
Reasoning
Summary
Paper
Benchmarks
Resources
opengvlab/multi-modality-arena
561
pytorch
lm-sys/routellm
4.8k
pytorch
formulamonks/llm-benchmarker-suite
49
pytorch
ojiyumm/mt_bench_rwkv
0
pytorch
lm-sys/fastchat
39.5k
Official
pytorch
ilyagusev/ping_pong_bench
117
theoremone/llm-benchmarker-suite
49
pytorch
PAIR-code/llm-comparator
526
tf
kuk/rulm-sbs2
61
dongping-chen/mllm-as-a-judge
92
pytorch
bjoernpl/fasteval
1
HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
8 months ago
Benchmarks
Preference Modeling
Reasoning
Summary
Paper
Benchmarks
Resources
opengvlab/multi-modality-arena
561
pytorch
lm-sys/routellm
4.8k
pytorch
formulamonks/llm-benchmarker-suite
49
pytorch
ojiyumm/mt_bench_rwkv
0
pytorch
lm-sys/fastchat
39.5k
Official
pytorch
ilyagusev/ping_pong_bench
117
theoremone/llm-benchmarker-suite
49
pytorch
PAIR-code/llm-comparator
526
tf
kuk/rulm-sbs2
61
dongping-chen/mllm-as-a-judge
92
pytorch
bjoernpl/fasteval
1