HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Claude 3.5 Sonnet Model Card Addendum

{Anthropic}

Claude 3.5 Sonnet Model Card Addendum

Abstract

This addendum to our Claude 3 Model Card describes Claude 3.5 Sonnet, a new model which outperformsour previous most capable model, Claude 3 Opus, while operating faster and at a lower cost. Claude 3.5Sonnet offers improved capabilities, including better coding and visual processing. Since it is an evolution ofthe Claude 3 model family, we are providing an addendum rather than a new model card. We provide updatedkey evaluations and results from our safety testing.

Benchmarks

BenchmarkMethodologyMetrics
code-generation-on-humanevalGPT-4o (0-shot)
Pass@1: 90.2
mmr-total-on-mrr-benchmarkClaude 3.5 Sonnet
Total Column Score: 463
multi-task-language-understanding-on-mmluClaude 3.5 Sonnet (5-shot)
Average (%): 88.7
question-answering-on-newsqaAnthropic/claude-3-5-sonnet
EM: 74.23
F1: 82.3
visual-question-answering-on-mm-vetClaude 3.5 Sonnet (claude-3-5-sonnet-20240620)
GPT-4 score: 74.2±0.2
visual-question-answering-on-mm-vet-v2Claude 3.5 Sonnet (claude-3-5-sonnet-20240620)
GPT-4 score: 71.8±0.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Claude 3.5 Sonnet Model Card Addendum | Papers | HyperAI