HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Mistral 7B

Albert Q. Jiang; Alexandre Sablayrolles; Arthur Mensch; Chris Bamford; Devendra Singh Chaplot; Diego de las Casas; Florian Bressand; Gianna Lengyel; Guillaume Lample; Lucile Saulnier; Lélio Renard Lavaud; Marie-Anne Lachaux; Pierre Stock; Teven Le Scao; Thibaut Lavril; Thomas Wang; Timothée Lacroix; William El Sayed

Mistral 7B

Abstract

We introduce Mistral 7B v0.1, a 7-billion-parameter language model engineered for superior performance and efficiency. Mistral 7B outperforms Llama 2 13B across all evaluated benchmarks, and Llama 1 34B in reasoning, mathematics, and code generation. Our model leverages grouped-query attention (GQA) for faster inference, coupled with sliding window attention (SWA) to effectively handle sequences of arbitrary length with a reduced inference cost. We also provide a model fine-tuned to follow instructions, Mistral 7B -- Instruct, that surpasses the Llama 2 13B -- Chat model both on human and automated benchmarks. Our models are released under the Apache 2.0 license.

Code Repositories

mgmalek/efficient_cross_entropy
pytorch
Mentioned in GitHub
mistralai/mistral-src
Official
pytorch
facebookresearch/fairseq2
pytorch
Mentioned in GitHub
ninglab/ecellm
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
answerability-prediction-on-peerqaMistral-IT-v02-7B-32k
Macro F1: 0.4703
arithmetic-reasoning-on-gsm8kMistral 7B (maj@8)
Accuracy: 52.2
Parameters (Billion): 7
code-generation-on-mbppMistral 7B (3-shot)
Accuracy: 47.5
common-sense-reasoning-on-arc-challengeMistral 7B (0-shot)
Accuracy: 55.5
common-sense-reasoning-on-arc-easyMistral 7B (0-shot)
Accuracy: 80.0
common-sense-reasoning-on-winograndeMistral 7B (0-shot)
Accuracy: 75.3
math-word-problem-solving-on-mathMistral 7B (maj@4)
Accuracy: 13.1
Parameters (Billions): 7
multi-task-language-understanding-on-mmluMistral 7B (5-shot)
Average (%): 60.1
question-answering-on-natural-questionsMistral 7B (5-shot)
EM: 28.8
question-answering-on-peerqaMistral-v02-7B-32k
AlignScore: 0.0827
Prometheus-2 Answer Correctness: 3.4245
Rouge-L: 0.1922
question-answering-on-piqaMistral 7B (0-shot)
Accuracy: 83.0
question-answering-on-triviaqaMistral 7B (5-shot)
EM: 69.9
zero-shot-video-question-answer-on-intentqaMistral (7B)
Accuracy: 50.4
zero-shot-video-question-answer-on-next-gqaMistral (7B)
Acc@GQA: 9.2
zero-shot-video-question-answer-on-next-qaMistral (7B)
Accuracy: 51.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp