Resources - LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale | Papers | HyperAI

HyperAI

Main

GPU

Console
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

8 months ago

Method/Architecture

Summary Paper Benchmarks Resources

timdettmers/bitsandbytes

Official

pytorch

kohjingyu/fromage

pytorch

huggingface/transformers-bloom-inference

pytorch

alextmallen/adaptive-retrieval

pytorch

Build the Future of Artificial Intelligence

About

About Us Dataset Help

Products

News Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

8 months ago

Method/Architecture

Summary Paper Benchmarks Resources

timdettmers/bitsandbytes

Official

pytorch

kohjingyu/fromage

pytorch

huggingface/transformers-bloom-inference

pytorch

alextmallen/adaptive-retrieval

pytorch

Build the Future of Artificial Intelligence

About

About Us Dataset Help

Products

News Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

7.9k

7.9k

566

566

486

486

187

187