HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
8 months ago
LLM
Transformer
AI Compiler
AI Infra
Method/Architecture
Summary
Paper
Benchmarks
Resources
timdettmers/bitsandbytes
Official
pytorch
kohjingyu/fromage
pytorch
huggingface/transformers-bloom-inference
pytorch
alextmallen/adaptive-retrieval
pytorch
HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
8 months ago
LLM
Transformer
AI Compiler
AI Infra
Method/Architecture
Summary
Paper
Benchmarks
Resources
timdettmers/bitsandbytes
Official
pytorch
kohjingyu/fromage
pytorch
huggingface/transformers-bloom-inference
pytorch
alextmallen/adaptive-retrieval
pytorch
7.9k
7.9k
566
566
486
486
187
187