HyperAIHyperAI

Command Palette

Search for a command to run...

Tongyi Qianwen 72B Chat Int4 Model Gradio Demo

Qwen-72B-Chat-Int4 demo

Model Introduction

Tongyi Qianwen-72B (Qwen-72B) is a 72 billion parameter model of the Tongyi Qianwen large model series developed by Alibaba Cloud. Qwen-72B is a large language model based on Transformer, trained on ultra-large-scale pre-training data. The pre-training data types are diverse and cover a wide range, including a large number of online texts, professional books, codes, etc. At the same time, based on Qwen-72B, the research team used the alignment mechanism to create Qwen-72B-Chat, an AI assistant based on a large language model. This repository is the repository of the Int4 quantization model of Qwen-72B-Chat. 1

One-click deployment

This tutorial is about running the Int4 quantized model of Tongyi Qianwen 72B Chat on OpenBayes.

How to run

  1. After the cloned container starts, open a new terminal page 2
  2. Enter the command python web_ui.py to run the Gradio demo 3
  3. Follow the prompts to open the link 4
  4. You can start talking to the model 5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Tongyi Qianwen 72B Chat Int4 Model Gradio Demo | Tutorials | HyperAI