HyperAIHyperAI

Command Palette

Search for a command to run...

vLLM+Open WebUI Deployment FairyR1-14B-Preview

1. Tutorial Introduction

FairyR1-14B-Preview is a lightweight, high-performance model released in May 2025 by Professor Yang Tong's team at the School of Computer Science at Peking University, focusing on math and code tasks. The model is based on the DeepSeek-R1-Distill-Qwen-32B base and is built by combining fine-tuning and model merging techniques. The study explored the possibility of achieving comparable or even better performance on specific tasks than larger models with a significant reduction in the number of parameters. The research was funded by the National Natural Science Foundation of China (62372009).

This tutorial uses a single RTX A6000 card as the resource.

2. Project Examples

3. Operation steps

1. After starting the container, click the API address to enter the Web interface

2. After entering the webpage, you can start a conversation with the model

If "Model" is not displayed, it means the model is being initialized. Since the model is large, please wait about 2-3 minutes and refresh the page.

How to use

4. Discussion

🖌️ If you see a high-quality project, please leave a message in the background to recommend it! In addition, we have also established a tutorial exchange group. Welcome friends to scan the QR code and remark [SD Tutorial] to join the group to discuss various technical issues and share application effects↓

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
vLLM+Open WebUI Deployment FairyR1-14B-Preview | Tutorials | HyperAI