Command Palette
Search for a command to run...
ShareGPT 90k Chinese and English Bilingual human-machine Question Answering Dataset
ShareGPT-Chinese-English-90k is a high-quality human-machine question-answering dataset in parallel in Chinese and English, covering user questions in real and complex scenarios. It can be used to train high-quality dialogue models (which are more robust in instruction distribution than those generated by repeatedly calling API interfaces to simulate machine questions and answers).
The characteristics of this dataset are:
- At the same time, it provides Chinese and English parallel comparison corpora with exactly the same meaning, which can be used for bilingual dialogue model training.
- All questions are not artificially imagined or fake data created by API polling (such as Moss), which is more in line with the command distribution and question expression of real user scenarios.
- The Sharegpt dataset is collected through spontaneous sharing by netizens, which is equivalent to a very natural filtering (through human sense), screening out most of the conversations with bad experiences.
ShareGPT-Chinese-English-90k.torrent
Seeding 1Downloading 0Completed 294Total Downloads 723
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.
AI Co-coding
Ready-to-use GPUs
Best Pricing
Hyper Newsletters
Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp