HyperAIHyperAI

Command Palette

Search for a command to run...

MSR-VTT Video Caption Dataset

Date

3 years ago

Size

8.08 GB

Organization

Microsoft

License

Other

Featured Image

MSR-VTT, the full name of Microsoft Research Video to Text, is a large-scale video captioning dataset for open domains.

The dataset includes 10,000 video clips from 20 categories, each with 20 English sentences annotated by Amazon Mechanical Turks. There are about 29,000 different words in all the captions. The standard split uses 6,513 clips for training, 497 clips for validation, and 2,990 clips for testing.

MSR-VTT.torrent
Seeding 2Downloading 0Completed 932Total Downloads 2,055
  • MSR-VTT/
    • README.md
      1.22 KB
    • README.txt
      2.44 KB
      • data/
        • test-video_ustc.zip
          1.97 GB
        • test_videodatainfo.json
          1.98 GB
        • train-video.zip
          8.07 GB
        • train_val_videodatainfo.json
          8.08 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MSR-VTT Video Caption Dataset | Datasets | HyperAI