HyperAIHyperAI

Command Palette

Search for a command to run...

Bootstrap Sampling / Repeatable Sampling / Sampling With Replacement

Date

6 years ago

For a sample, the probability of being collected in a random sampling of a training set containing m samples is 1m, and the probability of not being collected is 1−1m.

If the probability that no data is collected after m samplings is (1−1m)m, then when m→∞, (1−1m)m→1/e≃0.368, that is, in each round of random sampling, approximately 36.8% of data in the training set is not collected in the sampling set.

After replacement sampling, the data set will have some data duplication and some data missing. K samples are sampled from N samples, and the expectation of different sample numbers is U(K)=N(1−(N−1N)K).

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Bootstrap Sampling / Repeatable Sampling / Sampling With Replacement | Wiki | HyperAI