Date

6 years ago

For a sample, the probability of being collected in a random sampling of a training set containing m samples is 1m, and the probability of not being collected is 1−1m.

If the probability that no data is collected after m samplings is (1−1m)m, then when m→∞, (1−1m)m→1/e≃0.368, that is, in each round of random sampling, approximately 36.8% of data in the training set is not collected in the sampling set.

After replacement sampling, the data set will have some data duplication and some data missing. K samples are sampled from N samples, and the expectation of different sample numbers is U(K)=N(1−(N−1N)K).

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Date

6 years ago

For a sample, the probability of being collected in a random sampling of a training set containing m samples is 1m, and the probability of not being collected is 1−1m.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Bootstrap Sampling / Repeatable Sampling / Sampling With Replacement

Build AI with AI

HyperAI Newsletters

Command Palette

Bootstrap Sampling / Repeatable Sampling / Sampling With Replacement

Build AI with AI

HyperAI Newsletters

Command Palette

Bootstrap Sampling / Repeatable Sampling / Sampling With Replacement

Build AI with AI

HyperAI Newsletters