Вопрос по data preprocessing. Цитирую: An important point to make about the

Question

Вопрос по data preprocessing. Цитирую: An important point to make about the

preprocessing is that any preprocessing statistics (e.g. the data mean) must only be computed on the training data, and then applied to the validation / test data. E.g. computing the mean and subtracting it from every image across the entire dataset and then splitting the data into train/val/test splits would be a mistake. Instead, the mean must be computed only over the training data and then subtracted equally from all splits (train/val/test).

По каким причинам рекомендуется считать среднее по всему train сету, а потом это среднее применять к валидации и тесту, А НЕ считать среднее изначально на всем сете (не только на train)?

0

29.12.2017

1 ответов

14 просмотров

Evgenii Zheltonozhskii🇮🇱 · Accepted Answer

Evgenii Zheltonozhskii🇮🇱

чтобы не иметь никакой информации о валидации

0

29.12.2017

Похожие чаты

Вопрос по data preprocessing. Цитирую: An important point to make about the

1 ответов

Похожие вопросы