Explain what resampling methods are and why they are useful

Data Science Interview QuestionsCategory: Data ScienceExplain what resampling methods are and why they are useful
1 Answers
MockInterview Staff answered 4 years ago
  • repeatedly drawing samples from a training set and refitting a model of interest on each sample in order to obtain additional information about the fitted model
  • example: repeatedly draw different samples from training data, fit a linear regression to each new sample, and then examine the extent to which the resulting fit differ
  • most common are: cross-validation and the bootstrap
  • cross-validation: random sampling with no replacement
  • bootstrap: random sampling with replacement
  • cross-validation: evaluating model performance, model selection (select the appropriate level of flexibility)
  • bootstrap: mostly used to quantify the uncertainty associated with a given estimator or statistical learning method

SOURCE

Your Answer

20 + 20 =