What is the curse of dimensionality and how should one deal with it when building machine-learning models?

Data Science Interview QuestionsCategory: Data ScienceWhat is the curse of dimensionality and how should one deal with it when building machine-learning models?
1 Answers
MockInterview Staff answered 6 years ago
  • refers to various phenomena that arise when analyzing and organizing data in high dimensional spaces
  • Common theme: when number of dimensions increases, the volume of the space increases so fast that the available data becomes sparse
  • Issue with any method that requires statistical significance: the amount of data needed to support the result grows exponentially with the dimensionality
  • Issue when algorithms don’t scale well on high dimensions typically when O(nkn)O(nkn)
  • Everything becomes far and difficult to organize

Source

Your Answer

13 + 14 =