- What does 望むところだ mean?
- Should a non native try to adapt his German when traveling to Austria?
- Can one be compelled to testify?
- Why divorcing your first wife should be done only in extreme cases?
- Using sql to get testing job
- Hibernate Criteria Date restriction between two dates
- Need help isolating a laptop camera related issue (camera does not start on Chrome, but would otherwise work fine, only happens to 1 tester)
- how do you tune speaker systems?
- Other than tone, are there reasons to consider a semi-hollow over a solid body electric guitar?
- Is there any reason not to use TRS cables for typically TS applications?
- Tools to automate “tape-like” audio manipulation
- How to add SPBuiltInFieldId.LinkFilenameNoMenu to library view
- How to display a column of another list in display form of a list
- Sharepoint On Premise with Azure Application insights
- Enable Document Set Missing from Site Settings - Site Features
- TPMS sensor manufacturers
- What lug pattern fits a 1994 Dodge Dakota?
- How does a tire's diameter and width impact fuel economy?
- engine not working properly after the distributor change
- bent tabular rear control arm
K-means: why reduce dimensions first?
I'm a bit confused about the usefulness of reducing dimensions before doing a k-means clustering.
Suppose you want to apply k-means to a set points $(x_i)$ with high dimension. You want to minimize the cost function $\sum_i \|x_i-c_i\|^2$ where $c_i$ is the center of the cluster $x_i$ belongs to.
You have basically two methods:
A: do a k-means (Lloyd) directly on $(x_i)$
B: reduce the number of dimensions with some dimensional reduction method (such as SVD/PCA), and then apply k-means to the points with reduced dimensions
On the one hand, A is unlikely to find the global minimum, replications will help getting closer to it. It might have a high computational cost due to handling high dimension vectors and many replications.
On the other hand, B is more likely to get close to the global minimum (or even reach it) with fewer replications. But the minimum on the reduced version is not the original minimum (it is however known to be close to it). There is of course an