Can I user two KernelPCA after using TruncatedSVD before clustering?

by Sayed Shazeb   Last Updated July 12, 2019 05:19 AM

I am working on a project at a company where I have to make clustering/unsupervised model. The data I am working on is very sparse with high dimensions and after some research, I found out TruncatedSVD is good for sparse data. So I applied TruncatedSVD on my data and ran a few clustering(k-means and GMM) algorithms on it and then I used top2 and top3 featured from truncatedSVD transformation to visualize the data in 2d and 3d space but the clusters are not distinctively separated.

So I did some more research I found out that kernel PCA is good at finding the separation between the data but it might not be good for sparse data. So my question is Can I use kernel PCA after transforming the data with truncatedSVD?

Will this be a correct approach that I can try? or are there are any other suggestion on how to tackle this problem?

To give you an idea about the dataset My data includes user_ids(unique) and their corresponding click frequency on multiple webpages and the click frequency on topics also there are some more variables related to time like time different between session and number of distinct days the user has visited the website etc.



Related Questions


Updated March 04, 2018 10:19 AM

Updated May 07, 2018 10:19 AM

Updated September 03, 2016 08:08 AM

Updated August 21, 2018 21:19 PM

Updated May 07, 2018 16:19 PM