[D] Exact kNN for high dimensional data

xero786@alien.top · 1 year ago

[D] Exact kNN for high dimensional data

PM_ME_YOUR_BAYES@alien.top · 1 year ago

1k features are a lot, but not really A LOT. Also, you didn’t mention how many samples you have. Without any other knowledge, off the top of my head I would try to fit a self-organizing map and then use it as an “index” to retrieve the samples most similar to the query, and finish with a kNN only on those.

xero786@alien.top · 1 year ago

My dataset is about 8000 points, and the reason I am not using ANN is that I am trying to study and experiment with how exact kNNs work, what I can do with them, and which is best among them in high-dimensional space…
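For reference, the baseline that "exact kNN" usually means is a brute-force scan: measure the distance from the query to every point and keep the k smallest. Below is a minimal stdlib-only sketch of that idea. The function name `exact_knn`, the Euclidean metric, and the random data (scaled down to 2000 points with 100 features so it runs quickly) are all assumptions for illustration, not anything taken from the actual dataset.

```python
import heapq
import math
import random


def exact_knn(points, query, k):
    """Return the k (distance, index) pairs closest to query, nearest first."""
    # Compare the query against every stored point: O(n * d) work per query.
    distances = ((math.dist(p, query), i) for i, p in enumerate(points))
    return heapq.nsmallest(k, distances)


# Hypothetical stand-in data: 2000 random points with 100 features each.
rng = random.Random(0)
points = [[rng.random() for _ in range(100)] for _ in range(2000)]

# Query with a point from the dataset, so its own index comes back first.
query = points[42]
neighbours = exact_knn(points, query, k=5)
for dist, idx in neighbours:
    print(f"index={idx} distance={dist:.4f}")
```

With only about 8000 points, a full scan like this is cheap, so it also makes a handy ground truth when comparing other exact methods.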

PM_ME_YOUR_BAYES@alien.top · 1 year ago

SOMs are not like the neural network predictors you would see around here, in the sense that they do not learn new feature spaces. It would have been the same if I had suggested using k-means to reduce the search space and then doing kNN.
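A rough sketch of the k-means variant, assuming plain Lloyd's iterations and Euclidean distance: cluster the data once, then for each query run kNN only over the points in the closest cluster(s). All names here (`assign`, `kmeans`, `knn_via_clusters`, `n_probe`) and the random data are hypothetical. One caveat worth stating: searching only the nearest cluster can miss true neighbours that sit just across a cluster boundary, so the result is only guaranteed exact when every cluster is probed.

```python
import heapq
import math
import random


def assign(points, centroids):
    """Group point indices by their nearest centroid."""
    members = [[] for _ in centroids]
    for i, p in enumerate(points):
        nearest = min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
        members[nearest].append(i)
    return members


def kmeans(points, n_clusters, n_iters=5, seed=0):
    """Plain Lloyd's algorithm; returns (centroids, members)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, n_clusters)]
    for _ in range(n_iters):
        members = assign(points, centroids)
        for c, idx in enumerate(members):
            if idx:  # keep the old centroid if a cluster ends up empty
                cols = zip(*(points[i] for i in idx))
                centroids[c] = [sum(col) / len(idx) for col in cols]
    # Re-assign once more so the membership lists match the final centroids.
    return centroids, assign(points, centroids)


def knn_via_clusters(points, centroids, members, query, k, n_probe=1):
    """kNN restricted to the n_probe clusters whose centroids are closest to query."""
    probe = heapq.nsmallest(n_probe, range(len(centroids)),
                            key=lambda c: math.dist(query, centroids[c]))
    candidates = [i for c in probe for i in members[c]]
    return heapq.nsmallest(k, ((math.dist(points[i], query), i) for i in candidates))


# Hypothetical stand-in data: 2000 random points with 50 features each.
rng = random.Random(1)
points = [[rng.random() for _ in range(50)] for _ in range(2000)]
centroids, members = kmeans(points, n_clusters=8)

query = points[42]
reduced = knn_via_clusters(points, centroids, members, query, k=5)
full = knn_via_clusters(points, centroids, members, query, k=5, n_probe=len(centroids))
print("nearest cluster only:", [i for _, i in reduced])
print("all clusters probed: ", [i for _, i in full])
```

Raising `n_probe` trades speed for recall; comparing the two printed lists on your own data shows how often the single-cluster shortcut loses a true neighbour.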