want to label them on the basis of parent domain and child domain by using the text present in them what's the best way to do this?
Should I use any unsupervised algo or topic modelling will help on this?
Is the content just text? see https://stackoverflow.com/a/2453229/154762
Обсуждают сегодня