ProductTitleClustering
PublicThis project clusters products by their titles and assigns topics. Initially using BERT, PCA, and t-SNE, the results were noisy. The improved approach with SBERT, UMAP, and HDBSCAN provides clearer clusters. Topics are assigned using Llama-3-8b.
content-clusteringdbscan-clusteringhdbscanllama3-8bllm-inferencesentence-bertumapunsupervised-learning
Creat:2024-07-11T11:15:18
Update:2024-12-15T02:51:42
5
Stars
0
Stars Increase