PMID- 37510011
OWN - NLM
STAT- PubMed-not-MEDLINE
LR  - 20230801
IS  - 1099-4300 (Electronic)
IS  - 1099-4300 (Linking)
VI  - 25
IP  - 7
DP  - 2023 Jul 14
TI  - A Preprocessing Manifold Learning Strategy Based on t-Distributed Stochastic Neighbor Embedding.
LID - 10.3390/e25071065 [doi]
LID - 1065
AB  - In machine learning and data analysis, dimensionality reduction and high-dimensional data visualization can be accomplished by manifold learning using a t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm. We significantly improve this manifold learning scheme by introducing a preprocessing strategy for the t-SNE algorithm. In our preprocessing, we exploit Laplacian eigenmaps to reduce the high-dimensional data first, which can aggregate each data cluster and reduce the Kullback-Leibler divergence (KLD) remarkably. Moreover, the k-nearest-neighbor (KNN) algorithm is also involved in our preprocessing to enhance the visualization performance and reduce the computation and space complexity. We compare the performance of our strategy with that of the standard t-SNE on the MNIST dataset. The experiment results show that our strategy exhibits a stronger ability to separate different clusters as well as keep data of the same kind much closer to each other. Moreover, the KLD can be reduced by about 30% at the cost of increasing the complexity in terms of runtime by only 1-2%.
FAU - Shi, Sha
AU  - Shi S
AD  - State Key Laboratory of Integrated Services Network, Xidian University, 2 South TaiBai Road, Xi'an 710071, China.
FAU - Xu, Yefei
AU  - Xu Y
AUID- ORCID: 0009-0001-2742-6760
AD  - State Key Laboratory of Integrated Services Network, Xidian University, 2 South TaiBai Road, Xi'an 710071, China.
FAU - Xu, Xiaoyang
AU  - Xu X
AD  - State Key Laboratory of Integrated Services Network, Xidian University, 2 South TaiBai Road, Xi'an 710071, China.
FAU - Mo, Xiaofan
AU  - Mo X
AD  - National Astronomical Observatories, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing 100101, China.
FAU - Ding, Jun
AU  - Ding J
AD  - Institute of Information Sensing, Xidian University, 2 South TaiBai Road, Xi'an 710071, China.
LA  - eng
GR  - 2020ZDLGY08-06 and 2023-YBGY-206/the Key Research and Development Project of Shannxi Province/
GR  - 2023A1515010671/GuangDong Basic and Applied Basic Research Foundation/
PT  - Journal Article
DEP - 20230714
PL  - Switzerland
TA  - Entropy (Basel)
JT  - Entropy (Basel, Switzerland)
JID - 101243874
PMC - PMC10378244
OTO - NOTNLM
OT  - dimensionality reducing
OT  - k-nearest neighbor
OT  - manifold learning
OT  - t-SNE
COIS- The authors declare no conflicts of interest.
EDAT- 2023/07/29 11:42
MHDA- 2023/07/29 11:43
PMCR- 2023/07/14
CRDT- 2023/07/29 01:15
PHST- 2023/04/24 00:00 [received]
PHST- 2023/07/01 00:00 [revised]
PHST- 2023/07/05 00:00 [accepted]
PHST- 2023/07/29 11:43 [medline]
PHST- 2023/07/29 11:42 [pubmed]
PHST- 2023/07/29 01:15 [entrez]
PHST- 2023/07/14 00:00 [pmc-release]
AID - e25071065 [pii]
AID - entropy-25-01065 [pii]
AID - 10.3390/e25071065 [doi]
PST - epublish
SO  - Entropy (Basel). 2023 Jul 14;25(7):1065. doi: 10.3390/e25071065.