Ros F. Feature and Dimensionality Reduction for Clustering...Deep Learning 2024
- Type:
- Other > E-books
- Files:
- 1
- Size:
- 3.63 MiB (3801915 Bytes)
- Uploaded:
- 2025-01-13 10:39:53 GMT
- By:
- andryold1
- Seeders:
- 32
- Leechers:
- 1
- Comments
- 0
- Info Hash: 9A9C0DA9ED2D9AA7D32FD87540640E7F54873412
(Problems with magnets links are fixed by upgrading your torrent client!)
Textbook in PDF format How to dig unknown information out of datasets is a widely concerning problem. Clustering, as an unsupervised method to partition a dataset naturally, has the ability to discover the potential and internal knowledge, laws, and rules of data. Clustering is one of the major unsupervised learning techniques and has been applied in many fields. In recent decades, researchers have proposed many clustering algorithms (Ezugwu et al., 2022) based on different theories and models, which are generally divided into several categories such as partitioning, hierarchical, density, grid, and model-based methods. When the data are easy to cluster, meaning that the groups are well-separated, most of the existing algorithms are likely to yield a good result, but clustering algorithms have to deal with more complex situations such as different types of attributes, various shapes, and densities and must include outlier and noise management. Despite the continuing stream of clustering algorithms proposed over the years to handle more complex structures, there are still remaining issues. In addition to the recurrent challenges, there are several issues concerning the dimension and volume of twenty-first century databases. They possess noisy, irrelevant, and redundant along with the most useful information. The general issues involve the curse of dimensionality and algorithm scalability. These issues are not novel but more challenging today in the era of big data. In this era, the size of data increases drastically day by day. Data grow in terms of both the number of instances and features. This increasing dimensionality degrades the performance of machine learning algorithms including clustering ones. Performance degradation is often observed when tackling either unprocessed supports such as images or high-dimensional features extracted from processed supports. In addition, there is a significant increase of computational time and space. Image clustering has recently attracted significant attention due to the increased availability of unlabeled datasets. In this case, unsupervised scenarios are shifted from their initial goal of knowledge discovery as the process is somewhere supervised-driven to improve supervised problems
Ros F. Feature and Dimensionality Reduction for Clustering...Deep Learning 2024.pdf | 3.63 MiB |