06 - Dimensionality reduction : from PCA to recent nonlinear techniques ( John Lee (Université Catholique de Louvain, Belgique))
Listen now
Description
Dimensionality reduction is an old yet unsolved problem, with many applications in data visualization, knowledge discovery, and machine learning in general. Our aim in this talk will be to review several developments in the field of dimensionality reduction, with a particular focus on nonlinear methods. As an introduction, we will point out some weird properties of high dimensional spaces, which will motivate the use of dimensionality reduction. Next, we will go back in time and start our review with a short reminder about well known techniques such as principal component analysis and multidimensional scaling. Our travel into time will also bring us to visit Sammon mapping and other methods based on distance preservation. Next, we will come across self-organizing maps and auto-encoders with bottleneck neural networks. Some spectral methods such as Isomap and locally linear embedding will be reviewed as well. A glance at recent methods based on similarity preservation such as stochastic neighbor embedding will close the survey. Finally, we will try to identify the relationships between the different approaches, and say a few words about quality criteria for dimensionality reduction techniques. Vous pouvez entendre l'intervention, tout en visualisant le Power Point, en cliquant sur ce lien : http://epn.univ-paris1.fr/modules/UFR27semSAMOS/SemSamos20090529L_ee/SemSamos20090529_Lee.html. Ecouter l'intervention : Bande son disponible au format mp3 Durée : 1H14
More Episodes
Résumé : Nous rappelons les bases de l'approche du problème de la classification non supervisée par les modèles de mélange. La méthode usuelle repose sur le maximum de vraisemblance et le choix du nombre de classes à former se fait par des critères pénalisés. Nous nous intéressons...
Published 10/02/09
Dans ce travail nous utilisons des méthodes de régularisation pour la résolution de problèmes inverses mal posés. La principale difficulté dans l'application d'une méthode de régularisation est la détermination du paramètre de régularisation. L'approche choisie dans ce travail s'appuie...
Published 02/07/09