The art of data science via Mondrian Forests
Probabilités et Statistique
The recent and ongoing digital world expansion now allows anyone to have access to a tremendous amount of information. However collecting data is not an end in itself and thus techniques must be designed to gain in-depth knowledge from these large data bases.
This has led to a growing interest for statistics, as a tool to find patterns in complex data structures, and particularly for turnkey algorithms which do not require specific skills from the user.
Such algorithms are quite often designed based on a hunch without any theoretical guarantee. Indeed, the overlay of several simple steps (as in random forests or neural networks) makes the analysis more arduous. Nonetheless, the theory is vital to give assurance on how algorithms operate thus preventing their outputs to be misunderstood.
In this talk, we analyze a stylized version of random forest called Mondrian Forests and prove that it reaches minimax rates of consistency for Lipschitz and twice differentiable regression functions. This is the first result showing the optimality of a particular random forest algorithm in arbitrary dimension. We will also elaborate on the importance of aggregation in the forest.
- Accueil
- Annuaire
- Equipes
- Evènements
- Congrès
- Invités
- Séminaires, Groupes de Travail et Colloquium
- Séminaires
- Analyse Complexe et Equations Différentielles
- Analyse Fonctionnelle
- Analyse Numérique et Equations Aux Dérivées Partielles
- Arithmétique
- Formes Automorphes
- Géométrie Algébrique
- Géométrie des espaces singuliers
- Géométrie Dynamique
- Histoire des Mathématiques
- Physique Mathématique
- Probabilités et Statistique
- Singularités et Applications
- Théorie Analytique et Analyse Harmonique
- Topologie
- Colloquium
- Groupes de Travail
- Analyse harmonique et théorie analytique
- Autour des fractales
- Calcul de Malliavin et processus fractionnaires
- Déformations des singularités de surfaces
- Equations aux dérivées partielles
- Extraction du signal
- Fondements mathématiques du deep learning
- Géométrie Non-Archimédienne
- Géométrie Stochastique
- Idéaux de Hodge
- Leçons d'Analyse
- Matrices Aléatoires
- Probabilités
- Statistique et Grande Dimension
- Systèmes Dynamiques
- Topologie
- W-algèbres
- Doctorants et Post-doctorants
- Séminaires
- Soutenances
- Anciens Séminaires et Groupes de Travail
- Formation par la Recherche
- Laboratoire
- Liens utiles
- Projets
- Recrutements
- Services