4.1 Data-driven penalties: heuristics, results and thoughts... (Pascal Massart)
The idea of selecting a model by penalizing a log-likelihood-type criterion goes back to the early seventies, with the pioneering works of Mallows and Akaike. The literature contains many consistency results for such criteria, but these results are asymptotic: one deals with a fixed collection of models and lets the number of observations tend to infinity. In recent years, a non-asymptotic theory for this type of criterion has been developed, allowing both the size and the number of models to depend on the sample size. For these methods to be practically relevant, it is desirable to have a precise expression for the penalty terms on which the penalized criteria are based. We will discuss some heuristics for designing data-driven penalties, review some new results, and discuss some open problems.
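As a point of reference for the penalized criteria the talk starts from, here is a minimal sketch of the classical fixed-penalty case: Akaike's AIC applied to nested polynomial regression models. The function name, the toy data, and the choice of candidate models are illustrative assumptions, not taken from the talk; the data-driven penalties discussed in the abstract would replace the fixed `2 * (d + 1)` term below with a calibrated one.

```python
import numpy as np

def aic_select_degree(x, y, max_degree=8):
    """Select a polynomial degree by minimizing an AIC-style penalized criterion.

    For each candidate degree d, the criterion is
        n * log(RSS_d / n) + 2 * (d + 1),
    i.e. a Gaussian log-likelihood term plus a penalty proportional
    to the model dimension (d + 1 coefficients).
    """
    n = len(y)
    crit = []
    for d in range(max_degree + 1):
        coeffs = np.polyfit(x, y, d)           # least-squares fit of degree d
        resid = y - np.polyval(coeffs, x)
        rss = float(resid @ resid)             # residual sum of squares
        crit.append(n * np.log(rss / n) + 2 * (d + 1))
    return int(np.argmin(crit)), crit

# Toy data: a quadratic signal plus Gaussian noise.
rng = np.random.default_rng(0)
x = np.linspace(-1.0, 1.0, 200)
y = 1.0 + 2.0 * x - 1.5 * x**2 + rng.normal(0.0, 0.3, size=x.size)

best_degree, criteria = aic_select_degree(x, y)
```

The point of the non-asymptotic theory sketched in the abstract is that a fixed penalty like `2 * (d + 1)` need not be adequate when the number and size of the candidate models grow with the sample size; that is where data-driven calibration of the penalty comes in.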