Description
Recent technological advances in molecular biology have given rise to numerous large scale datasets whose analysis have risen serious methodological challenges mainly relating to the size and complex structure of the data. Considerable experience has been gained over the past decade, mainly in genetics, from the Genome-Wide Association Study (GWAS) era, and more recently in transcriptomics and metabolomics. Building upon the corresponding wide literature, we present methods used to analyze OMICS data within each of the three main types of approaches : univariate models, dimension reduction techniques, and variable selection models. We focus on methods for which available ready-to-use packages are available. In this context, we propose R2GUESS an R package which interface a C++ implementation of a fully sparse Bayesian variable selection (BVS) approach for linear regression that can analyze single and multiple responses in a integrated way. A simulation and an illustration in the context of GWAS is presented and show the performance of the BVS approach.
Network inference methods based upon sparse Gaussian Graphical Models (GGM) have recently emerged as a promising exploratory tool in genomics. They give a sounded representation of direct relationships between genes and are accompanied with sparse inference strategies well suited to the high...
Published 05/16/13
When an unbiased estimator of the likelihood is used within an Markov chain Monte Carlo (MCMC) scheme, it is necessary to tradeoff the number of samples used against the computing time. Many samples for the estimator will result in a MCMC scheme which has similar properties to the case where the...
Published 05/16/13