Christophe Gaillac

I am an Associate Professor at the University of Geneva (IEE/GSEM).

I am also a Research Affiliate at CREST-ENSAE and the Toulouse School of Economics.

I was a Postdoctoral Fellow at Nuffield College, University of Oxford, after receiving my PhD from the Toulouse School of Economics and studying at Ecole Polytechnique, La Sorbonne, and ENSAE Paris. You can find my CV here.

My research focuses on Econometrics, Statistics, Machine Learning, and Labor Economics.

Contact: christophe.gaillac@unige.ch

BOOK

"Machine Learning for Econometrics"

With Jérémy L'hour, we published a translated and enriched version of the French book "Machine Learning pour l'économétrie" with Oxford University Press at the end of May 2025!

You can order it online: Paperback, Hardcover or Ebook/Chapters at OUP or from Amazon.

Description and Table of Contents

Machine Learning for Econometrics is a book for economists seeking to grasp modern machine learning techniques - from their predictive performance to the revolutionary handling of unstructured data - in order to establish causal relationships from data.

The volume covers automatic variable selection in various high-dimensional contexts, estimation of treatment effect heterogeneity, natural language processing (NLP) techniques, as well as synthetic control and macroeconomic forecasting. The foundations of machine learning methods are introduced to provide both a thorough theoretical treatment of how they can be used in econometrics and numerous economic applications, and each chapter contains a series of empirical examples, programs, and exercises to facilitate the reader's adoption and implementation of the techniques.

Table of Contents

1: Introduction

Part I. Statistics and Econometrics Prerequisites

2: Statistical tools

3: Causal inference

Part II. High-dimension and variable selection

4: Post-selection inference

5: Generalization and methodology

6: High dimension and endogeneity

7: Going further

Part III. Treatment effect heterogeneity

8: Inference on heterogeneous effects

9: Optimal policy learning

Part IV. Aggregated data and macroeconomic forecasting

10: The synthetic control method

11: Forecasting in high-dimension

Part V. Textual data

12: Working with text data

13: Word embeddings

14: Modern language models

Part VI. Exercises

15: Exercises

Bibliography

Index

Our textbook with Jérémy L'hour on machine learning methods for econometrics, "Machine Learning pour l'économétrie" (in French), was released in October 2023 by Economica.

You can order it online.

Keywords: High-Dimension, Variable Selection, Post-Selection Inference, Methodology, Endogeneity, Synthetic Control Method, Heterogeneous Treatment Effects, Policy Evaluation, Text Data, Natural Language Processing.

Machine Learning for Econometrics is a book for economists who want to understand modern machine learning techniques, from their predictive power to their revolutionary processing of unstructured data, to infer causal relationships from data.

It covers automatic variable selection in various high-dimensional contexts, heterogeneity estimation of treatment effects, natural language processing (NLP) techniques, and synthetic control and macroeconomic forecasting.

The fundamentals of machine learning methods are presented in such a way as to provide both an in-depth theoretical treatment of their use in econometrics and numerous economic applications. Each chapter includes a series of empirical examples, programs, and exercises to facilitate the reader's adoption and implementation of the techniques.

This book is aimed at Master's and Grandes Ecoles students, researchers, and practitioners who want to understand and perfect their knowledge of machine learning and apply it in a context traditionally reserved for econometrics.

PUBLICATIONS

Partially Linear Models under Data Combination, with Xavier D’Haultfoeuille (CREST) and Arnaud Maurel (Duke University). Review of Economic Studies, 92(1), 238-267 (2025).

Keywords: Partially Linear Model; Data combination; Partial Identification; Intergenerational Mobility.

R Package available on CRAN: RegCombin and vignette with several simulated and real examples.

We study partially linear models when the outcome of interest and some of the covariates are observed in two different datasets that cannot be linked. This type of data combination problem arises very frequently in empirical microeconomics. Using recent tools from optimal transport theory, we derive a constructive characterization of the sharp identified set. We then build on this result and develop a novel inference method that exploits the specific geometric properties of the identified set. Our method performs well in finite samples while remaining very tractable. We apply our approach to study intergenerational income mobility over the period 1850-1930 in the United States. Our method allows us to relax the exclusion restrictions used in earlier work, while delivering confidence regions that are informative.

Adaptive estimation in the linear random coefficients model when regressors have limited variation, with Eric Gautier (TSE), Bernoulli, 28 (1): 504 - 524, (2022).

Keywords: Adaptation, Ill-posed Inverse Problem, Minimax, Random Coefficients.

R Package archive available on CRAN: RandomCoefficients and vignette.

Summary: We consider a linear model where the coefficients - intercept and slopes - are random with a distribution in a nonparametric class and independent from the regressors. The main drawback of this model is that identification usually requires the regressors to have a support which is the whole space. This is rarely satisfied in practice. In this paper, instead, the regressors can have a support which is a proper subset. This is possible by assuming that the slopes do not have heavy tails. Lower bounds on the supremum risk for the estimation of the joint density of the random coefficients are derived for this model and a related white noise model. We present an estimator, its rates of convergence, and a data-driven rule which delivers adaptive estimators.

R Package: RandomCoefficients. This package implements the estimator proposed in Gaillac and Gautier (2019). The estimator is based on prolate spheroidal wave functions, which RandomCoefficients computes efficiently following Osipov, Rokhlin, and Xiao (2013). The package also provides a parallel implementation of the estimator.

Rationalizing Rational Expectations? Characterization and Tests, with Xavier D’Haultfoeuille (CREST) and Arnaud Maurel (Duke University), Quantitative Economics, 12 (3): 817-842 (2021).

Keywords: Rational expectations, Test, Subjective expectations, Data combination.

R Package available on CRAN: RationalExp and vignette.

Summary: In this paper, we build a new test of rational expectations based on the marginal distributions of realizations and subjective beliefs. This test is widely applicable, including in the common situation where realizations and beliefs are observed in two different datasets that cannot be matched. We show that whether one can rationalize rational expectations is equivalent to the distribution of realizations being a mean-preserving spread of the distribution of beliefs. The null hypothesis can then be rewritten as a system of many moment inequality and equality constraints, for which tests have been recently developed in the literature. The test is robust to measurement errors under some restrictions and can be extended to account for aggregate shocks. Finally, we apply our methodology to test for rational expectations about future earnings. While individuals tend to be right on average about their future earnings, our test strongly rejects rational expectations.
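The mean-preserving-spread characterization above can be illustrated with a small sketch. It is not the test implemented in RationalExp and it ignores sampling uncertainty: it only checks, on two hypothetical unmatched samples, the standard convex-order conditions (equal means, and E[(Y - t)^+] >= E[(psi - t)^+] at every threshold t). The function names, the tolerance, and the toy data are assumptions made for illustration.

```python
def mean(values):
    """Sample mean of a list of numbers."""
    return sum(values) / len(values)


def expected_excess(values, t):
    """Empirical E[(X - t)^+] for a sample of X."""
    return sum(max(v - t, 0.0) for v in values) / len(values)


def is_mean_preserving_spread(realizations, beliefs, tol=1e-9):
    """Check whether the empirical distribution of realizations is a
    mean-preserving spread of the empirical distribution of beliefs.

    Purely descriptive check (no inference), assumed for illustration.
    """
    # Condition 1: both samples must have the same mean.
    if abs(mean(realizations) - mean(beliefs)) > tol:
        return False
    # Condition 2: E[(Y - t)^+] >= E[(psi - t)^+] for all t.
    # Both sides are piecewise linear in t with kinks at sample points,
    # so with equal means it suffices to check the pooled sample points.
    thresholds = sorted(set(realizations) | set(beliefs))
    return all(
        expected_excess(realizations, t) >= expected_excess(beliefs, t) - tol
        for t in thresholds
    )


# Hypothetical data: the two samples need not be matched or of equal size.
beliefs = [1.0, 2.0, 3.0]
realizations = [0.0, 2.0, 1.0, 3.0, 2.0, 4.0]  # each belief plus or minus 1

print(is_mean_preserving_spread(realizations, beliefs))  # compatible with rational expectations
print(is_mean_preserving_spread(beliefs, realizations))  # roles swapped: condition fails
```

In this toy example the realizations are the beliefs plus mean-zero noise, so the first check passes, while swapping the roles of the two samples violates the second condition.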

R Package: RationalExp. This package implements a test of the rational expectations hypothesis based on the marginal distributions of realizations and subjective beliefs. The package also computes the estimator of the minimal deviations from rational expectations that can be rationalized by the data. R and the package RationalExp are open-source software projects and can be freely downloaded from CRAN: http://cran.r-project.org

Estimates for the SVD of the truncated Fourier transform on L2(exp(b|.|)) and stable analytic continuation, with Eric Gautier (TSE), Journal of Fourier Analysis and Applications (2021) 27:72.

Keywords: Analytic continuation, Nonbandlimited functions, Heavy tails, Uniform estimates, Extrapolation, Singular value decomposition, Truncated Fourier transform, Singular Sturm Liouville Equations, Superresolution.

Summary: The Fourier transform truncated on [-c,c] is usually analyzed when acting on L2(-1/b,1/b) and its right-singular vectors are the prolate spheroidal wave functions. This paper considers the operator acting on the larger space L2(exp(b|.|)) on which it remains injective. We give nonasymptotic upper and lower bounds on the singular values with similar qualitative behavior in m (the index), b, and c. The lower bounds are used to obtain rates of convergence for stable analytic continuation of possibly nonbandlimited functions whose Fourier transform belongs to L2(exp(b|.|)). We also derive bounds on the sup-norm of the singular functions. Finally, we propose a numerical method to compute the SVD and apply it to stable analytic continuation when the function is observed with error on an interval.

ML CONFERENCE PROCEEDINGS

Fairness in job recommendations: estimating, explaining, and reducing gender gaps, by G. Bied, C. Gaillac, M. Hoffmann, P. Caillou, B. Crépon, S. Nathan, M. Sebag, Proceedings of ECAI-workshop AEQUITAS 2023.

Presentation at the French Ministry of Labor (DARES) available here.

Keywords: Fairness, Job recommender systems, Adversarial de-biasing, Gender gaps, Human resources

Algorithmic recommendations of job ads have the potential to reduce frictional unemployment, but raise concerns about fairness due to biases in past data. Our research investigates the issue of algorithmic fairness with a specific focus on gender in a hybrid job recommendation system developed in partnership with the French Public Employment Service (PES), which is trained on past hires. First, by viewing job ads as a set of characteristics (such as wage and contract type), we document how the algorithm treats job seekers differently based on gender, both unconditionally and conditionally on their search parameters and qualifications. Second, we discuss the notion(s) of algorithmic fairness applicable in this context and the trade-offs involved. We show that the considered system reflects some existing differences in hiring or applications but does not exacerbate them. Finally, we consider an adversarial de-biasing technique as a practical tool to demonstrate the trade-offs between recall and reduced differentiated treatment.

Toward Job Recommendation for All, by Guillaume Bied, Solal Nathan, Elia Perennes, Morgane Hoffmann, Philippe Caillou, Bruno Crépon, Christophe Gaillac and Michèle Sebag, Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI), AI for Good. Pages 5906-5914. Also presented at ECML PKDD, AI4HR Workshop, 2023.

Keywords: Job recommender systems, E-recruitment, Sparse data, Matching, Fairness, RCT

This paper presents a job recommendation algorithm designed and validated in the context of the French Public Employment Service. The challenges, owing to the confidential data policy, relate to the extreme sparsity of the interaction matrix and the mandatory scalability of the algorithm, which aims to deliver recommendations to millions of job seekers in quasi real-time, considering hundreds of thousands of job ads. The experimental validation of the approach shows similar or better performance than the state of the art in terms of recall, with a gain in inference time of 2 orders of magnitude. The study includes some fairness analysis of the recommendation algorithm. The gender-related gap is shown to be statistically similar in the true data and in the counterfactual data built from the recommendations.

Congestion-Avoiding Job Recommendation with Optimal Transport, Guillaume Bied, Elia Perennes, Victor Alfonso Naya, Philippe Caillou, Bruno Crépon, Christophe Gaillac, Michele Sebag, FEAST workshop ECML-PKDD 2021, Sep 2021, Bilbao, Spain.

Keywords: Job recommender systems, Congestion, Matching, Optimal Transport

WORKING PAPERS

Linear Regressions with Combined Data, with Xavier D’Haultfoeuille (CREST) and Arnaud Maurel (Duke University).

Keywords: Best linear prediction; data combination; partial identification; inference.

We study best linear predictions in a context where the outcome of interest and some of the covariates are observed in two different datasets that cannot be matched. Traditional approaches obtain point identification by relying, often implicitly, on exclusion restrictions. We show that without such restrictions, coefficients of interest can still be partially identified and we derive a constructive characterization of the sharp identified set. We then build on this characterization to develop computationally simple and asymptotically normal estimators of the corresponding bounds. We show that these estimators exhibit good finite-sample performance.

Predicting Unobserved Individual-level Causal Effects

I introduce a new method to predict individual-level heterogeneity in the causal effect of a variable, conditional on the latter but also on the observed outcome.

I show how to identify these "posterior effects", then derive tractable estimators in various empirical contexts.

In an example application, they reveal substantial variation in the effects of teachers’ knowledge of the program on their performance and could substantially improve the cost-effectiveness of training programs.

Keywords: Empirical Bayes, teacher’s value-added, random coefficients, optimal transport, generalized Tweedie’s formula, voting analysis, inverse problem.

R Package RegPE available soon.

Accurately measuring heterogeneous effects is key for the design of efficient public policies. This paper considers the prediction of unobserved individual-level causal effects in linear random coefficients models, conditional on all the available data. In the application I consider, these "posterior effects" are the average effects of teachers' knowledge on their students' performance, conditional on both variables. I derive two strategies for recovering these posterior effects nonparametrically, assuming independence between the effects and the covariates. The first strategy recovers the distribution of the random coefficients by a minimum distance approach, and then obtains the posterior effects from this distribution. The corresponding estimator can be computed using an optimal transport algorithm. The second approach, which is only valid for continuous regressors, expresses the posterior effects directly as a function of the data. The corresponding estimator is rate optimal. I discuss several extensions, in particular the relaxation of the independence condition. Finally, the application reveals large heterogeneity in the effect of teacher knowledge, suggesting that we could substantially improve the cost-effectiveness of their training.

Designing Labor Market Recommender Systems: How to Improve Human-based Search, with Guillaume Bied (CREST-LISN), Philippe Caillou (LISN), Bruno Crépon (CREST), Elia Perennes (CREST) and Michele Sebag (LISN).

Keywords: Job Recommender Systems, Two-sided Markets, Value Misalignment.

This paper questions the design of job recommender systems (RS) and their potential to enhance job search. We argue that RS should align with a rational version of job seekers' objectives. Policy makers thus need to combine hiring probabilities and job seekers' utilities, both of which are challenging to estimate. Otherwise, our empirical findings underscore that even state-of-the-art machine learning RS may not enhance job seekers' outcomes. We address three key dimensions of RS: the differences between algorithms, the optimal objective they should pursue, and the needs of job seekers. Our results highlight the value of RS in revealing unexplored opportunities.

Designing Labor Market Recommender Systems: the Importance of Job Seeker Preferences and Competition, with Guillaume Bied (CREST-LISN), Philippe Caillou (LISN), Bruno Crépon (CREST), Elia Perennes (CREST) and Michele Sebag (LISN).

Keywords: Job Recommender Systems, Two-sided Market, Congestion, Optimal Transport.

This paper questions the design of job recommender systems (RS). A direct application of sophisticated Machine Learning (ML) algorithms to build recommendations, such as identifying offers most likely to lead to a job from the prediction of successful matches, does not necessarily lead to an improvement in the situation of job seekers. This is because the objectives of these recommendations do not align with those of the job seekers, and because the recommendations are usually generated independently of each other, without taking competition into account. Using a theoretical model of a two-sided market with an application stage, we show that the ML tools from which the recommendations are directly derived can be more usefully mobilized to identify quantities that job seekers might have difficulty accessing. Our empirical analysis confirms these insights using the RS designed within a long-term project we are conducting with the French Public Employment Service (Pôle Emploi), which leverages rich and detailed data on applicants, firms, and past job searches. It illustrates that RS based solely on the chances of being hired or on the utility of the jobs are dominated by ones that mix the two dimensions to come closer to the expected utility. We also discuss how RS can avoid increasing congestion by using a collective objective rather than an individual one to generate the recommendations, using optimal transport to make it tractable.
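The contrast between an individual and a collective objective can be sketched on a toy example. This is not the algorithm used in the paper: the score matrix is hypothetical, and a brute-force search over one-to-one assignments stands in for an optimal transport solver (with equal numbers of job seekers and ads and uniform weights, a discrete optimal transport problem has a solution of this one-to-one form).

```python
from itertools import permutations

# Hypothetical match scores: rows are job seekers, columns are job ads.
scores = [
    [0.9, 0.5, 0.1],
    [0.8, 0.6, 0.2],
    [0.7, 0.3, 0.4],
]


def individual_recommendations(scores):
    """Send each job seeker to their own best ad, ignoring competition."""
    return [max(range(len(row)), key=row.__getitem__) for row in scores]


def collective_recommendations(scores):
    """One-to-one assignment maximizing the total score.

    Brute force over permutations: only viable for tiny examples.
    """
    n = len(scores)
    best = max(
        permutations(range(n)),
        key=lambda p: sum(scores[i][p[i]] for i in range(n)),
    )
    return list(best)


individual = individual_recommendations(scores)
collective = collective_recommendations(scores)

print("individual:", individual)  # every job seeker is sent to ad 0
print("collective:", collective)  # each ad receives exactly one job seeker
```

With these scores, the individual rule concentrates all three job seekers on the same ad, whereas the collective rule spreads them across ads at a small cost in individual scores.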

Non Parametric Classes for Identification in Random Coefficients Models when Regressors have Limited Variation, with Eric Gautier (TSE).

Keywords: Random Coefficients, Quasi-analyticity, Deconvolution, Identification.

Summary: This paper studies point identification of the distribution of the coefficients in some random coefficients models with exogenous regressors when their support is a proper subset, possibly discrete but countable. We exhibit trade-offs between restrictions on the distribution of the random coefficients and the support of the regressors. We consider linear models including those with nonlinear transforms of a baseline regressor, with an infinite number of regressors and deconvolution, the binary choice model, and panel data models such as single-index panel data models and an extension of the Kotlarski lemma.

OTHER SOFTWARE PROGRAMS

Stata program mfelogit and the vignette with Xavier D’Haultfoeuille (CREST), Laurent Davezies (CREST), and Louise Laage (Georgetown University), associated with their paper Identification and Estimation of Average Marginal Effects in Fixed Effect Logit Models.

Install it by typing: ssc install mfelogit

Keywords: Fixed effects logit models, Panel Data, Partial Identification.

mfelogit implements the estimators of the sharp bounds on the AME and the related confidence intervals on the AME and ATE from Davezies et al. (DDL hereafter). It also implements the second method proposed in DDL, which is faster to compute but may result in larger confidence intervals. When the covariate is binary, the command computes the ATE; otherwise it computes the AME.

R Package MarginalFElogit and the vignette with Xavier D’Haultfoeuille (CREST), Laurent Davezies (CREST), and Louise Laage (Georgetown University), associated with their paper Identification and Estimation of Average Marginal Effects in Fixed Effect Logit Models.

Keywords: Fixed effects logit models, Panel Data, Partial Identification.

This package implements the estimators of the sharp bounds on the AME and the related confidence intervals on the AME and ATE from Davezies et al. (DDL hereafter). It also implements the second method proposed in DDL, which is faster to compute but may result in larger confidence intervals. When the covariate is binary, the package computes the ATE; otherwise it computes the AME.

TEACHING

Lectures:

Machine Learning for Econometrics (PhD course, 13-16 Nov. 2024), University of Luxembourg, DSEFM.

Econometrics, BA (Fall 2024), University of Geneva, GSEM

High-dimensional econometrics at the Fime Lab Summer School on Big Data & Finance, 12-16 June 2023.

Advanced econometrics: forecasting (2023), Saïd Business School, Oxford, on High-Dimensional Macroeconomic Forecasting.

Machine Learning for Econometrics (2018, 2019, 2020), ENSAE Paris and Institut Polytechnique de Paris (previously "High-Dimensional Econometrics"), joint with Jérémy L'hour (INSEE, CREST) and Bruno Crépon (CREST).

Mathematics for Economists (Analysis and Optimisation) (2018), Master in Economics, Paris-Saclay University, PhD track

Mathematics for Economists (2017, 2018), Sciences-Po Paris, PhD track

Mathematics for Economists (2018), ENSAE Paris, Specialised Master

Algebra and Python (2018), HEC Paris and ENSAE Paris, Undergraduate.

TA sessions:

Advanced Econometrics (2024), University of Oxford, Bent Nielsen and Martin Weidner

Advanced Econometrics (2021-2023), University of Oxford, Anders Kock and Martin Weidner

Statistics 1 (2017-2018), ENSAE Paris, Nicolas Chopin

Numerical Analysis (2016-2018), ENSAE Paris, Cristina Butucea

Econometrics 2 (2017-2018), ENSAE Paris, Xavier D’Haultfoeuille

Simulations and Monte-Carlo (2018), ENSAE Paris, Nicolas Chopin

Time Series Analysis (2015-2017), ENSAE Paris, Christian Francq
