Rethinking Variable Importance in Machine Learning

Date:

Share post:


We study which firm characteristics drive the economic value of machine learning portfolios. Three results stand out. First, in-sample variable importance overfits and provides little reliable guidance, highlighting the need for out-of-sample evaluation using economic criteria. Second, conventional models are dominated by microcaps, which inflate returns and concentrate gains in costly-to-trade stocks; excluding microcaps is essential for meaningful inference. Third, some predictors carry negative importance and consistently degrade performance; removing them improves risk-adjusted returns and clarifies which characteristics matter. These findings show that only with economic restrictions can machine learning deliver robust asset pricing insights.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Related articles

Trump says he is nominating former Oklahoma state trooper Lance Schroyer as ICE director

President Donald Trump on Saturday said he is nominating Lance Schroyer, a former Oklahoma state trooper, as...

Don’t Blame Indexing for Your Problems

Has the rise of passive investing broken the stock market? Is the level of passive ownership too...

1 Number MercadoLibre Investors Need to See

MercadoLibre (MELI +3.59%) has slumped over the last year, and it's clear why. The company's profits have fallen...