Rethinking Variable Importance in Machine Learning


We study which firm characteristics drive the economic value of machine learning portfolios. Three results stand out. First, in-sample variable importance overfits and provides little reliable guidance, highlighting the need for out-of-sample evaluation using economic criteria. Second, conventional models are dominated by microcaps, which inflate returns and concentrate gains in costly-to-trade stocks; excluding microcaps is essential for meaningful inference. Third, some predictors carry negative importance and consistently degrade performance; removing them improves risk-adjusted returns and clarifies which characteristics matter. These findings show that only with economic restrictions can machine learning deliver robust asset pricing insights.
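To make the three points concrete, here is a minimal sketch of out-of-sample variable importance measured with an economic criterion. It is not the authors' procedure. Everything in it is an assumption chosen for illustration: the synthetic data, the pooled linear model standing in for a machine learning model, the decile long-short portfolio, the per-period Sharpe ratio as the criterion, and the microcap cutoff at the 20th size percentile of the sample. Importance for a characteristic is defined here as the test-period Sharpe ratio of the full model minus that of a model refit without the characteristic, so a negative value flags a predictor whose removal improves performance.

```python
import numpy as np

def long_short_sharpe(pred, ret, frac=0.1):
    """Per-period Sharpe ratio of a top-minus-bottom portfolio sorted on pred.

    pred, ret: arrays of shape (T, N), periods by stocks.
    """
    T, N = pred.shape
    k = max(1, int(N * frac))
    order = np.argsort(pred, axis=1)           # ascending by predicted return
    rows = np.arange(T)[:, None]
    short_leg = ret[rows, order[:, :k]].mean(axis=1)
    long_leg = ret[rows, order[:, -k:]].mean(axis=1)
    spread = long_leg - short_leg
    return spread.mean() / spread.std(ddof=1)

def fit_ols(X, y):
    """Pooled OLS with intercept; X has shape (T, N, K), y has shape (T, N)."""
    K = X.shape[-1]
    A = np.column_stack([np.ones(y.size), X.reshape(-1, K)])
    beta, *_ = np.linalg.lstsq(A, y.reshape(-1), rcond=None)
    return beta

def predict(beta, X):
    return beta[0] + X @ beta[1:]

def oos_sharpe(X_tr, y_tr, X_te, y_te):
    """Fit on the training window, score the portfolio on the test window."""
    beta = fit_ols(X_tr, y_tr)
    return long_short_sharpe(predict(beta, X_te), y_te)

# Synthetic panel (hypothetical): only characteristic x0 carries signal.
rng = np.random.default_rng(0)
T, N, K = 120, 200, 3
X = rng.standard_normal((T, N, K))
ret = 0.05 * X[..., 0] + 0.1 * rng.standard_normal((T, N))
size = rng.lognormal(size=N)                   # stand-in for market cap

# Exclude microcaps: assumed here to be the smallest 20% of stocks by size.
keep = size > np.quantile(size, 0.2)
X, ret = X[:, keep], ret[:, keep]

# Chronological split: first half for fitting, second half for evaluation.
split = T // 2
X_tr, y_tr, X_te, y_te = X[:split], ret[:split], X[split:], ret[split:]

base = oos_sharpe(X_tr, y_tr, X_te, y_te)
importance = {}
for j in range(K):
    # Refit without characteristic j and compare test-period Sharpe ratios.
    X_tr_j = np.delete(X_tr, j, axis=-1)
    X_te_j = np.delete(X_te, j, axis=-1)
    importance[f"x{j}"] = base - oos_sharpe(X_tr_j, y_tr, X_te_j, y_te)

# Characteristics with negative importance are candidates for removal.
to_drop = [name for name, value in importance.items() if value < 0]
```

In this toy setup the importance of `x0` is large and positive, while the noise characteristics land near zero with either sign. A real application would recompute the size filter each period, typically against exchange-based breakpoints, and would account for trading costs when scoring the portfolio.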
