TY - JOUR
T1 - Reconstructing production networks using machine learning
AU - Mungo, Luca
AU - Lafond, François
AU - Astudillo-Estévez, Pablo
AU - Farmer, J. Doyne
N1 - Publisher Copyright:
© 2023
PY - 2023/3
Y1 - 2023/3
N2 - The vulnerability of supply chains and their role in the propagation of shocks has been highlighted multiple times in recent years, including by the recent pandemic. However, while the importance of micro data is increasingly recognised, data at the firm-to-firm level remains scarcely available. In this study, we formulate supply chain networks’ reconstruction as a link prediction problem and tackle it using machine learning, specifically Gradient Boosting. We test our approach on three different supply chain datasets and show that it works very well and outperforms three benchmarks. An analysis of features’ importance suggests that the key data underlying our predictions are firms’ industry, location, and size. To evaluate the feasibility of reconstructing a network when no production network data is available, we attempt to predict a dataset using a model trained on another dataset, showing that the model's performance, while still better than a random predictor, deteriorates substantially.
AB - The vulnerability of supply chains and their role in the propagation of shocks has been highlighted multiple times in recent years, including by the recent pandemic. However, while the importance of micro data is increasingly recognised, data at the firm-to-firm level remains scarcely available. In this study, we formulate supply chain networks’ reconstruction as a link prediction problem and tackle it using machine learning, specifically Gradient Boosting. We test our approach on three different supply chain datasets and show that it works very well and outperforms three benchmarks. An analysis of features’ importance suggests that the key data underlying our predictions are firms’ industry, location, and size. To evaluate the feasibility of reconstructing a network when no production network data is available, we attempt to predict a dataset using a model trained on another dataset, showing that the model's performance, while still better than a random predictor, deteriorates substantially.
KW - Link prediction
KW - Machine learning
KW - Network reconstruction
KW - Supply chains
UR - http://www.scopus.com/inward/record.url?scp=85148358068&partnerID=8YFLogxK
U2 - 10.1016/j.jedc.2023.104607
DO - 10.1016/j.jedc.2023.104607
M3 - Artículo
AN - SCOPUS:85148358068
SN - 0165-1889
VL - 148
JO - Journal of Economic Dynamics and Control
JF - Journal of Economic Dynamics and Control
M1 - 104607
ER -