EA

Article http://dx.doi.org/10.26855/ea.2026.03.011

Constructing a CMBS Default Risk Sentiment Index Using Financial Text Embeddings and Evaluating Its Predictive Effectiveness

TOTAL VIEWS: 487

Jingzhi Yin

Department of Mathematics, Columbia University, New York, NY 10017, USA.

*Corresponding author: Jingzhi Yin

Published: March 31,2026

Abstract

In the commercial real estate finance system, the timely identification of CMBS default risk is of critical importance for maintaining financial stability and preventing systemic risk. With advances in information technology and declining data acquisition costs, unstructured financial text data have accumulated rapidly, and textual information exhibits advantages that traditional structured data cannot easily replace in reflecting changes in market sentiment, the evolution of risk perceptions, and potential default signals. Focusing on financial text embedding techniques, this study systematically characterizes risk sentiment features related to CMBS defaults and, based on large-scale financial text data, constructs a CMBS default risk sentiment index that reflects market risk expectations. The index is then incorporated into a default risk prediction framework and evaluated using multiple econometric models to empirically examine its role in explaining CMBS default behavior and enhancing default risk predictive performance. The results show that risk sentiment information extracted through financial text embeddings effectively complements traditional structured indicators based on asset characteristics, tranche structure, and macroeconomic variables, exhibiting sig-nificant predictive advantages across various model specifications while maintaining strong stability in out-of-sample forecasts. Further robustness and extension analyses confirm the reliability and generalizability of the findings. This study provides a new quantitative perspective for the forward-looking identification of CMBS default risk and offers valuable insights into the application of financial text analysis methods in structured credit risk research and macroprudential regulatory practice.

Keywords

Text embedding; Default risk; Sentiment index; Risk prediction

References

[1] Amrik S, OJW. Appraisal bias in the lodging sector: evidence from CMBS transactions. Int J Hosp Manag. 2022;106:103451.

https://doi.org/10.1016/j.ijhm.2022.103451

[2] Trivedi A, Sangeetha S. Enhancing neural network predictions with finetuned numeric embeddings for stock trend forecasting. Soft Comput. 2025;29(3):1739-54.
https://doi.org/10.1007/s00500-024-10525-w

[3] M DG. Characterizing accident narratives with word embeddings: improving accuracy, richness, and generalizability. J Saf Res. 2022;80:441-55.
https://doi.org/10.1016/j.jsr.2021.12.013

[4] Gawade MS, Lakhani R, Patil Y. A descriptive study to measure the reliability of Braden scale score calculated by clinical nurses and evaluate its predictive value for pressure ulcer risk among ICU patients. J Pharm Res Int. 2021;33(58B):57-63.

https://doi.org/10.9734/jpri/2021/v33i58B34205

[5] Arshid E, Azimi M, Moradi M, et al. Mathematical solution for vibrational response of shear and normal deformable advanced metal foam CMBS treated with nanocomposite actuators. Int J Struct Stab Dyn. 2024;25(11):2550118.

https://doi.org/10.1142/S0219455425501184

[6] Filho RJI, Gôlo SPM, Marcacini MR, et al. How do financial time series enhance the detection of news significance in market movements? A study using graph neural networks with heterogeneous representations. Neural Comput Appl. 2024;37(3):1-13.

https://doi.org/10.1007/s00521-024-10418-5

[7] Yeh H, Yeh Y, Shen D. Word vector models approach to text regression of financial risk prediction. Symmetry. 2020;12(1):89.

https://doi.org/10.3390/sym12010089

[8] Wang W, Miaomiao L, Xuanling D, et al. Binary wavelet transform-based financial text image authentication algorithm. Autom Control Comput Sci. 2024;58(3):326-35.
https://doi.org/10.3103/S0146411624030115

How to cite this paper

Constructing a CMBS Default Risk Sentiment Index Using Financial Text Embeddings and Evaluating Its Predictive Effectiveness

How to cite this paper: Jingzhi Yin. (2026). Constructing a CMBS Default Risk Sentiment Index Using Financial Text Embeddings and Evaluating Its Predictive Effectiveness. Engineering Advances6(1), 50-54.

DOI: http://dx.doi.org/10.26855/ea.2026.03.011