Predictive Analytics in Behavioral Finance: Modeling Investor Sentiment with NLP Techniques
DOI:
https://doi.org/10.64137/XXXXXXXX/IJFEMS-V1I1P103Keywords:
Investor sentiment, Predictive analytics, Behavioral finance, NLP, Sentiment analysis, Deep learning, Financial text, Market forecasting, Stock price predictionAbstract
Behavioral finance with predictive analytics is a developing discipline that tries to understand how psychological powers and behavioral predispositions impact the budgetary choices of investors. In this research paper, utilizing Natural Language Processing (NLP) techniques for modeling of the investor sentiment is studied, using text data from financial news, social media, blogs and financial statements. Sentiment of the investors (an important variable in behavioral finance) can cause significant volatility in the market as well as the pricing of the assets. Quantifying sentiment is complicated and subjective, though. In this paper, we present a flexible framework with sentiment analysis, topic modeling and deep learning, which explores the output of unstructured financial text data as Actionable Sentiment Scores. The research starts off by investigating the impact of behavioral finance and data science on financial analysis, with a focus on the way that non-rational factors, including emotions and biases, play a role in financial decision-making. It then goes on to explain recent advances in NLP, particularly in transformer-based models such as BERT and FinBERT and their applicability for finance-specific applications. Following a multi-source data aggregation strategy, this study extracts investor opinions and applies a sentiment classification, which is further utilized with a time series regression model to evaluate how sentiment trends affect stock price changes. However, findings from this paper also offer quantitative outcomes that show a statistically significant correlation when it comes to aggregated sentiment scores and abnormal returns in specific market segments. The experimental setup contains data from 2017 to 2023, whereas the key tools utilized are Python, HuggingFace Transformers and APIs related to finances. A hybrid model is used to model sentiment with rule-based approaches such as VADER, Loughran-McDonald lexicon and machine learning such as LSTM and BERT. This shows the added value to both theoretical and applied domains by combining behavioral theories of finance and the latest computational techniques to improve market prediction models and make better decisions. Implications for practice include the development of real-time sentiment dashboards for traders as well as risk managers. It finally concludes by suggesting future directions of research, like multilingual sentiment analysis and building real-time prediction engines
References
[1] Barberis, N., Shleifer, A., & Vishny, R. (1998). A model of investor sentiment. Journal of Financial Economics, 49(3), 307-343.
[2] Kahneman, D., & Tversky, A. (2013). Prospect theory: An analysis of decision under risk. In Handbook of the fundamentals of financial decision making: Part I (pp. 99-127).
[3] Barberis, N., Huang, M., & Santos, T. (2001). Prospect theory and asset prices. The quarterly journal of economics, 116(1), 1-53.
[4] Tetlock, P. C. (2007). Giving content to investor sentiment: The role of media in the stock market. The Journal of finance, 62(3), 1139-1168.
[5] Loughran, T., & McDonald, B. (2011). When is a liability not a liability? Textual analysis, dictionaries, and 10‐Ks. The Journal of finance, 66(1), 35-65.
[6] Pang, B., & Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends® in information retrieval, 2(1–2), 1-135.
[7] Nassirtoussi, A. K., Aghabozorgi, S., Wah, T. Y., & Ngo, D. C. L. (2014). Text mining for market prediction: A systematic review. Expert Systems with Applications, 41(16), 7653-7670.
[8] Hagenau, M., Liebmann, M., & Neumann, D. (2013). Automated news reading: Stock price prediction based on financial news using context-capturing features. Decision support systems, 55(3), 685-697.
[9] Chen, H., De, P., Hu, Y., & Hwang, B. H. (2014). Wisdom of crowds: The value of stock opinions transmitted through social media. The review of financial studies, 27(5), 1367-1403.
[10] Schumaker, R. P., & Chen, H. (2009). Textual analysis of stock market prediction using breaking financial news: The AZFin text system. ACM Transactions on Information Systems (TOIS), 27(2), 1-19.
[11] Patel, J., Shah, S., Thakkar, P., & Kotecha, K. (2015). Predicting stock and stock price index movement using trend deterministic data preparation and machine learning techniques. Expert systems with applications, 42(1), 259-268.
[12] Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural computation, 9(8), 1735-1780.
[13] Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019, June). Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) (pp. 4171-4186).
[14] Araci, D. (2019). Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063.
[15] Khedr, A. E., & Yaseen, N. (2017). Predicting stock market behavior using data mining technique and news sentiment analysis. International Journal of Intelligent Systems and Applications, 9(7), 22.
[16] Broby, D. (2022). The use of predictive analytics in finance. The Journal of Finance and Data Science, 8, 145-161.
[17] Maha, Dr S. (2025). The Cost of Freedom: Intersectionality and Socioeconomic Inequality in Toni Morrison’s The Bluest Eye, International Journal of Humanities Science Innovations and Management Studies, 1(2), 1-8.
[18] Wirawan, P. (2023). Leveraging Predictive Analytics in Financing Decision-Making for Comparative Analysis and Optimization. Advances in Management & Financial Reporting, 1(3), 157-169.
[19] Maha, Dr S. and Ezhugnayiru, Dr. A. (2025). “Freedom and Intersectionality in the Contemporary Age” International Research Journal of Economics and Management Studies, 4(5), 78-82. https://irjems.org/irjems-v4i5p111.html
[20] Dr. Priya. A., Dr. Charles Arockiasamy J., “The Global Reach of AI: A Postcolonial Analysis of Technological Dominance,” International Journal of Scientific Research in Science and Technology, 11(2), 1-5, 2025.
[21] S. Maha, A. Ezhugnayiru (2025) Analyzing the Intersectionality of Gender, Race, and Technology in Modern Workspaces, International Journal of Multidisciplinary in Humanities and Social Sciences, 1(1), 1-11.
[22] Dr. S. Maha, and T. Jayakumar (2016) “Intersectionality: An Overview,” Shanlax International Journal of English, 4 (3) 18-22. https://www.shanlaxjournals.in/journals/index.php/english/article/view/3052
[23] Dr. S. Maha, (2023) “Dismantling Assumptions: Uncovering Racial and Sexual Stereotypes in Gayl Jones’s Corregidora,” AMNAYIKI, 24 (2) 244-248.