Divide and Contrast: A Text-Based Method for Firm Market Risk Prediction
Published Online:25 Apr 2025https://doi.org/10.1287/ijoc.2023.0195
References
- (2020) Longformer: The long-document transformer. Preprint, submitted April 10, https://arxiv.org/abs/2004.05150.Google Scholar
- (1989) Post-earnings-announcement drift: Delayed price response or risk premium? J. Accounting Res. 27:1–36.Crossref, Google Scholar
- (2019) The pricing of options and corporate liabilities. World Scientific Reference on Contingent Claims Analysis in Corporate Finance: Volume 1: Foundations of CCA and Equity Valuation (World Scientific, Singapore), 3–21.Crossref, Google Scholar
- (2003) Latent dirichlet allocation. J. Machine Learn. Res. 3:993–1022.Google Scholar
- (2003) Volatility forecasting for risk management. J. Forecasting 22(1):1–22.Crossref, Google Scholar
- (2020) Language models are few-shot learners. Adv. Neural Inform. Processing Systems, vol. 33 (MIT Press, Cambridge, MA), 1877–1901.Google Scholar
- (2010) Comparison of volatility measures: A risk management perspective. J. Financial Econometrics 8(1):29–56.Crossref, Google Scholar
- (2003) Open versus closed conference calls: The determinants and effects of broadening access to disclosure. J. Accounting Econom. 34(1–3):149–180.Crossref, Google Scholar
- (2016) Why are 10-k filings so long? Accounting Horizons 30(1):1–21.Crossref, Google Scholar
- (2020) A simple framework for contrastive learning of visual representations. Internat. Conf. Machine Learn. (PMLR, New York), 1597–1607.Google Scholar
- (2020) Knowledge graph-based event embedding framework for financial quantitative investments. Special Interest Group Inform. Retrieval (ACM, New York), 2221–2230.Crossref, Google Scholar
- (2024) Does alternative data improve financial forecasting? The horizon effect. J. Finance 79(3):2237–2287.Crossref, Google Scholar
- (2019) Bert: Pre-training of deep bidirectional transformers for language understanding. NAACL-HLT (Association for Computational Linguistics, Pennsylvania), 4171–4186.Google Scholar
- (2012) Earnings conference call content and stock price: The case of REITs. J. Real Estate Finance Econom. 45(2):402–434.Crossref, Google Scholar
- (2009) Equilibrium portfolio strategies in the presence of sentiment risk and excess volatility. J. Finance 64(2):579–629.Crossref, Google Scholar
- (2017) Using natural language processing to assess text usefulness to readers: The case of conference calls and earnings prediction. Preprint, submitted February 14, https://dx.doi.org/10.2139/ssrn.3095754.Google Scholar
- (1999) An empirical examination of conference calls as a voluntary disclosure medium. J. Accounting Res. 37(1):133–150.Crossref, Google Scholar
- (2022) Bag-of-words vs. graph vs. sequence in text classification: Questioning the necessity of text-graphs and the surprising strength of a wide MLP. 60th Annual Meeting Assoc. Comput. Linguistics (Association for Computational Linguistics, Pennsylvania), 4038–4051.Google Scholar
- (2021) DeCLUTR: Deep contrastive learning for unsupervised textual representations. ACL/IJCNLP (Association for Computational Linguistics, Pennsylvania), 879–895.Google Scholar
- (2020) Momentum contrast for unsupervised visual representation learning. Confe. Comput. Vision Pattern Recognition (Computer Vision Foundation/IEEE, Piscataway, NJ), 9729–9738.Google Scholar
- (2025) Divide-and-contrast: A machine learning method for text-based risk prediction using earnings conference call transcripts. http://dx.doi.org/10.1287/ijoc.2023.0195.cd, https://github.com/INFORMSJoC/2023.0195.Google Scholar
- (2016) Text-based network industries and endogenous product differentiation. J. Political Econom. 124(5):1423–1465.Crossref, Google Scholar
- (2014) Estimating sensitivities of portfolio credit risk using Monte Carlo. INFORMS J. Comput. 26(4):848–865.Link, Google Scholar
- (2021) Gilbert: Generative vision-language pre-training for image-text retrieval. Special Interest Group Inform. Retrieval (ACM, New York), 1379–1388.Google Scholar
- (2018) Analyst information discovery and interpretation roles: A topic modeling approach. Management Sci. 64(6):2833–2855.Link, Google Scholar
- (2020) Online risk monitoring using offline simulation. INFORMS J. Comput. 32(2):356–375.Abstract, Google Scholar
- (2014) Adam: A method for stochastic optimization. Preprint, submitted December 22, https://arxiv.org/abs/1412.6980.Google Scholar
- (2013) Auto-encoding variational bayes. Preprint, submitted December 20, https://arxiv.org/abs/1312.6114.Google Scholar
- (2020) Reformer: The efficient transformer. Preprint, submitted January 13, https://arxiv.org/abs/2001.04451.Google Scholar
- (2009) Predicting risk from financial reports with regression. HLT-NAACL (The Association for Computational Linguistics, Pennsylvania), 272–280.Google Scholar
- (2021) Learning discriminative neural representations for event detection. Special Interest Group Inform. Retrieval (ACM, New York), 644–653.Google Scholar
- (2021) Self-supervised learning: Generative or contrastive. IEEE Trans. Knowledge Data Engrg. 35(1):857–876.Google Scholar
- (2021) SimCLS: A simple framework for contrastive learning of abstractive summarization. ACL/IJCNLP (Association for Computational Linguistics, Pennsylvania), 1065–1072.Google Scholar
- (2018) An efficient framework for learning sentence representations. ICLR (OpenReview.net).Google Scholar
- (2016) Textual analysis in accounting and finance: A survey. J. Accounting Res. 54(4):1187–1230.Crossref, Google Scholar
- (2005) Practical issues in forecasting volatility. Financial Anal. J. 61(1):45–56.Crossref, Google Scholar
- (2012) Earnings conference calls and stock returns: The incremental informativeness of textual tone. J. Banking Finance 36(4):992–1011.Crossref, Google Scholar
- (2019) What you say and how you say it matters: Predicting stock volatility using verbal and vocal cues. ACL (Association for Computational Linguistics, Pennsylvania), 390–401.Google Scholar
- (2019) Sentence-BERT: Sentence embeddings using Siamese BERT-networks. Proc. 2019 Conf. Empirical Methods Natl. Language Processing (Association for Computational Linguistics, Pennsylvania).Google Scholar
- (2021) Hyperbolic online time stream modeling. Special Interest Group Inform. Retrieval (ACM, New York), 1682–1686.Google Scholar
- (2020a) Synthesizer: Rethinking self-attention in transformer models. Preprint, submitted May 2, https://arxiv.org/abs/2005.00743.Google Scholar
- (2020b) Long range arena: A benchmark for efficient transformers. Preprint, submitted November 8, https://arxiv.org/abs/2011.04006.Google Scholar
- (2019) Profet: Predicting the risk of firms from event transcripts. Internat. Joint Conf. Artificial Intelligence, 5211–5217.Google Scholar
- (2021) Learning discriminative text representation for streaming social event detection. IEEE Trans. Knowledge Data Engrg. 35(12):12295–12309.Crossref, Google Scholar
- (2023) LLaMA: Open and efficient foundation language models. Preprint, submitted February 27, https://arxiv.org/abs/2302.13971.Google Scholar
- (2018) Representation learning with contrastive predictive coding. Preprint, submitted July 10, https://arxiv.org/abs/1807.03748.Google Scholar
- (2017) Attention is all you need. Adv. Neural Inform. Processing Systems, vol. 30 (MIT Press, Cambridge, MA).Google Scholar
- (2014) A semiparametric Gaussian copula regression model for predicting financial risks from earnings calls. ACL (The Association for Computer Linguistics, Pennsylvania), 1155–1165.Google Scholar
- (2021) ConSERT: A contrastive framework for self-supervised sentence representation transfer. ACL/IJCNLP (Association for Computational Linguistics, Pennsylvania), 5065–5075.Google Scholar
- (2022) Analyzing firm reports for volatility prediction: A knowledge-driven text-embedding approach. INFORMS J. Comput. 34(1):522–540.Link, Google Scholar
- (2023) Unlocking the power of voice for financial risk prediction: A theory-driven deep learning design approach. MIS Quart. 47(1):63–96.Crossref, Google Scholar
- (2020) Financial risk prediction with multi-round Q&A attention network. Internat. Joint Conf. Artificial Intelligence, 4576–4582.Google Scholar
- , Ravula A, Wang Q, Yang L, (2020) Big bird: Transformers for longer sequences. Adv. in Neural Inform. Processing Systems, vol. 33 (MIT Press, Cambridge, MA), 17283–17297.Google Scholar
- (2020a) An unsupervised sentence embedding method by mutual information maximization. EMNLP (Association for Computational Linguistics, Pennsylvania), 1601–1610.Google Scholar
- (2020b) Cost-sensitive portfolio selection via deep reinforcement learning. IEEE Trans. Knowledge Data Engrg. 34(1):236–248.Google Scholar
- (2022) Exploiting expert knowledge for assigning firms to industries: A novel deep learning method. Preprint, submitted September 11, https://arxiv.org/abs/2209.05943.Google Scholar
- (2020) S3-rec: Self-supervised learning for sequential recommendation with mutual information maximization. Proc. 29th ACM Internat. Conf. Inform. Knowledge Management (ACM, New York), 1893–1902.Google Scholar

