Fine-Grained Job Salary Benchmarking with a Nonparametric Dirichlet Process–Based Latent Factor Model

Published Online:https://doi.org/10.1287/ijoc.2022.1182

References

  • Adomavicius G, Tuzhilin A (2005) Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. Knowledge Data Engrg. 17(6):734–749.CrossrefGoogle Scholar
  • Andonie R (2019) Hyperparameter optimization in learning systems. J. Membrane Comput. 1(4):279–291.CrossrefGoogle Scholar
  • Antoniak CE (1974) Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann. Statist. 2(6):1152–1174.CrossrefGoogle Scholar
  • Armstrong M (2006) A Handbook of Human Resource Management Practice (Kogan Page Publishers, London).Google Scholar
  • Barkema HG, Gomez-Mejia LR (1998) Managerial compensation and firm performance: A general research framework. Acad. Management J. 41(2):135–145.CrossrefGoogle Scholar
  • Bergmann T, Scarpello V (2002) Compensation Decision Making (South-Western Co., Mason, OH).Google Scholar
  • Berkowitz L, Fraser C, Treasure FP, Cochran S (1987) Pay, equity, job gratifications, and comparisons in pay satisfaction. J. Appl. Psych. 72(4):544–551.CrossrefGoogle Scholar
  • Blankmeyer E, LeSage JP, Stutzman J, Knox KJ, Pace RK (2011) Peer-group dependence in salary benchmarking: A statistical model. Managerial Decision Econom. 32(2):91–104.Google Scholar
  • Blei DM, Jordan MI (2006) Variational inference for Dirichlet process mixtures. Bayesian Anal. 1(1):121–143.CrossrefGoogle Scholar
  • Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J. Machine Learn. Res. 3(Jan):993–1022.Google Scholar
  • Brick IE, Palmon O, Wald JK (2006) CEO compensation, director compensation, and firm performance: Evidence of cronyism? J. Corporate Finance 12(3):403–423.CrossrefGoogle Scholar
  • Bühlmann P, Van De Geer S (2011) Statistics for High-Dimensional Data: Methods, Theory and Applications (Springer Science & Business Media, Berlin).CrossrefGoogle Scholar
  • Chang E, Hahn J (2006) Does pay-for-performance enhance perceived distributive justice for collectivistic employees? Personnel Rev. 35(4):397–412.CrossrefGoogle Scholar
  • Chaturvedi A, Green PE, Caroll JD (2001) K-modes clustering. J. Classification 18(1):35–55.CrossrefGoogle Scholar
  • Chen X, Xia Y, Jin P, Carroll J (2015) Dataless text classification with descriptive LDA. Bonet B, Koenig S, eds. Proc. 29th AAAI Conf. Artificial Intelligence, (AAAI Press, Austin, TX), 2224–2231.Google Scholar
  • Cheng S, Dai R, Xu W, Shi Y (2006) Research on data mining and knowledge management and its applications in China’s economic development: Significance and trend. Internat. J. Inform. Tech. Decision Making 5(4):585–596.CrossrefGoogle Scholar
  • Correll SJ, Benard S, Paik I (2007) Getting a job: Is there a motherhood penalty? Amer. J. Sociol. 112(5):1297–1338.CrossrefGoogle Scholar
  • Dahl DB (2006) Model-based clustering for expression data via a Dirichlet process mixture model. Do K, Müller P, Vannucci M, eds. Bayesian Inference for Gene Expression and Proteomics (Cambridge University Press, Cambridge, UK), 201–218.CrossrefGoogle Scholar
  • Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. Preprint, submitted October 11, https://arxiv.org/abs/1810.04805.Google Scholar
  • Duchi J, Shalev-Shwartz S, Singer Y, Chandra T (2008) Efficient projections onto the l 1-ball for learning in high dimensions. Zhang Z, ed. Proc. 25th Internat. Conf. Machine Learn. (Association for Computing Machinery, New York, NY), 272–279.Google Scholar
  • Edwards JE, Scott JC, Raju NS (2003) The Human Resources Program-Evaluation Handbook (SAGE Publications, Inc., Thousand Oaks, CA).CrossrefGoogle Scholar
  • Escobar MD, West M (1995) Bayesian density estimation and inference using mixtures. J. Amer. Statist. Assoc. 90(430):577–588.CrossrefGoogle Scholar
  • Faulkender M, Yang J (2010) Inside the black box: The role and composition of compensation peer groups. J. Financial Econom. 96(2):257–270.CrossrefGoogle Scholar
  • Ferguson TS (1973) A Bayesian analysis of some nonparametric problems. Ann. Statist. 1(2):209–230.CrossrefGoogle Scholar
  • Ferris GR, Witt LA, Hochwarter WA (2001) Interaction of social skill and general mental ability on job performance and salary. J. Appl. Psych. 86(6):1075–1082.CrossrefGoogle Scholar
  • Frydman C, Jenter D (2010) CEO compensation. Annual Rev. Financial Econom. 2(1):75–102.CrossrefGoogle Scholar
  • Ghahramani Z (2015) Probabilistic machine learning and artificial intelligence. Nature 521(7553):452–459.CrossrefGoogle Scholar
  • Gomez L, Patel Y, Rusiñol M, Karatzas D, Jawahar C (2017) Self-supervised learning of visual features through embedding images into text topic spaces. Bertino E, Chen Y, eds. Proc. IEEE Conf. Comput. Vision Pattern Recognition (IEEE Computer Society, Los Alamitos, CA), 4230–4239.Google Scholar
  • Gong JJ, Li S (2013) CEO incentives and earnings prediction. Rev. Quant. Finance Accounting 40(4):647–674.CrossrefGoogle Scholar
  • Guo Y, Barnes SJ, Jia Q (2017) Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent Dirichlet allocation. Tourism Management 59:467–483.CrossrefGoogle Scholar
  • Hamlen KR, Hamlen WA (2016) Faculty salary as a predictor of student outgoing salaries from MBA programs. J. Ed. Bus. 91(1):38–44.CrossrefGoogle Scholar
  • Hofmann T (1999) Probabilistic latent semantic indexing. Lang J, ed. Sigir ’99: Proc. 22nd Annual Internat. ACM SIGIR Conf. Res. Development in Informat. Retrieval (Association for Computing Machinery, New York, NY), 50–57.Google Scholar
  • Ishwaran H, James LF (2001) Gibbs sampling methods for stick-breaking priors. J. Amer. Statist. Assoc. 96(453):161–173.CrossrefGoogle Scholar
  • Jerrim J (2015) Do college students make better predictions of their future income than young adults in the labor force? Ed. Econom. 23(2):162–179.Google Scholar
  • Kenthapadi K, Chudhary A, Ambler S (2017a) LinkedIn salary: A system for secure collection and presentation of structured compensation insights to job seekers. Bertino E, Chen Y, eds. 2017 IEEE Sympos. Privacy-Aware Comput. (IEEE, Computer Society, Los Alamitos, CA), 13–24.Google Scholar
  • Kenthapadi K, Ambler S, Zhang L, Agarwal D (2017b) Bringing salary transparency to the world: Computing robust compensation insights via LinkedIn salary. Lim E-P, Winslett M, eds. Proc. 2017 ACM Conf. Inform. Knowledge Management (ACM, New York, NY), 447–455.Google Scholar
  • Khongchai P, Songmuang P (2016a) Implement of salary prediction system to improve student motivation using data mining technique. Gaol FL, Theeramunkong T, Papadopo GA, eds. 2016 11th Internat. Conf. Knowledge Inform. Creativity Support Systems (IEEE, Yogyakarta, Indonesia), 1–6.Google Scholar
  • Khongchai P, Songmuang P (2016b) Random forest for salary prediction system to improve students’ motivation. Manfredi G, Pietro G, eds. 2016 12th Internat. Conf. Signal-Image Tech. Internet-Based Systems (IEEE, Los Alamitos, CA), 637–642.Google Scholar
  • Kim D, Yum B-J (2005) Collaborative filtering based on iterative principal component analysis. Expert Systems Appl. 28(4):823–830.CrossrefGoogle Scholar
  • Koller D, Friedman N (2009) Probabilistic Graphical Models: Principles and Techniques (MIT Press, Cambridge, MA).Google Scholar
  • Koren Y (2008) Factorization meets the neighborhood: A multifaceted collaborative filtering model. Li Y, Liu B, eds. Proc. 14th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining, (ACM, New York, NY), 426–434.Google Scholar
  • Koren Y, Bell R, Volinsky C (2009) Matrix factorization techniques for recommender systems. Comput. 42(8):30–37.CrossrefGoogle Scholar
  • Lazar A (2004) Income prediction via support vector machine. Tourassi GD, ed. Internat. Conf. Machine Learning and Applications, (ICMLA), 143–149.CrossrefGoogle Scholar
  • Le Q, Mikolov T (2014) Distributed representations of sentences and documents. Xing EP, Jebara T, eds. Internat. Conf. Machine Learn. (PMLR), 1188–1196.Google Scholar
  • Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788–791.CrossrefGoogle Scholar
  • Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. Adv. Neural Inform. Processing Systems 13:556–562.Google Scholar
  • L’heureux A, Grolinger K, Elyamany HF, Capretz MA (2017) Machine learning with big data: Challenges and approaches. IEEE Access 5:7776–7797.CrossrefGoogle Scholar
  • Lin H, Zhu H, Zuo Y, Zhu C, Wu J, Xiong H (2017) Collaborative company profiling: Insights from an employee’s perspective. Singh S, Markovitch S, eds. 31st AAAI Conf. Artificial Intelligence (AAAI Press, San Francisco, CA), 1417–1423.Google Scholar
  • Luo X, Zhou M, Xia Y, Zhu Q (2014) An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. IEEE Trans. Indust. Informatics 10(2):1273–1284.CrossrefGoogle Scholar
  • Meng Q, Zhu H, Xiao K, Xiong H (2018) Intelligent salary benchmarking for talent recruitment: A holistic matrix factorization approach. Zhu F, Yu J, eds. 2018 IEEE Internat. Conf. Data Mining (IEEE Computer Society, Los Alamitos, CA), 337–346.Google Scholar
  • Mnih A, Salakhutdinov RR (2008) Probabilistic matrix factorization. Platt JC, Koller D, Singer Y, Rowies ST, eds. Proc. 20th Internat. Conf. Adv. Neural Inform. Processing Systems NIPS’07 (Curran Associates Inc., Red Hook, NY), 20:1257–1264.Google Scholar
  • Neal RM (2000) Markov chain sampling methods for Dirichlet process mixture models. J. Comput. Graphical Statist. 9(2):249–265.Google Scholar
  • Nguyen V, Gupta S, Rana S, Li C, Venkatesh S (2016) A Bayesian nonparametric approach for multi-label classification. Durrant RJ, Kim K, eds. Asian Conf. Machine Learn., vol. 63 (University of Waikato, Hamilton, New Zealand), 254–269.Google Scholar
  • Ocepek U, Rugelj J, Bosnić Z (2015) Improving matrix factorization recommendations for examples in cold start. Expert Systems Appl. 42(19):6784–6794.CrossrefGoogle Scholar
  • Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. Liu B, ed. Proc. KDD Cup and Workshop (New York, NY), 5–8.Google Scholar
  • Pavlinek M, Podgorelec V (2017) Text classification method based on self-training and LDA topic models. Expert Systems Appl. 80:83–93.CrossrefGoogle Scholar
  • Peng L, Röell A (2008) Manipulation and equity-based compensation. Amer. Econom. Rev. 98(2):285–290.CrossrefGoogle Scholar
  • Peng L, Röell A (2014) Managerial incentives and stock price manipulation. J. Finance 69(2):487–526.CrossrefGoogle Scholar
  • Ramasamy R (2015) The production of salary profiles of ICT professionals: Moving from structured database to big data analytics. Statist. J. IAOS 31(2):177–191.CrossrefGoogle Scholar
  • Ranjbar M, Moradi P, Azami M, Jalili M (2015) An imputation-based matrix factorization method for improving accuracy of collaborative filtering systems. Engrg. Appl. Artificial Intelligence 46:58–66.CrossrefGoogle Scholar
  • Rasiwasia N, Vasconcelos N (2013) Latent Dirichlet allocation models for image classification. IEEE Trans. Pattern Anal. Machine Intelligence 35(11):2665–2679.CrossrefGoogle Scholar
  • Sarwar B, Karypis G, Konstan J, Riedl J (2001) Item-based collaborative filtering recommendation algorithms. Shen VY, Saito N, eds. Proc. 10th Internat. Conf. World Wide Web (Association for Computing Machinery, New York, NY), 285–295.Google Scholar
  • Scarpello V, Jones FF (1996) Why justice matters in compensation decision making. J. Organ. Behav. 17(3):285–299.CrossrefGoogle Scholar
  • Shollo A, Kautz K (2010) Toward an understanding of business intelligence. Green P, Rosemann M, eds. Proc. 21st Australasian Conf. Inform. Systems. (ACIS 2010), Brisbane, Australia.Google Scholar
  • Steiger DM (2010) Decision support as knowledge creation: A business intelligence design theory. Internat. J. Bus. Intelligence Res. 1(1):29–47.CrossrefGoogle Scholar
  • Teh YW, Jordan MI, Beal MJ, Blei DM (2005) Sharing clusters among related groups: Hierarchical Dirichlet processes. Saul L, Weiss Y, Bottou L, eds. Adv. Neural Inform. Processing Systems (MIT Press, Cambridge, MA), 17:1385–1392.Google Scholar
  • Terpstra DE, Honoree AL (2003) The relative importance of external, internal, individual and procedural equity to pay satisfaction: Procedural equity may be more important to employees than organizations believe. Compensation Benefits Rev. 35(6):67–74.CrossrefGoogle Scholar
  • Tirunillai S, Tellis GJ (2014) Mining marketing meaning from online chatter: Strategic brand analysis of big data using latent Dirichlet allocation. J. Marketing Res. 51(4):463–479.CrossrefGoogle Scholar
  • Van der Wal Z, Oosterbaan A (2013) Government or business? Identifying determinants of MPA and MBA students’ career preferences. Public Personnel Management 42(2):239–258.CrossrefGoogle Scholar
  • Wainwright MJ, Jordan MI (2008) Graphical Models, Exponential Families, and Variational Inference (Now Publishers, Inc., Delft, Netherlands).Google Scholar
  • Wan C, Peng Y, Xiao K, Liu X, Jiang T, Liu D (2020) An association-constrained LDA model for joint extraction of product aspects and opinions. Inform. Sci. 519:243–259.CrossrefGoogle Scholar
  • Wang C, Blei DM (2011) Collaborative topic modeling for recommending scientific articles. Apte C, Ghosh J, eds. Proc. 17th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM, New York, NY), 448–456.Google Scholar
  • Xue Y, Liao X, Carin L, Krishnapuram B (2007) Multi-task learning for classification with Dirichlet process priors. J. Machine Learn. Res. 8:35–63.Google Scholar
  • Yang C, Liu Z, Zhao D, Sun M, Chang E (2015) Network representation learning with rich text information. Yang Q, Wooldridge M, eds. 24th Internat. Joint Conf. Artificial Intelligence (AAAI Press, San Francisco, CA), 2111–2117.Google Scholar
  • Yao S, Yu D, Xiao K (2019) Enhancing domain word embedding via latent semantic imputation. Proc. 25th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York, NY) 557–565.Google Scholar
  • Zhang J, Ghahramani Z, Yang Y (2005) A probabilistic model for online document clustering with application to novelty detection. Saul L, Weiss Y, Bottou L, eds. Adv. Neural Inform. Processing Systems (MIT Press, Cambridge, MA), 17:1617–1624.Google Scholar
  • Zhang X, Li W, Nguyen V, Zhuang F, Xiong H, Lu S (2018b) Label-sensitive task grouping by Bayesian nonparametric approach for multi-task multi-label learning. Lang J, ed. Proc. 27th Internat. Joint Conf. Artificial Intelligence IJCAI (AAAI Press, San Francisco, CA), 3125–3131.CrossrefGoogle Scholar
  • Zhang L, Xiao K, Zhu H, Liu C, Yang J, Jin B (2018a) Caden: A context-aware deep embedding network for financial opinions mining. Zhu F, Yu J, eds. 2018 IEEE Internat. Conf. Data Mining (IEEE Computer Society, Los Alamitos, CA), 757–766.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.