
INFORMS Journal on Computing

Volume 34, Issue 5

September-October 2022

Pages 2383-2865, C2

Information

Received: May 03, 2020
Accepted: February 12, 2022
Published Online: April 18, 2022

Cite as

Qingxin Meng, Keli Xiao, Dazhong Shen, Hengshu Zhu, Hui Xiong (2022) Fine-Grained Job Salary Benchmarking with a Nonparametric Dirichlet Process–Based Latent Factor Model. INFORMS Journal on Computing 34(5):2443-2463.

https://doi.org/10.1287/ijoc.2022.1182

Fine-Grained Job Salary Benchmarking with a Nonparametric Dirichlet Process–Based Latent Factor Model
