Fine-Grained Job Salary Benchmarking with a Nonparametric Dirichlet Process–Based Latent Factor Model
Published Online:18 Apr 2022https://doi.org/10.1287/ijoc.2022.1182
References
- (2005) Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. Knowledge Data Engrg. 17(6):734–749.Crossref, Google Scholar
- (2019) Hyperparameter optimization in learning systems. J. Membrane Comput. 1(4):279–291.Crossref, Google Scholar
- (1974) Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann. Statist. 2(6):1152–1174.Crossref, Google Scholar
- (2006) A Handbook of Human Resource Management Practice (Kogan Page Publishers, London).Google Scholar
- (1998) Managerial compensation and firm performance: A general research framework. Acad. Management J. 41(2):135–145.Crossref, Google Scholar
- (2002) Compensation Decision Making (South-Western Co., Mason, OH).Google Scholar
- (1987) Pay, equity, job gratifications, and comparisons in pay satisfaction. J. Appl. Psych. 72(4):544–551.Crossref, Google Scholar
- (2011) Peer-group dependence in salary benchmarking: A statistical model. Managerial Decision Econom. 32(2):91–104.Google Scholar
- (2006) Variational inference for Dirichlet process mixtures. Bayesian Anal. 1(1):121–143.Crossref, Google Scholar
- (2003) Latent Dirichlet allocation. J. Machine Learn. Res. 3(Jan):993–1022.Google Scholar
- (2006) CEO compensation, director compensation, and firm performance: Evidence of cronyism? J. Corporate Finance 12(3):403–423.Crossref, Google Scholar
- (2011) Statistics for High-Dimensional Data: Methods, Theory and Applications (Springer Science & Business Media, Berlin).Crossref, Google Scholar
- (2006) Does pay-for-performance enhance perceived distributive justice for collectivistic employees? Personnel Rev. 35(4):397–412.Crossref, Google Scholar
- (2001) K-modes clustering. J. Classification 18(1):35–55.Crossref, Google Scholar
- (2015) Dataless text classification with descriptive LDA. Bonet B, Koenig S, eds. Proc. 29th AAAI Conf. Artificial Intelligence, (AAAI Press, Austin, TX), 2224–2231.Google Scholar
- (2006) Research on data mining and knowledge management and its applications in China’s economic development: Significance and trend. Internat. J. Inform. Tech. Decision Making 5(4):585–596.Crossref, Google Scholar
- (2007) Getting a job: Is there a motherhood penalty? Amer. J. Sociol. 112(5):1297–1338.Crossref, Google Scholar
- (2006) Model-based clustering for expression data via a Dirichlet process mixture model. Do K, Müller P, Vannucci M, eds. Bayesian Inference for Gene Expression and Proteomics (Cambridge University Press, Cambridge, UK), 201–218.Crossref, Google Scholar
- (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. Preprint, submitted October 11, https://arxiv.org/abs/1810.04805.Google Scholar
- (2008) Efficient projections onto the l 1-ball for learning in high dimensions. Zhang Z, ed. Proc. 25th Internat. Conf. Machine Learn. (Association for Computing Machinery, New York, NY), 272–279.Google Scholar
- (2003) The Human Resources Program-Evaluation Handbook (SAGE Publications, Inc., Thousand Oaks, CA).Crossref, Google Scholar
- (1995) Bayesian density estimation and inference using mixtures. J. Amer. Statist. Assoc. 90(430):577–588.Crossref, Google Scholar
- (2010) Inside the black box: The role and composition of compensation peer groups. J. Financial Econom. 96(2):257–270.Crossref, Google Scholar
- (1973) A Bayesian analysis of some nonparametric problems. Ann. Statist. 1(2):209–230.Crossref, Google Scholar
- (2001) Interaction of social skill and general mental ability on job performance and salary. J. Appl. Psych. 86(6):1075–1082.Crossref, Google Scholar
- (2010) CEO compensation. Annual Rev. Financial Econom. 2(1):75–102.Crossref, Google Scholar
- (2015) Probabilistic machine learning and artificial intelligence. Nature 521(7553):452–459.Crossref, Google Scholar
- (2017) Self-supervised learning of visual features through embedding images into text topic spaces. Bertino E, Chen Y, eds. Proc. IEEE Conf. Comput. Vision Pattern Recognition (IEEE Computer Society, Los Alamitos, CA), 4230–4239.Google Scholar
- (2013) CEO incentives and earnings prediction. Rev. Quant. Finance Accounting 40(4):647–674.Crossref, Google Scholar
- (2017) Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent Dirichlet allocation. Tourism Management 59:467–483.Crossref, Google Scholar
- (2016) Faculty salary as a predictor of student outgoing salaries from MBA programs. J. Ed. Bus. 91(1):38–44.Crossref, Google Scholar
- (1999) Probabilistic latent semantic indexing. Lang J, ed. Sigir ’99: Proc. 22nd Annual Internat. ACM SIGIR Conf. Res. Development in Informat. Retrieval (Association for Computing Machinery, New York, NY), 50–57.Google Scholar
- (2001) Gibbs sampling methods for stick-breaking priors. J. Amer. Statist. Assoc. 96(453):161–173.Crossref, Google Scholar
- (2015) Do college students make better predictions of their future income than young adults in the labor force? Ed. Econom. 23(2):162–179.Google Scholar
- (2017a) LinkedIn salary: A system for secure collection and presentation of structured compensation insights to job seekers. Bertino E, Chen Y, eds. 2017 IEEE Sympos. Privacy-Aware Comput. (IEEE, Computer Society, Los Alamitos, CA), 13–24.Google Scholar
- (2017b) Bringing salary transparency to the world: Computing robust compensation insights via LinkedIn salary. Lim E-P, Winslett M, eds. Proc. 2017 ACM Conf. Inform. Knowledge Management (ACM, New York, NY), 447–455.Google Scholar
- (2016a) Implement of salary prediction system to improve student motivation using data mining technique. Gaol FL, Theeramunkong T, Papadopo GA, eds. 2016 11th Internat. Conf. Knowledge Inform. Creativity Support Systems (IEEE, Yogyakarta, Indonesia), 1–6.Google Scholar
- (2016b) Random forest for salary prediction system to improve students’ motivation. Manfredi G, Pietro G, eds. 2016 12th Internat. Conf. Signal-Image Tech. Internet-Based Systems (IEEE, Los Alamitos, CA), 637–642.Google Scholar
- (2005) Collaborative filtering based on iterative principal component analysis. Expert Systems Appl. 28(4):823–830.Crossref, Google Scholar
- (2009) Probabilistic Graphical Models: Principles and Techniques (MIT Press, Cambridge, MA).Google Scholar
- (2008) Factorization meets the neighborhood: A multifaceted collaborative filtering model. Li Y, Liu B, eds. Proc. 14th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining, (ACM, New York, NY), 426–434.Google Scholar
- (2009) Matrix factorization techniques for recommender systems. Comput. 42(8):30–37.Crossref, Google Scholar
- (2004) Income prediction via support vector machine. Tourassi GD, ed. Internat. Conf. Machine Learning and Applications, (ICMLA), 143–149.Crossref, Google Scholar
- (2014) Distributed representations of sentences and documents. Xing EP, Jebara T, eds. Internat. Conf. Machine Learn. (PMLR), 1188–1196.Google Scholar
- (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788–791.Crossref, Google Scholar
- (2001) Algorithms for non-negative matrix factorization. Adv. Neural Inform. Processing Systems 13:556–562.Google Scholar
- (2017) Machine learning with big data: Challenges and approaches. IEEE Access 5:7776–7797.Crossref, Google Scholar
- (2017) Collaborative company profiling: Insights from an employee’s perspective. Singh S, Markovitch S, eds. 31st AAAI Conf. Artificial Intelligence (AAAI Press, San Francisco, CA), 1417–1423.Google Scholar
- (2014) An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. IEEE Trans. Indust. Informatics 10(2):1273–1284.Crossref, Google Scholar
- (2018) Intelligent salary benchmarking for talent recruitment: A holistic matrix factorization approach. Zhu F, Yu J, eds. 2018 IEEE Internat. Conf. Data Mining (IEEE Computer Society, Los Alamitos, CA), 337–346.Google Scholar
- (2008) Probabilistic matrix factorization. Platt JC, Koller D, Singer Y, Rowies ST, eds. Proc. 20th Internat. Conf. Adv. Neural Inform. Processing Systems NIPS’07 (Curran Associates Inc., Red Hook, NY), 20:1257–1264.Google Scholar
- (2000) Markov chain sampling methods for Dirichlet process mixture models. J. Comput. Graphical Statist. 9(2):249–265.Google Scholar
- (2016) A Bayesian nonparametric approach for multi-label classification. Durrant RJ, Kim K, eds. Asian Conf. Machine Learn., vol. 63 (University of Waikato, Hamilton, New Zealand), 254–269.Google Scholar
- (2015) Improving matrix factorization recommendations for examples in cold start. Expert Systems Appl. 42(19):6784–6794.Crossref, Google Scholar
- (2007) Improving regularized singular value decomposition for collaborative filtering. Liu B, ed. Proc. KDD Cup and Workshop (New York, NY), 5–8.Google Scholar
- (2017) Text classification method based on self-training and LDA topic models. Expert Systems Appl. 80:83–93.Crossref, Google Scholar
- (2008) Manipulation and equity-based compensation. Amer. Econom. Rev. 98(2):285–290.Crossref, Google Scholar
- (2014) Managerial incentives and stock price manipulation. J. Finance 69(2):487–526.Crossref, Google Scholar
- (2015) The production of salary profiles of ICT professionals: Moving from structured database to big data analytics. Statist. J. IAOS 31(2):177–191.Crossref, Google Scholar
- (2015) An imputation-based matrix factorization method for improving accuracy of collaborative filtering systems. Engrg. Appl. Artificial Intelligence 46:58–66.Crossref, Google Scholar
- (2013) Latent Dirichlet allocation models for image classification. IEEE Trans. Pattern Anal. Machine Intelligence 35(11):2665–2679.Crossref, Google Scholar
- (2001) Item-based collaborative filtering recommendation algorithms. Shen VY, Saito N, eds. Proc. 10th Internat. Conf. World Wide Web (Association for Computing Machinery, New York, NY), 285–295.Google Scholar
- (1996) Why justice matters in compensation decision making. J. Organ. Behav. 17(3):285–299.Crossref, Google Scholar
- (2010) Toward an understanding of business intelligence. Green P, Rosemann M, eds. Proc. 21st Australasian Conf. Inform. Systems. (ACIS 2010), Brisbane, Australia.Google Scholar
- (2010) Decision support as knowledge creation: A business intelligence design theory. Internat. J. Bus. Intelligence Res. 1(1):29–47.Crossref, Google Scholar
- (2005) Sharing clusters among related groups: Hierarchical Dirichlet processes. Saul L, Weiss Y, Bottou L, eds. Adv. Neural Inform. Processing Systems (MIT Press, Cambridge, MA), 17:1385–1392.Google Scholar
- (2003) The relative importance of external, internal, individual and procedural equity to pay satisfaction: Procedural equity may be more important to employees than organizations believe. Compensation Benefits Rev. 35(6):67–74.Crossref, Google Scholar
- (2014) Mining marketing meaning from online chatter: Strategic brand analysis of big data using latent Dirichlet allocation. J. Marketing Res. 51(4):463–479.Crossref, Google Scholar
- (2013) Government or business? Identifying determinants of MPA and MBA students’ career preferences. Public Personnel Management 42(2):239–258.Crossref, Google Scholar
- (2008) Graphical Models, Exponential Families, and Variational Inference (Now Publishers, Inc., Delft, Netherlands).Google Scholar
- (2020) An association-constrained LDA model for joint extraction of product aspects and opinions. Inform. Sci. 519:243–259.Crossref, Google Scholar
- (2011) Collaborative topic modeling for recommending scientific articles. Apte C, Ghosh J, eds. Proc. 17th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM, New York, NY), 448–456.Google Scholar
- (2007) Multi-task learning for classification with Dirichlet process priors. J. Machine Learn. Res. 8:35–63.Google Scholar
- (2015) Network representation learning with rich text information. Yang Q, Wooldridge M, eds. 24th Internat. Joint Conf. Artificial Intelligence (AAAI Press, San Francisco, CA), 2111–2117.Google Scholar
- (2019) Enhancing domain word embedding via latent semantic imputation. Proc. 25th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York, NY) 557–565.Google Scholar
- (2005) A probabilistic model for online document clustering with application to novelty detection. Saul L, Weiss Y, Bottou L, eds. Adv. Neural Inform. Processing Systems (MIT Press, Cambridge, MA), 17:1617–1624.Google Scholar
- (2018b) Label-sensitive task grouping by Bayesian nonparametric approach for multi-task multi-label learning. Lang J, ed. Proc. 27th Internat. Joint Conf. Artificial Intelligence IJCAI (AAAI Press, San Francisco, CA), 3125–3131.Crossref, Google Scholar
- (2018a) Caden: A context-aware deep embedding network for financial opinions mining. Zhu F, Yu J, eds. 2018 IEEE Internat. Conf. Data Mining (IEEE Computer Society, Los Alamitos, CA), 757–766.Google Scholar

