Rethinking Cost-Sensitive Classification in Deep Learning via Adversarial Data Augmentation
Published Online:28 Feb 2024https://doi.org/10.1287/ijds.2022.0033
References
- (2017) A closer look at memorization in deep networks. Proc. Internat. Conf. on Machine Learn. (PMLR, New York), 233–242.Google Scholar
- (2014) Example-dependent cost-sensitive logistic regression for credit scoring. Proc. 13th Internat. Conf. on Machine Learn. and Applications (IEEE, New York), 263–269.Google Scholar
- (2020) Benign overfitting in linear regression. Proc. Natl. Acad. Sci. USA 117(48):30063–30070.Google Scholar
- (2018) Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate. Adv. Neural Inform. Processing Systems 31 (MIT Press, Cambridge, MA), 2300–2311.Google Scholar
- (1984) Classification and Regression Trees (Wadsworth, Belmont, CA).Google Scholar
- (2020) Language models are few-shot learners. Adv. Neural Inform. Processing Systems 33 (MIT Press, Cambridge, MA), 1877–1901.Google Scholar
- (2017) Mitigating evasion attacks to deep neural networks via region-based classification. Proc. 33rd Annual Computer Security Applications Conf. (ACM, New York), 278–287.Google Scholar
- (1999) Toward scalable learning with non-uniform distributions: Effects and a multi-classifier approach. Proc. 4th Internat. Conf. on Knowledge Discovery and Data Mining (ACM, New York).Google Scholar
- (2015) Cost-aware pre-training for multiclass cost-sensitive deep learning. Preprint, submitted November 30, https://arxiv.org/abs/1511.09337.Google Scholar
- (2020) Cost-sensitive deep learning with layer-wise cost estimation. Proc. Internat. Conf. on Tech. and Applications of Artificial Intelligence (IEEE, New York), 108–113.Google Scholar
- (2012) The mnist database of handwritten digit images for machine learning research. IEEE Signal Processing Magazine 29(6):141–142.Google Scholar
- (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. Preprint, submitted October 11, https://arxiv.org/abs/1810.04805.Google Scholar
- (1999) Metacost: A general method for making classifiers cost-sensitive. Proc. 5th ACM SIGKDD Internat. Conf. on Knowledge Discovery and Data Mining (ACM, New York), 155–164.Google Scholar
- (2021) Interpolation can hurt robust generalization even when there is no noise. Adv. Neural Inform. Processing Systems 34 (MIT Press, Cambridge, MA), 23465–23477.Google Scholar
- (2020) An image is worth 16x16 words: Transformers for image recognition at scale. Preprint, submitted October 22, https://arxiv.org/abs/2010.11929.Google Scholar
- (2014) Explaining and harnessing adversarial examples. Preprint, submitted December 20, https://arxiv.org/abs/1412.6572.Google Scholar
- (2017) On calibration of modern neural networks. Proc. Internat. Conf. on Machine Learn. (PMLR, New York), 1321–1330.Google Scholar
- (2022) The curse of overparametrization in adversarial training: Precise analysis of robust generalization for random features regression. Preprint, submitted January 13, https://arxiv.org/abs/2201.05149.Google Scholar
- (2016) Deep residual learning for image recognition. Proc. IEEE Conf. on Computer Vision and Pattern Recognition (IEEE, Piscataway, NJ), 770–778.Google Scholar
- Health Services Department (2019) National action plan for adverse drug event prevention, office of disease prevention and health promotion, department of health and human services. Accessed November 1, 2021, https://health.gov/sites/default/files/2019-09/ADE-Action-Plan-508c.pdf.Google Scholar
- (2019) Knowledge distillation with adversarial samples supporting decision boundary. Proc. Conf. AAAI Artificial Intelligence 33(01):3771–3778.Google Scholar
- Institute for Safe Medication Practices (ISMP) (2021) ISMP list of high-alert medications in community/ambulatory care settings. Accessed July 10, 2022, https://www.ismp.org/recommendations/high-alert-medications-community-ambulatory-list.Google Scholar
- (2019) Cost-sensitive support vector machines. Neurocomputing 343:50–64.Google Scholar
- (2018) Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5):1122–1131.Google Scholar
- (2015) Adam: A method for stochastic optimization. 3rd Internat. Conf. Learning Representations (San Diego).Google Scholar
- (2020) Blt: Balancing long-tailed data sets with adversarially-perturbed images. Proc. Asian Conf. on Computer Vision (ACM, New York).Google Scholar
- (1998) Cost-sensitive learning with neural networks. ECAI 15(27):88–94.Google Scholar
- Kurakin A, Goodfellow IJ, Bengio S (2018) Adversarial examples in the physical world. Artificial Intelligence Safety and Security (Chapman and Hall/CRC, Boca Raton, FL), 99–112.Google Scholar
- (2021) Performance evaluation of a prescription medication image classification model: An observational cohort. NPJ Digital Medicine 4(1):1–8.Google Scholar
- (2022) Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation. Proc. Internat. Conf. on Machine Learn. (PMLR, New York), 12888–12900.Google Scholar
- (2021) Comparative study of adversarial training methods for long-tailed classification. Proc. 1st Internat. Workshop on Adversarial Learn. for Multimedia (ACM, New York), 1–7.Google Scholar
- (2021) Swin transformer: Hierarchical vision transformer using shifted windows. Proc. IEEE/CVF Internat. Conf. on Computer Vision (IEEE, Piscataway, NJ), 10012–10022.Google Scholar
- (2013) A survey of cost-sensitive decision tree induction algorithms. ACM Comput. Survey 45(2):1–35.Google Scholar
- (2017) Decoupled weight decay regularization. Preprint, submitted November 14, https://arxiv.org/abs/1711.05101.Google Scholar
- (2018) Toward deep learning models resistant to adversarial attacks. Proc. Internat. Conf. on Learn. Representations (ICLR, Appleton, WI).Google Scholar
- (2016) Deepfool: A simple and accurate method to fool deep neural networks. Proc. IEEE Conf. on Computer Vision and Pattern Recognition (IEEE, Piscataway, NJ), 2574–2582.Google Scholar
- (2022) The majority can help the minority: Context-rich minority oversampling for long-tailed classification. Proc. IEEE/CVF Conf. on Computer Vision and Pattern Recognition (IEEE, Piscataway, NJ), 6887–6896.Google Scholar
- (2023) Adversarial learning with cost-sensitive classes. IEEE Trans. Cybernetics 53(8):4855–4866.Google Scholar
- (2019) Super-convergence: Very fast training of neural networks using large learning rates. Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications, vol. 11006 (SPIE, Bellingham, WA), 369–386.Google Scholar
- (2022) A continual learning framework for adaptive defect classification and inspection. Preprint, submitted March 16, https://arxiv.org/abs/2203.08796.Google Scholar
- (2013) Intriguing properties of neural networks. Preprint, submitted December 21, https://arxiv.org/abs/1312.6199.Google Scholar
- (2010) One-sided support vector regression for multiclass cost-sensitive classification. Proc. Internat. Conf. on Machine Learn. (PMLR, New York).Google Scholar
- (2017) Attention is all you need. Adv. Neural Inform. Processing Systems 30 (MIT Press, Cambridge, MA), 5998–6008.Google Scholar
- (2018) Generalizing to unseen domains via adversarial data augmentation. Preprint, submitted May 30, https://arxiv.org/abs/1805.12018.Google Scholar
- Wyeth Pharmaceuticals Inc (2018) Cordarone [package insert]. Accessed July 8, 2022, https://www.accessdata.fda.gov/drugsatfda_docs/label/2018/018972s054lbl.pdf.Google Scholar
- (2021) Gifair-fl: An approach for group and individual fairness in federated learning. Preprint, submitted August 5, https://arxiv.org/abs/2108.02741.Google Scholar
- (2001) Learning and making decisions when costs and probabilities are both unknown. Proc. 7th ACM SIGKDD Internat. Conf. on Knowledge Discovery and Data Mining (ACM, New York), 204–213.Google Scholar
- (2003) Cost-sensitive learning by cost-proportionate example weighting. Proc. 3rd IEEE Internat. Conf. on Data Mining (IEEE, New York), 435–442.Google Scholar
- (2018) Cost-sensitive robustness against adversarial examples. Preprint, submitted October 22, https://arxiv.org/abs/1810.09225.Google Scholar
- (2016) Understanding deep learning requires rethinking generalization. Preprint, submitted November 10, https://arxiv.org/abs/1611.03530.Google Scholar
- (2021) Understanding deep learning (still) requires rethinking generalization. Comm. ACM 64(3):107–115.Google Scholar
- (2023) Deep long-tailed learning: A survey. IEEE Trans. Pattern Anal. Machine Intelligence (IEEE, Piscataway, NJ).Google Scholar
- (2005) Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans. Knowledge Data Engrg. 18(1):63–77.Google Scholar

