Bayesian Optimization over Mixed Type Inputs with Encoding Methods (original) (raw)

Abstract

Traditional Bayesian Optimization (BO) algorithms assume that the objective function is defined over numeric input space. To generalize BO for mixed numeric and categorical inputs, existing approaches mainly model or optimize them separately and thus cannot fully capture the relationship among different types of inputs. The complexity incurred by additional operations for the categorical inputs in these approaches can also reduce the efficiency of BO, especially when facing high-cardinality inputs. In this paper, we revisit the encoding approaches, which transfer categorical inputs to numerical ones to form a concise and easy-to-use BO framework. Specifically, we propose the target mean encoding BO (TmBO) and aggregate encoding BO (AggBO), where TmBO transfers each value of a categorical input based on the outputs corresponding to this value, and AggBO encodes multiple choices of a categorical input through several distinct ranks. Different from the prominent one-hot encoding, both approaches transfer each categorical input into exactly one numerical input and thus avoid severely increasing the dimension of the input space. We demonstrate that TmBO and AggBO are more efficient than existing approaches on several synthetic and real-world optimization tasks.

References

Abdelkhalik, O.: Autonomous planning of multigravity-assist trajectories with deep space maneuvers using a differential evolution approach. Int. J. Aerosp. Eng. 2013 (2013)
Google Scholar
Bergstra, J., Bardenet, R., Bengio, Y., Kégl, B.: Algorithms for hyper-parameter optimization. In: Advances in Neural Information Processing Systems, vol. 24 (2011)
Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Article MATH Google Scholar
Deshwal, A., Belakaria, S., Doppa, J.R.: Bayesian optimization over hybrid spaces. In: International Conference on Machine Learning, pp. 2632–2643. PMLR (2021)
Google Scholar
González, J., Dai, Z.: GPYOPT: a Bayesian optimization framework in Python. Accessed (2016)
Google Scholar
Häse, F., Aldeghi, M., Hickman, R.J., Roch, L.M., Aspuru-Guzik, A.: GRYFFIN: An algorithm for Bayesian optimization of categorical variables informed by expert knowledge. Appl. Phys. Rev. 8(3), 031406 (2021)
Article Google Scholar
Hutter, F., Hoos, H.H., Leyton-Brown, K.: Sequential model-based optimization for general algorithm configuration. In: Coello, C.A.C. (ed.) LION 2011. LNCS, vol. 6683, pp. 507–523. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25566-3_40
Chapter Google Scholar
Lakshminarayanan, B., Roy, D.M., Teh, Y.W.: Mondrian forests for large-scale regression when uncertainty matters. In: Artificial Intelligence and Statistics, pp. 1478–1487. PMLR (2016)
Google Scholar
Larionov, M.: Sampling techniques in Bayesian target encoding. arXiv preprint arXiv:2006.01317 (2020)
McCane, B., Albert, M.: Distance functions for categorical and mixed variables. Pattern Recogn. Lett. 29(7), 986–993 (2008)
Article Google Scholar
Micci-Barreca, D.: A preprocessing scheme for high-cardinality categorical attributes in classification and prediction problems. ACM SIGKDD Explor. Newsl. 3(1), 27–32 (2001)
Article Google Scholar
Mockus, J., Tiesis, V., Zilinskas, A.: The application of Bayesian methods for seeking the extremum. Towards Glob. Optim. 2(117–129), 2 (1978)
MATH Google Scholar
Mougan, C., Masip, D., Nin, J., Pujol, O.: Quantile encoder: tackling high cardinality categorical features in regression problems. In: Torra, V., Narukawa, Y. (eds.) MDAI 2021. LNCS (LNAI), vol. 12898, pp. 168–180. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85529-1_14
Chapter Google Scholar
Nguyen, D., Gupta, S., Rana, S., Shilton, A., Venkatesh, S.: Bayesian optimization for categorical and category-specific continuous inputs. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 5256–5263 (2020)
Google Scholar
Pargent, F., Bischl, B., Thomas, J.: A benchmark experiment on how to encode categorical features in predictive modeling. Ph.D. thesis, Master Thesis in Statistics, Ludwig-Maximilians-Universität München ... (2019)
Google Scholar
Pargent, F., Pfisterer, F., Thomas, J., Bischl, B.: Regularized target encoding outperforms traditional methods in supervised machine learning with high cardinality features. arXiv preprint arXiv:2104.00629 (2021)
Ru, B., Alvi, A., Nguyen, V., Osborne, M.A., Roberts, S.: Bayesian optimisation over multiple continuous and categorical inputs. In: International Conference on Machine Learning, pp. 8276–8285. PMLR (2020)
Google Scholar
Sheather, S.J., Jones, M.C.: A reliable data-based bandwidth selection method for kernel density estimation. J. Roy. Stat. Soc.: Ser. B (Methodol.) 53(3), 683–690 (1991)
MathSciNet MATH Google Scholar
Srinivas, N., Krause, A., Kakade, S.M., Seeger, M.: Gaussian process optimization in the bandit setting: no regret and experimental design. arXiv preprint arXiv:0912.3995 (2009)

Download references

Acknowledgement

This work is supported by the National Natural Science Foundation (NNSF) of China under Grant 72101106 and the Shenzhen Science and Technology Program under Grant No. RCBS20210609103119020.

Author information

Authors and Affiliations

SUSTech Business School, Southern University of Science and Technology, Shenzhen, 518055, China
Zhihao Liu, Weiming Ou & Songhao Wang

Authors

Zhihao Liu
Weiming Ou
Songhao Wang

Corresponding author

Correspondence toSonghao Wang .

Editor information

Editors and Affiliations

Kyoto University, Kyoto, Japan
Hisashi Kashima
IBM Research, Thomas J. Watson Research Center, Yorktown Heights, NY, USA
Tsuyoshi Ide
National Chiao Tung University, Hsinchu, Taiwan
Wen-Chih Peng

Rights and permissions

Copyright information

About this paper

Cite this paper

Liu, Z., Ou, W., Wang, S. (2023). Bayesian Optimization over Mixed Type Inputs with Encoding Methods. In: Kashima, H., Ide, T., Peng, WC. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2023. Lecture Notes in Computer Science(), vol 13936. Springer, Cham. https://doi.org/10.1007/978-3-031-33377-4\_16

Download citation

.RIS
.ENW
.BIB
DOI: https://doi.org/10.1007/978-3-031-33377-4\_16
Published: 28 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33376-7
Online ISBN: 978-3-031-33377-4
eBook Packages: Computer Science Computer Science (R0)Springer Nature Proceedings Computer Science