Abstract
Advancements in generative artificial intelligence have introduced state-of-the-art models capable of producing impressive visual shape outputs. However, when it comes to supporting decisions during the three-dimensional shape creation process, prioritizing outputs that align with designers’ needs over mere visual craftsmanship becomes crucial. Furthermore, designers often intricately combine three-dimensional parts of various shapes to create novel designs. The ability to generate designs that align with the designers’ intentions at the part-level is pivotal for assisting designers. Hence, we introduced BOgen, a novel system that empowers designers to proactively generate and synthesize part-level three-dimensional shapes and enhances their overall user experience by reflecting designer intentions through Bayesian optimization. We assessed BOgen’s performance using a study involving 30 designers. The results revealed that, compared to the baseline, BOgen fulfilled the designer requirements for three-dimensional shape part recommendations and shape exploration space guidance. BOgen assists designers in navigation and development, offering design suggestions and fostering proactive design exploration and creation during early-stage design ideation.
Data availability
The data underlying this article will be shared on reasonable request to the corresponding author.
References
Ramesh, A., Dhariwal, P., Nichol, A., Chu, C. & Chen, M. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.061251, 3, https://doi.org/10.48550/arXiv.2204.06125 (2022).
Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. High-resolution image synthesis with latent diffusion models. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (IEEE, 2022).
von Rütte, D., Fedele, E., Thomm, J. & Wolf, L. Fabric: Personalizing diffusion models with iterative feedback. arXiv preprint arXiv:2307.10159https://doi.org/10.48550/arXiv.2307.10159 (2023).
Zheng, X.-Y. et al. Locally attentional SDF diffusion for controllable 3d shape generation. ACM Transactions on Graphics 42, 1–13.
Liu, V., Vermeulen, J., Fitzmaurice, G. & Matejka, J. 3dall-e: Integrating text-to-image ai in 3d design workflows. In Proceedings of the 2023 ACM Designing Interactive Systems Conference, DIS ’23, 1955–1977, https://doi.org/10.1145/3563657.3596098 (Association for Computing Machinery, New York, NY, USA, 2023).
Sanghi, A. et al. Clip-sculptor: Zero-shot generation of high-fidelity and diverse shapes from natural language. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 18339–18348, https://doi.org/10.1109/CVPR52729.2023.01759 (IEEE Computer Society, Los Alamitos, CA, USA, 2023).
Park, J. & Kang, N. Bmo-gnn: Bayesian mesh optimization for graph neural networks to enhance engineering performance prediction. J. Comput. Des. Eng. 11, 260–271, https://doi.org/10.1093/jcde/qwae102 (2024). https://academic.oup.com/jcde/article-pdf/11/6/260/61212383/qwae102.pdf.
Hertz, A., Perel, O., Giryes, R., Sorkine-Hornung, O. & Cohen-Or, D. Spaghetti: Editing implicit shapes through part aware generation. ACM Transactions on Graphics (TOG) 41, 1–20. https://doi.org/10.1145/3528223.3530084 (2022).
Hui, K.-H., Li, R., Hu, J. & Fu, C.-W. Neural wavelet-domain diffusion for 3d shape generation. In SIGGRAPH Asia 2022 Conference Papers, (ACM, 2022).
Koo, J., Yoo, S., Nguyen, M. H. & Sung, M. Salad: Part-level latent diffusion for 3d shape generation and manipulation. arXiv preprint arXiv:2303.12236https://doi.org/10.48550/arXiv.2303.12236 (2023).
Lee, H., Lee, J., Kim, H. & Mun, D. Dataset and method for deep learning-based reconstruction of 3d cad models containing machining features for mechanical parts. J. Comput. Des. Eng. 9, 114–127, https://doi.org/10.1093/jcde/qwab072 (2021). https://academic.oup.com/jcde/article-pdf/9/1/114/41988785/qwab072.pdf.
Nyamsuren, P., Lee, S.-H., Hwang, H.-T. & Kim, T.-J. A web-based collaborative framework for facilitating decision making on a 3d design developing process. J. Comput. Des. Eng. 2, 148–156, https://doi.org/10.1016/j.jcde.2015.02.001 (2015). https://academic.oup.com/jcde/article-pdf/2/3/148/33133610/j.jcde.2015.02.001.pdf.
Auyeskhan, U. et al. Virtual reality-based assembly-level design for additive manufacturing decision framework involving human aspects of design. J. Comput. Des. Eng. 10, 1126–1142, https://doi.org/10.1093/jcde/qwad041 (2023). https://academic.oup.com/jcde/article-pdf/10/3/1126/52600312/qwad041.pdf.
Son, K. & Hyun, K. H. Designer-centric spatial design support. Autom. Constr. 137, 104195. https://doi.org/10.1016/j.autcon.2022.104195 (2022).
Son, K., Lee, S. W., Yoon, W. & Hyun, K. H. Creativesearch: Proactive design exploration system with bayesian information gain and information entropy. Autom. Constr. 142, 104502. https://doi.org/10.1016/j.autcon.2022.10450 (2022).
Pandey, K., Chevalier, F. & Singh, K. Juxtaform: interactive visual summarization for exploratory shape design. ACM Trans. Graph. 42, https://doi.org/10.1145/3592436 (2023).
Averkiou, M., Kim, V. G., Zheng, Y. & Mitra, N. J. Shapesynth: Parameterizing model collections for coupled shape exploration and synthesis. Comput. Graph. Forum 33, 125–134. https://doi.org/10.1111/cgf.12310 (2014).
Matejka, J. et al. Dream lens: Exploration and visualization of large-scale generative design datasets. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, 1–12, https://doi.org/10.1145/3173574.3173943 (Association for Computing Machinery, New York, NY, USA, 2018).
Evirgen, N. & Chen, X. A. Ganzilla: User-driven direction discovery in generative adversarial networks. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, UIST ’22, https://doi.org/10.1145/3526113.3545638 (Association for Computing Machinery, New York, NY, USA, 2022).
Evirgen, N. & Chen, X. A. Ganravel: User-driven direction disentanglement in generative adversarial networks. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, CHI ’23, https://doi.org/10.1145/3544548.3581226 (Association for Computing Machinery, New York, NY, USA, 2023).
Dang, H., Mecke, L. & Buschek, D. Ganslider: How users control generative models for images using multiple sliders with and without feedforward information. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, CHI ’22, https://doi.org/10.1145/3491102.3502141 (Association for Computing Machinery, New York, NY, USA, 2022).
Wu, Y., Ma, L., Yuan, X. & Li, Q. Human–machine hybrid intelligence for the generation of car frontal forms. Advanced Engineering Informatics 55, 101906. https://doi.org/10.1016/j.aei.2023.101906 (2023).
Kadner, F., Keller, Y. & Rothkopf, C. AdaptiFont: Increasing individuals’ reading speed with a generative font model and bayesian optimization. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, (ACM, 2021).
Koyama, Y. & Goto, M. BO as assistant: Using bayesian optimization for asynchronously generating design suggestions. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, (ACM, 2022).
Lee, S. W., Kim, H., Yi, T. & Hyun, K. H. Bigaze: An eye-gaze action-guided bayesian information gain framework for information exploration. Advanced Engineering Informatics 58, 102159. https://doi.org/10.1016/j.aei.2023.102159 (2023).
Liu, W., d’Oliveira, R. L., Beaudouin-Lafon, M. & Rioul, O. Bignav: Bayesian information gain for guiding multiscale navigation. In Proceedings of the 2017 CHI conference on human factors in computing systems, 5869–5880, https://doi.org/10.1145/3025453.3025524 (2017).
Son, K., Kim, K. & Hyun, K. H. BIGexplore: Bayesian information gain framework for information exploration. In CHI Conference on Human Factors in Computing Systems, (ACM, 2022).
Danhaive, R. & Mueller, C. T. Design subspace learning: Structural design space exploration using performance-conditioned generative modeling. Autom. Constr. 127, 103664, (2021).
Choi, J. & Hyun, K. H. Typeface network and the principle of font pairing. Sci. Reports 14, 30820. https://doi.org/10.1038/s41598-024-81601-w (2024).
Umetani, N. Exploring generative 3d shapes using autoencoder networks. In SIGGRAPH Asia,. Technical Briefs. SA ’ 17, 2017. https://doi.org/10.1145/3145749.3145758 (Association for Computing Machinery, New York, NY, USA, 2017).
Achlioptas, P., Diamanti, O., Mitliagkas, I. & Guibas, L. Learning representations and generative models for 3d point clouds. In International conference on machine learning, 40–49, https://proceedings.mlr.press/v80/achlioptas18a.html (PMLR, 2018).
Zhang, S. et al. Brep2seq: a dataset and hierarchical deep learning network for reconstruction and generation of computer-aided design models. Journal of Computational Design and Engineering 11, 110–134, https://doi.org/10.1093/jcde/qwae005 (2024). https://academic.oup.com/jcde/article-pdf/11/1/110/57275777/qwae005.pdf.
Hao, Z., Averbuch-Elor, H., Snavely, N. & Belongie, S. DualSDF: Semantic shape manipulation using a two-level representation. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (IEEE, 2020).
Hyun, K. H. & Lee, J.-H. Balancing homogeneity and heterogeneity in design exploration by synthesizing novel design alternatives based on genetic algorithm and strategic styling decision. Adv. Eng. Informatics 38, 113–128. https://doi.org/10.1016/j.aei.2018.06.005 (2018).
Ban, S. & Hyun, K. H. 3d computational sketch synthesis framework: Assisting design exploration through generating variations of user input sketch and interactive 3d model reconstruction. Comput. Des. 120, 102789. https://doi.org/10.1016/j.cad.2019.102789 (2020).
Khan, S. & Awan, M. J. A generative design technique for exploring shape variations. Adv. Eng. Informatics 38, 712–724. https://doi.org/10.1016/j.aei.2018.10.005 (2018).
Zhou, Y., Koyama, Y., Goto, M. & Igarashi, T. Interactive exploration-exploitation balancing for generative melody composition. In 26th International Conference on Intelligent User Interfaces, IUI ’21, 43–47, https://doi.org/10.1145/3397481.3450663 (Association for Computing Machinery, New York, NY, USA, 2021).
Shi, M., Seo, J., Cha, S. H., Xiao, B. & Chi, H.-L. Generative ai-powered architectural exterior conceptual design based on the design intent. J. Comput. Des. Eng. 11, 125–142, https://doi.org/10.1093/jcde/qwae077 (2024). https://academic.oup.com/jcde/article-pdf/11/5/125/59140299/qwae077.pdf.
Chu, W. & Ghahramani, Z. Preference learning with gaussian processes. In Proceedings of the 22nd international conference on Machine learning, 137–144, https://doi.org/10.1145/1102351.1102369 (2005).
Eric, B., Freitas, N. & Ghosh, A. Active preference learning with discrete choice data. Advances in neural information processing systems 20, https://doi.org/10.5555/2981562.2981614 (2007).
González, J., Dai, Z., Damianou, A. & Lawrence, N. D. Preferential bayesian optimization. In International Conference on Machine Learning, 1282–1291, https://doi.org/10.48550/arXiv.1704.03651 (PMLR, 2017).
Brochu, E. Interactive Bayesian optimization: learning user preferences for graphics and animation. Ph.D. thesis, University of British Columbia (2010). https://doi.org/10.14288/1.0051462.
Brochu, E., Cora, V. M. & De Freitas, N. A tutorial on bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv preprint arXiv:1012.2599https://doi.org/10.48550/arXiv.1012.2599 (2010).
Brochu, E., Brochu, T. & De Freitas, N. A bayesian interactive optimization approach to procedural animation design. In Proceedings of the 2010 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 103–112, https://doi.org/10.5555/1921427.1921443 (2010).
Gelbart, M. A., Snoek, J. & Adams, R. P. Bayesian optimization with unknown constraints. arXiv preprint arXiv:1403.5607https://doi.org/10.48550/arXiv.1403.5607 (2014).
Hernández-Lobato, J. M., Gelbart, M. A., Adams, R. P., Hoffman, M. W. & Ghahramani, Z. A general framework for constrained bayesian optimization using information-based search. J. Mach. Learn. Res. 17, 1–53, https://doi.org/10.48550/arXiv.1511.09422 (2016).
Liu, W., Rioul, O., Mcgrenere, J., Mackay, W. E. & Beaudouin-Lafon, M. Bigfile: Bayesian information gain for fast file retrieval. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1–13, https://doi.org/10.1145/3173574.3173959 (2018).
Bradley, R. A. & Terry, M. E. Rank analysis of incomplete block designs: I. the method of paired comparisons. Biometrika 39, 324, (1952).
Seeger, M. Gaussian processes for machine learning. Int. journal neural systems 14, 69–106. https://doi.org/10.1142/S0129065704001899 (2004).
Srinivas, N., Krause, A., Kakade, S. M. & Seeger, M. W. Information-theoretic regret bounds for gaussian process optimization in the bandit setting. IEEE Transactions on Information Theory 58, 3250–3265, (2012).
Schonlau, M., Welch, W. J. & Jones, D. R. Global versus local search in constrained optimization of computer models. In Institute of Mathematical Statistics Lecture Notes - Monograph Series, 11–25, (Institute of Mathematical Statistics, 1998).
Goldschmidt, G. Linkographic evidence for concurrent divergent and convergent thinking in creative design. Creat. research journal 28, 115–122. https://doi.org/10.1080/10400419.2016.1162497 (2016).
Funding
This work was supported by the Technology Innovation Program (RS-2025-02317326, Development of AI-Driven Design Generation Technology Based on Designer Intent) funded by the Ministry of Trade, Industry & Energy (MOTIE, Korea) and National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIP: Ministry of Science, ICT and Future Planning) (RS-2023-00208542).
Author information
Authors and Affiliations
Contributions
S.W.L. led the conceptualization and methodology design, developed and prepared the original draft of the manuscript. J.C. contributed to writing, reviewing, and editing the manuscript. K.H.H. supervised the project and participated in writing, reviewing, and editing. All authors reviewed and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Supplementary Information.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Lee, S.W., Choi, J. & Hyun, K.H. Part-level 3D shape generation driven by user intention inference with preferential Bayesian optimization. Sci Rep (2026). https://doi.org/10.1038/s41598-026-38916-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-38916-7