Part-level 3D shape generation driven by user intention inference with preferential Bayesian optimization

Lee, Seung Won; Choi, Jiin; Hyun, Kyung Hoon

doi:10.1038/s41598-026-38916-7

Article
Open access
Published: 07 February 2026

Seung Won Lee^1,2, Jiin Choi^1,2 & Kyung Hoon Hyun^1,2

Scientific Reports (2026)

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Abstract

Advancements in generative artificial intelligence have introduced state-of-the-art models capable of producing impressive visual shape outputs. However, when supporting decisions during the three-dimensional shape creation process, it becomes crucial to prioritize outputs that align with designers’ needs over mere visual craftsmanship. Furthermore, designers often intricately combine three-dimensional parts of various shapes to create novel designs, so the ability to generate designs that align with their intentions at the part level is pivotal for assisting them. Hence, we introduce BOgen, a novel system that empowers designers to proactively generate and synthesize part-level three-dimensional shapes and enhances their overall user experience by reflecting designer intentions through Bayesian optimization. We assessed BOgen’s performance in a study involving 30 designers. The results revealed that, compared to the baseline, BOgen fulfilled designers’ requirements for part-level three-dimensional shape recommendations and for guidance through the shape exploration space. BOgen assists designers in navigation and development, offering design suggestions and fostering proactive design exploration and creation during early-stage design ideation.

Data availability

The data underlying this article will be shared on reasonable request to the corresponding author.

Funding

This work was supported by the Technology Innovation Program (RS-2025-02317326, Development of AI-Driven Design Generation Technology Based on Designer Intent) funded by the Ministry of Trade, Industry & Energy (MOTIE, Korea) and National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIP: Ministry of Science, ICT and Future Planning) (RS-2023-00208542).

Author information

Authors and Affiliations

Department of Interior Architecture Design, Hanyang University, Seoul, 04763, Republic of Korea
Seung Won Lee, Jiin Choi & Kyung Hoon Hyun
Human-Centered AI Design Institute, Hanyang University, Seoul, 04763, Republic of Korea
Seung Won Lee, Jiin Choi & Kyung Hoon Hyun

Contributions

S.W.L. led the conceptualization and methodology design, and developed and prepared the original draft of the manuscript. J.C. contributed to writing, reviewing, and editing the manuscript. K.H.H. supervised the project and participated in writing, reviewing, and editing. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Kyung Hoon Hyun.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Lee, S.W., Choi, J. & Hyun, K.H. Part-level 3D shape generation driven by user intention inference with preferential Bayesian optimization. Sci Rep (2026). https://doi.org/10.1038/s41598-026-38916-7

Received: 13 October 2025
Accepted: 31 January 2026
Published: 07 February 2026
DOI: https://doi.org/10.1038/s41598-026-38916-7
