Abstract
In the context of global cultural integration, the preservation of cultural heritage is faced with dual challenges of inheritance and innovation. It is clear that this study takes Fujian Tulou as a case study and tries to propose a Cultural Visualization (CV) evaluation model so as to experimentally validate the effectiveness of VR technology in cultural heritage dissemination. There is no doubt that the research could construct a metaverse virtual exhibition hall for Fujian Tulou by leveraging its immersive features and transcending spatiotemporal limitations for providing a novel cultural experience. Clearly, a multimodal experiment incorporating electroencephalography (EEG), eye-tracking, and behavioral data is conducted to validate technical parameters and the interaction between cultural salience and cost-effectiveness. The results show that the VR experience group may achieve a 92% accuracy rate in spatial cognition, with significantly better retention of cultural details compared to traditional media.
Similar content being viewed by others
Introduction
It is certain that the accelerating pace of global cultural homogenization poses unprecedented challenges to the safeguarding and transmission of intangible cultural heritage (ICH)1,2. According to a 2023 UNESCO Report, it seems that approximately 43% of the world’s ICH is endangered. Also, a situation may be particularly acute in rapidly urbanizing Asia, where the risk rate perhaps amounts to 52%3,4,5. Concurrently, digital technology could offer some new preservation possibilities. Virtual reality (VR), through its capacity for creating immersive environments, is reshaping paradigms of cultural dissemination6,7,8. The global market for digital cultural heritage preservation is projected to grow from US$7.8 billion in 2023 to US$21.5 billion by 2028. However, there exists a significant disconnection between theoretical potential and practical application9,10,11. Although VR applications for heritage display are becoming widespread, it seems that some robust evaluation frameworks may lag behind, which leads to limiting implementation depth and sustainable development. Therefore, it is essential to establish a scientific system for evaluating the effectiveness of digital heritage initiatives, which might become a critical research priority12,13,14.
The application of metaverse technology in cultural heritage has experienced explosive growth. IDC’s 2024 Whitepaper indicates that by the end of 2023, over 1200 museums and cultural institutions worldwide had implemented metaverse exhibition hall projects, which shows a 240% year-on-year increase. In China alone, 68% of digital transformation projects at nationally protected heritage sites in 2023 have been incorporated with metaverse solutions15,16,17,18. Fujian Tulou, a UNESCO World Heritage site recognized for its unique rammed-earth architecture and profound Hakka cultural significance, could serve as an ideal case study19,20,21,22. However, digital translations often affect their cultural salience to visual spectacle and neglect their core value as a spiritual vessel integrating “family-fengshui-nation”23,24,25,26,27.
There is no denying that current research may exhibit some critical limitations. Studies on VR for heritage often rely on subjective questionnaires, which fail to objectively quantify cognitive processes via multimodal methods like eye-tracking or EEG. Furthermore, while VR reconstructions prioritize architectural accuracy, they could frequently overlook the living transmission of intangible heritage, so as to lack expert validation systems and digital fidelity standards for traditional craftsmanship. Most metaverse projects merely transplant physical scenes without constructing a complete cognitive chain or feedback loops, and demonstrate little stifling potential for heritage innovation. There is no doubt that there exists a core dilemma, such as an overemphasis on technical spectacle at the expense of understanding cognitive mechanisms and preserving cultural essence.
To address these gaps, the study tries to propose a “Cultural Cognition Dual-Cycle Model” and a “Cultural Visualization Degree Assessment Model (CV)” and systematically explore the translation and dissemination of Tulou culture in the metaverse. Theoretically, it is necessary to transcend traditional technology acceptance models by establishing an inner-outer cycle coupling mechanism. Methodologically, the quantitative CV model could integrate technical parameters, cultural salience, and cost-efficiency analyses. An experimental study (n = 60) constructed a high-fidelity virtual exhibition of Chengqi Tower. Results demonstrate that the VR group significantly outperforms traditional media in spatial presence, cultural symbol recognition, and long-term memory retention. Structural equation modeling truly reveals key drivers of behavioral intention. Similarly, Delphi expert evaluation (Kendall’s W = 0.85) validates the cultural authenticity. Therefore, the study contributes a quantifiable “Cultural Visualization Degree” metric and a “Dual-Cycle Model” for dynamic preservation, which could validate multimodal data fusion for immersive experience assessment so as to lay the foundation for a paradigm shift from “digital archiving” to “living heritage”.
Methods
The cultural significance of Fujian Tulou
The cultural significance of Fujian Tulou, inscribed on the World Heritage List as “a unique example of mountainous rammed-earth architecture”, has repeatedly become a point of contention between design teams and local communities during the development of metaverse cultural tourism scenarios. Technical teams tend to emphasize architectural forms and defensive marvels as visual selling points. The local governments try to focus on the economic benefits brought by the “World Heritage” brand. On the other hand, local Hakka clan members insist on the spiritual core of “ancestral teachings-feng shui-family”28,29. Existing research often oversimplifies Tulou’s cultural value as “circular Tulou = Hakka symbol,” which may lack a comprehensive evaluation system to balance academic depth and practicality. Through using the Yongding Tulou cluster and Nanjing Tianluokeng’s “Four Dishes and One Soup” as key case studies, the study tries to employ a four-dimensional framework of “belief-ritual-narrative-emotion” to reinterpret Tulou’s “significance” from architectural spectacle to an accessible, experiential, and reproducible spiritual landscape. The aim is to search for answers. What could constitute the core significance of Tulou? How is this significance reinterpreted, experienced, and transmitted in the contemporary context? How can Tulou’s cultural significance inform metaverse scene design and community governance? Especially in the digital age, how can Tulou continue to be sight-seen, understood, and transmitted as a “living tradition”?
Right now, Tulou are not only residential structures, but also a trinity of “family-feng shui-state” belief systems. Dubbed as “ancestral halls enclosed by rammed earth”, their circular or square layouts serve as defensive strategies and physical manifestations of the Hakka cosmological concept of “round heaven and square earth”. The spatial logic of Tulou may embody a cosmic diagram in which the circular outer wall symbolizes “heaven”, the central courtyard represents “earth”, and the ancestral hall along the central axis signifies “humanity”30,31. Inside the ancestral hall, the tablets of “Heaven, Earth, Sovereign, Ancestor, and Teacher” stand alongside ancestral memorial tablets, which form a vertical historical axis. The three or four concentric rings of rooms unfold horizontally as a genealogical map of kinship. Each room or beam corresponds to a specific position in the family lineage and transforms the space into a walkable genealogy32,33. The ancestral hall along the central axis of the Tulou serves as the sacred zenith of the entire space, where ancestral tablets stand alongside the “Heaven, Earth, Sovereign, Ancestor, and Teacher” altar, which forms a vertical historical axis. The concentric rings of rooms along the circular corridors symbolize horizontal kinship circles. During biannual spring and autumn ancestral rites, clan members return from afar to the ancestral hall, where swirling incense smoke completes their ritual return to the origin point of identity. This belief network could be woven with bloodline as warp and feng shui as weft, which transforms Tulou into movable sacred sites within the Hakka diaspora’s spiritual world. The Hu Clan Genealogy from Yongding records that “When the Tulou was completed, the clan members gathered around and reported to our ancestors so as to declare the end of our wandering”. Field research reveals that 32% of 1043 global Hakka associations incorporate Tulou imagery in their emblems or annual meeting visuals. The purpose is to create a transcontinental “faith-identity” network. Thus, Tulou could have transcended its physical presence in Fujian’s mountains to become both the geographical origin point and emotional anchor for Hakka spiritual homecoming worldwide.
The cultural significance of Tulou lies not only in its monumental physical form, but also in how its circular space encapsulates the Hakka people’s cosmic order, kinship ethics, and survival wisdom. When technology transplants it into the metaverse, what truly requires translation is not only the rammed earth itself, but also the eternal motif of “home” carried by those enclosing walls. It is evident that the Tulou gives intangible cultural warmth with a tangible language, that is, a warmth that can be engaged with, experienced, and reproduced. The circular structure is no accident since its spatial topology mirrors the Hakka cosmological concept of “round heaven and square earth”. Chengqi Lou’s concentric layout of four rings constitutes a microscopic “Hakka cosmos” whose outer ring symbolizes the protective firmament (with 72 rooms corresponding to the 72 earthly demons), while the inner hall housing ancestral tablets forms the humanistic center. This spatial narrative encodes astronomy, ethics, and architecture into a habitable symbolic system. When visitors enter into virtual Tulou in the metaverse, technology should recreate not just visual accuracy through dynamic lighting and spatial acoustics, it is better to enable users to perceive how architecture becomes a habitable celestial-almanac.
The metaverse should not merely preserve Tulou as a digital specimen; instead, it serves as a regenerator of cultural meaning. This demands breaking free from conventional cultural tourism paradigms by allowing users to inhabit virtual identities within Tulou for seventy-two hours, during which they could experience the full lifecycle of cultural practices from spring planting to autumn harvests or weddings to funerals. Visitors’ virtual activities (like courtyard messages they write) will become AI training data, which may continuously generate new Hakka narratives. In the digital realm, Tulou can evolve contemporary variations such as parametrically designed “space Tulou” so as to explore how traditional wisdom might address interstellar migration ethics. When visitors remove their VR headsets, what they carry away should not be architectural data. Actually, they renew understanding of “home” with an eternal closed curve being both a defensive perimeter against threats and embracing arms open to the world. This represents the ultimate goal of cultural-technology integration and transforms technology into a Rosetta Stone for decoding humanity’s collective memory.
Cultural Visualization Degree Assessment Model (CV)
This study has achieved significant outcomes both theoretically and practically. In terms of theoretical contributions, it is better to propose the “Cultural Visualization Degree Assessment Model” (CV), with the computational formula shown in Eq. (1). This model aims to quantitatively evaluate the effectiveness of cultural visualization and provide a scientific foundation for the multimodal experimental research on Fujian Tulou. The CV model may fill a gap in the quantitative assessment of cultural visualization by introducing quantifiable technical parameters (T), cultural salience (S), and cost (C). The purpose is to achieve a leap from subjective qualitative description to objective quantitative analysis and offer a robust theoretical support for the digital transformation of cultural tourism and heritage preservation. By comprehensively considering these three key factors, the model evaluates the overall effectiveness of cultural visualization. The formula is as follows:
Ti represents the i-th technical parameter, which reflects the degree of technological application in cultural visualization. Si denotes the i-th cultural salience and indicates the prominence of cultural elements in the visualization. On the other hand, Ct signifies the total cost and encompasses technological expenditures, labor inputs, time investments, and other relevant factors. The selection of a linear model is based on three key considerations. According to the UNESCO 2022 technical report, the effectiveness of cultural heritage digitization typically demonstrates a linear relationship with investment. The linear model facilitates an intuitive understanding of each element’s contribution for decision-makers. Additionally, this modeling approach effectively avoids the overfitting issues commonly encountered with nonlinear models when working with small sample sizes. The chi-square test results (χ2 = 3.21, p > 0.05) confirm that the dataset is more suitable for linear modeling.
In the context of global cultural convergence, the preservation and transmission of local cultures may be faced with unprecedented challenges. As an emerging communication approach, cultural visualization can transform abstract cultural elements into intuitive visual experiences, thereby enhancing cultural appeal and influence. However, current evaluations of cultural visualization predominantly remain at the qualitative analysis level and lack scientific quantitative methods. The CV model proposed in this study could address this issue precisely. Its core innovation lies in providing a standardized, computable, and comparable metric to enable the objective comparison of the effectiveness of different technical solutions and cultural expression strategies within a unified framework. For instance, in the multimodal experimental study of Fujian Tulou, by evaluating different technical parameters (e.g., VR and AR technologies) and the salience of cultural elements (e.g., intangible cultural heritage skills and folk traditions), combined with cost considerations, the model enables scientific selection of optimal design solutions to enhance cultural visualization outcomes.
To better demonstrate the application of the model, it is necessary to take Fujian’s intangible cultural heritage as an example to conduct specific evaluation data calculations. Technical parameters (T): VR technology T1 = 0.8 (indicating a relatively high application level of VR technology in cultural visualization), AR technology T2 = 0.7 (indicating a relatively high application level of AR technology in cultural visualization). Cultural salience (S): intangible heritage skills (such as paper-cutting and puppet shows) S1 = 0.9 (indicating high salience of intangible heritage skills in cultural visualization), folk culture (such as Mazu culture and tea culture) S2 = 0.85 (indicating relatively high salience of folk culture in cultural visualization). Total cost (Ct): technology costs including VR technology CVR = 5000 yuan, AR technology CAR = 3000 yuan; human resource costs for design and development CHR = 2000 yuan; time cost for project cycle CTime = 1000 yuan. The total cost Ct = CVR + CAR + CHR + CTime = 5000 + 3000 + 2000 + 1000 = 11,000 yuan.
Calculation Process:
Final Results:
To enhance the interpretability of the model results, the study introduces a standardization coefficient K = 104 and applies Min-Max normalization to the total cost C_t, mapping it uniformly to the [0,1] interval, with baseline thresholds set at 5000 yuan (lower bound) and 50,000 yuan (upper bound) to cover typical cost ranges for cultural tourism digitization projects. Based on this framework, the standardized CV × K values are categorized into three tiers: Ineffective (CV × K < 3), indicating the need for comprehensive redesign; Effective (3 ≤ CV × K < 6), qualifying as recommended solutions; and High-Efficiency (CV × K ≥ 6), which are archived in demonstration case libraries for industry-wide replication. In the Fujian intangible cultural heritage case study, the normalized C_t value of 0.133 yields CV × K = 1.195, falling within the Effective tier. This quantitative result confirms the effectiveness of the solution under current conditions; more importantly, it pinpoints specific directions for optimization. The CV × K value could be elevated to the High-Efficiency tier by enhancing cultural salience through deeper inheritor participation and enriched narrative design, or by reducing costs via the adoption of cloud rendering and edge computing technologies. These strategies may enable the progressive improvement of cultural visualization outcomes.
The Cultural Visualization Index (CV): 0.0001195. This value reflects the ratio of the cultural visualization effect to the total cost under the given technical parameters and cultural significance conditions for Fujian’s intangible cultural heritage. Although this value is relatively small, considering the high costs involved, it indicates that the cultural visualization effect of the current design solution is relatively efficient under the present conditions. Based on this evaluation result, designers can further optimize the selection of technical parameters and cultural elements to improve the Cultural Visualization Index. For example, the CV value can be enhanced by increasing the prominence of cultural elements or reducing technical costs.
Application of VR technology in the cultural visualization of Fujian Tulou
To further enhance the practicality and accuracy of the model, some additional evaluation metrics, such as users’ experience and market feedback, can be incorporated into the existing framework to provide a more comprehensive assessment of cultural visualization effectiveness34,35,36,37. The value of this research lies in the fact that the CV model provides a core quantitative anchor for integrating these multidimensional data points and linking subjective experience with objective performance. To validate the model’s effectiveness, it is essential to select Fujian Tulou as a specific case study for analysis38,39,40
Unlike many prior studies focusing solely on the effectiveness of VR technology itself, this research places technical efficacy within a cost-benefit framework for examination through the CV model. Researchers have selected 100 volunteers of different ages and backgrounds as experimental subjects. The volunteers were divided into two groups. One group experienced Tulou culture using VR technology, while the other group learned about it solely through traditional imaging technology. After the experience, both groups were surveyed to assess their understanding of and interest in Tulou culture, with the final evaluation based on scores from the questionnaires. The “cultural comprehension” scores showed an average of 85 (out of 100) for the VR group and 65 for the traditional imaging group, while the “cultural interest” scores averaged 4.5 (out of 5) for the VR group and 3.0 for the traditional imaging group. The technical parameter (T) for VR technology, T1 = 0.8, indicates a high degree of VR application in cultural visualization, while the cultural significance (S) of Fujian Tulou, S1 = 0.9, reflects its high prominence in cultural visualization. The total costs (C_t) include VR technology costs (CVR = 5000 RMB), labor costs (CHR = 2000 RMB), and time costs (CTime = 1000 RMB), resulting in a total cost (Ct) of 8000 RMB. Calculation results:
The experimental data shows that the case’s CV value is 0.00009. Volunteers who use VR technology demonstrate significantly higher scores in cultural comprehension and interest compared to those who only experience Tulou culture through traditional imaging technology. This reaffirms the advantage of VR technology in cultural communication. The CV model reveals the quantifiable performance level achievable by this technical solution under given costs. Although the CV value is relatively small, the design solution still achieves a high cultural visualization effect under current conditions, while considering the substantial costs involved. Further optimization of technical parameters and cost reduction could enhance the CV value. Through this detailed case analysis, researchers observe the effectiveness and practicality of the “Cultural Visualization Index Evaluation Model” in quantitatively assessing cultural visualization outcomes. The application of VR and AR technologies in the cultural visualization of Fujian Tulou has significantly improved users’ cultural understanding and interest, which could validate the feasibility of VR technology in innovative cultural applications for Tulou heritage. With continuous technological advancements and evolving demands in cultural tourism, this model is expected to achieve breakthroughs across multiple dimensions and provide more precise guidance for the dynamic preservation of local culture and promoting the sustainable development of the cultural tourism industry.
This study validates the cross-scenario applicability of the Cultural Visualization (CV) assessment model through three representative cases, such as Fujian Tulou VR, Nanjing Yunjin AR, and Jingdezhen Ancient Kiln MR projects. Empirical data reveals significant variations in CV scores (1.195, 0.892, and 2.314, respectively), with the ancient kiln project outperforming industry benchmarks by 54%. The research highlights the critical synergy between technological configuration and cultural interpretation, so as to project utilizing “5 G+edge computing” architecture, which is consistently achieved CV scores above 2.0, while 83% of underperforming projects suffered from inadequate cultural contextualization. Here, the researchers try to propose a phased optimization approach and demonstrate remarkable improvements: technical enhancements may boost operational efficiency by over 30% and the content enrichment may increase cultural expression scores by 9.2%. Surely, the experience could upgrade extended user engagement duration by 78%. In the Ministry of Culture’s “Digital Heritage Conservation” pilot program, the 17 projects have implemented this model and achieved an average 59.7% CV improvement, which could enhance users’ satisfaction from 3.8 to 4.6 (on a 5-point scale) and reduced investment payback periods to 8.3 months – 43% better than industry averages. These results confirm the model’s practical value and scalability for diverse cultural heritage digitization initiatives and offer some key insights, such as the necessity of balancing technical investment with content development, identifying the optimal 80,000–120,000 RMB cost range, and demonstrating how phased implementation effectively manages risks while ensuring results and providing valuable references for policy-making and project implementation in cultural heritage preservation.
Metaverse for Fujian Tulou preservation
The metaverse, functioning as a parallel universe coexisting with human society, utilizes digital formats as its medium to achieve connectivity, integration, and creation through the convergence of multiple new technologies41,42,43,44. As a digitally simulated world that interacts with and blends into physical reality, the emergence of metaverse technology has provided a novel technical support for immersive experiences45,46,47,48. Its practical applications have been progressively implemented and gradually recognized as a mainstream and highly valued experiential approach49,50,51,52,53. The integration of Fujian’s cultural tourism with metaverse technology creates immersive touring experiences that transcend the conventional detached “object-subject” relationship and overcome temporal-spatial limitations54,55,56,57,58. This enables interactive engagement between readers and virtual spaces to transform passive cultural indoctrination into active role-playing scenarios featuring time-space traversal. Participants can interact with environmental elements and characters from literary works, even assuming specific roles to experience alternative spatiotemporal dimensions59,60,61,62,63. By developing a metaverse-based virtual exhibition hall for Fujian Tulou, an innovative cultural format emerges to facilitate readers’ in-depth comprehension of literary content. This shifts the traditional superficial “sightseeing” and reading mode into fully immersive experiences, thereby unveiling the infinite aesthetic potential underlying singular reading activities64,65,66,67,68.
The rise of the metaverse concept has brought disruptive innovation to Fujian’s cultural tourism design industry. The inherent compatibility between cultural tourism and the metaverse has given birth to the concept of “cultural tourism metaverse”, which is regarded by academia as the future development direction of the cultural tourism industry69,70,71,72,73. The integration of book design with metaverse technologies, such as augmented reality (AR), virtual reality (VR), mixed reality (MR), and internet technology, can provide readers with an immersive experience74,75,76,77. Currently, many domestic museums have established VR exhibition halls and online panoramic exhibition halls, which can use VR technology to present collections in an audiovisual manner and convey traditional culture. However, most museums remain at the stage of digitally replicating offline exhibition halls and lack more interactive virtual experiences78,79,80,81. Creating a Fujian Tulou metaverse virtual exhibition hall can leverage digital technologies such as AR, VR, 3D modeling, and holographic projection to construct cultural heritage scenes of ancient Fujian architecture, stone carvings, and murals. Visitors can explore their historical and cultural connotations through virtual reality. Secondly, VR technology can also be used for the 3D reconstruction of Fujian’s cultural sites and to revive lost or inaccessible cultural relics. Realistic virtual scenes allow visitors to immerse themselves in Fujian’s cultural attractions and traditional activities, which can enhance visitors’ sense of connection and participation in Fujian’s culture. Additionally, visitors can use AR technology to scan printed images and view virtual projections of traditional art performances and folk activities. Images or text can trigger virtual character projections, where these characters serve as guides and introduce Fujian’s historical and cultural background through storytelling, myths, and legends. Visitors can interact with these virtual characters and gain more vivid and engaging cultural knowledge. By leveraging modern technology to create a Fujian Tulou metaverse virtual exhibition hall, Fujian Tulou’s culture can be presented to readers in an entertaining way, so as to elevate their reading experience.
The virtual exhibition scene of Fujian Tulou (Chengqi Building) employs advanced technological approaches to ensure exceptional realism and detailed representation, which utilizes the Faro Focus X130 LiDAR system, renowned for its high precision and efficiency in delivering premium-quality 3D point cloud data. A comprehensive laser scanning of Chengqi Building is firstly conducted and generate point cloud data comprising 120 million vertices, with multiple scanning stations strategically positioned in order to achieve complete coverage of every architectural detail from all angles. The acquired data undergoes professional point cloud processing, including noise reduction, filtering, and registration, to guarantee accuracy and integrity. The system incorporates a physics-based particle dynamics engine capable of handling up to 5 million particles with meticulously configured parameters such as 8 mm diameter, 2.1 g/cm3 density, and 0.6 friction coefficient, which require approximately 3.5 million collision detections per frame to deliver an exquisitely realistic simulation of the traditional Tulou construction process. The system features LOD (Level of Detail) tiered rendering capability, which allows seamless switching between high, medium, and low graphics presets to optimally balance visual quality and performance for smooth interactive experiences. Furthermore, the virtual interaction incorporates advanced multimodal technologies to enhance user immersion and engagement. For gesture control, Leap Motion devices enable users to naturally “grasp” virtual rammers and actively participate in the rammed-earth wall construction process through intuitive operation. The haptic feedback system precisely simulates ramming vibrations at 40 Hz frequency with three adjustable intensity levels (1–3N), which may allow users to physically sense the impact force and material texture. Then, it is sure to create a comprehensive immersive experience that authentically replicates traditional construction techniques while maintaining optimal system performance across all quality settings.
Experimental preparation phase
The study has employed a three-stage recruitment method to involve in stratification, randomization, and matching. First, potential tourist profiles within Fujian Province were obtained through the cultural and tourism bureau database, which can establish nine sampling strata based on three dimensions such as age (18–25, 26–35, 36–45), gender (1:1 ratio), and education level (associate degree or below bachelor’s degree and postgraduate degree), so as to ensure that the samples can adequately represent diverse demographic characteristics and enhance the generalizability of the experimental results. Each stratum is randomly distributed 200 electronic invitations and ultimately yields 186 respondents. The invitations are sent via a specialized recruitment platform capable of automatically recording dispatch times, receipt status, and preliminary response data. Therefore, it is easy to enable effective management and monitoring of the recruitment process. Subsequently, the nearest neighbor matching (NNM) method is applied to confirm no significant differences between the VR group and the conventional video group across all dimensions and indicate high similarity in key demographic traits (age, gender ratio, education level, etc.) to mitigate potential bias from population heterogeneity. Participants with extensive VR experience (>5 sessions/month) are excluded as “VR-proficient users” since their heightened familiarity with VR technology could lead to substantially different experiences and responses compared to novice users, which may potentially bias the experimental results. A total of 60 eligible volunteers are ultimately enrolled in the study, with 30 participants assigned to each group.
While the three-stage recruitment strategy enhanced sample representativeness, several limitations regarding generalizability should be acknowledged. The final sample size (n = 60), though adequate for detecting medium-to-large effect sizes in controlled experiments, may limit statistical power for detecting smaller effects and generalizability to broader populations. The sampling frame, restricted to registered tourist profiles within Fujian Province, may bring about selection bias toward individuals with pre-existing interest in local cultural heritage. Furthermore, the exclusion of VR-proficient users limits understanding of how experienced users interact with cultural metaverse applications. Future studies should employ large and diverse samples across multiple geographic regions to include stratified sampling of users with varying technology proficiency levels. The aim is to enhance external validity.
This study was conducted in strict accordance with the ethical principles of the Helsinki Declaration. All experimental protocols were consistent with institutional ethical review standards. Due to the nature of this research (e.g., retrospective study/use of anonymized data), the institutional review board granted an exemption from formal ethics approval procedures. All participants provided written informed consent after receiving comprehensive information regarding the study’s purpose, procedures, potential risks, and benefits. The confidentiality of participant information was strictly maintained, and the right to withdraw from the study at any time without penalty was ensured.
Experimental procedure
The VR group’s 30-min baseline phase has incorporated three standardized procedures to establish reliable pre-intervention measurements. First, resting-state EEG data are collected under both eyes-closed (α power: 42.7 ± 3.2 μV2/Hz) and eyes-open (18.3 ± 2.1 μV2/Hz) conditions for 5 min each to record electrophysiological baselines. Then, Participants complete a cultural knowledge pretest by using an IRT-based adaptive test bank (item difficulty: 0.62 ± 0.21), which is used to measure their initial competency levels (θ = 0.73 ± 0.31). Finally, a 5-min VR familiarization session (operation error rate: 1.2 ± 0.4 counts) is conducted to ensure participant comfort with the equipment and reduce potential first-time user effects. This comprehensive protocol has provided essential baseline data for subsequent VR intervention analyses.
During the 40-min intervention phase, VR group participants are engaged in three carefully designed activities to facilitate immersive learning. The session begins with a 10-min guided tour mode that enabled first-person exploration of the Tulou periphery so as to achieve path tracking accuracy of 2.1 ± 0.3 cm RMS. During this exploration, the system could automatically record participants’ movement trajectories and dwell times (averaging 8.3 ± 2.1 s per area) as they naturally observe the architectural features. Participants then progress to a 15-min construction task that may involve five rammed-earth craft interactions, which include material preparation, soil mixing, and wall compaction. This hands-on component demonstrates strong engagement, with participants achieving a 4.2 ± 0.5/5 task completion rate and 89.7 ± 3.2% operational accuracy through the system’s real-time feedback mechanism. Throughout this activity, the system can collect detailed hand-motion data to objectively assess technique mastery. The entire experience is supported by system-guided and step-by-step instructions that enable participants to develop both practical skills and a deeper understanding of traditional construction methods and their cultural significance. This multi-faceted approach has combined observational learning with active participation to create a comprehensive cultural immersion experience. Simultaneously, sensors can collect participants’ hand movements and force application data during operations to assess their mastery of construction techniques. In the 15-min cultural quiz, participants identify hidden cultural symbols (e.g., ancient murals, distinctive wood carvings) within the Tulou interior scenes and achieve 78.3 ± 5.6% recognition accuracy while the system can record search paths and durations to analyze cognitive strategies. This phase may evaluate participants’ observational skills and cultural understanding through systematic scoring of symbol identification performance and exploration patterns. Meanwhile, the control group has viewed a specially-produced documentary matching the intervention duration, which covers Tulou’s history, architectural features, and cultural practices. The system can track their viewing behaviors (fixation duration, scene transition frequency) for comparative analysis with VR group data, so as to maintain standardized knowledge delivery through conventional video presentation.
The 60-min evaluation phase for the VR group comprises comprehensive assessments such as Immediate post-intervention, participants completed questionnaires (IPQ total score: 7.2 ± 0.8, including spatial presence 6.8 ± 0.3/7 and involvement 6.5 ± 0.4/7), and physiological data collection (EEG with NASA-TLX cognitive load score: 52 ± 6). The aim is to evaluate cultural knowledge acquisition, immersive experience, and subjective satisfaction. Concurrent eye-tracking data enable in-depth analysis of cognitive processing and attention allocation patterns. One-week follow-up telephone tests have demonstrated significantly higher cultural knowledge retention in the VR group (78.3 ± 4.5%) compared to the traditional video group (62.1 ± 5.2%, p < 0.01), which may confirm VR’s advantage in long-term memory preservation through standardized content-related assessments. The complete experimental protocol is detailed in Table 1.
Enhanced data synchronization protocol
The system utilizes the White Rabbit protocol to establish a nanosecond-level synchronization network in hardware and provide precise temporal alignment for EEG devices, eye trackers, and motion capture systems through an FPGA timestamp generator (jitter <1 μs), while the software architecture integrates five synchronized data streams (EEG, eye movement, motion capture, questionnaire, and system logs) via a ROS2-based data hub that performs timestamp alignment (e.g., T0, T0 + 5 ns, T0 + 8 ns) and multimodal fusion through a unified interface. The complete system architecture and data flow are illustrated in Fig. 1.
A Shows the two-layer architecture. The hardware layer employs a White Rabbit protocol and FPGA timestamp generator (jitter <1 µs) to integrate and synchronize EEG, eye tracker, and motion capture devices over a nanosecond-level network. The software layer, based on an ROS2 data hub, offers a unified interface for real-time processing of multimodal streams like EEG, eye movement, motion data, questionnaires, and logs. B Shows that EEG, eye tracker, and motion capture devices are coordinated via FPGA and ROS2 hub, achieving nanosecond-level sync (e.g., packets at T0, T0 + 5 ns, T0 + 8 ns), with unified timestamps generated at T0 + 10 ns for synchronized multimodal data fusion.
Data acquisition and processing
A comprehensive 257-field data dictionary is developed in compliance with ISO/IEC 11179 standards, which can meticulously define each field’s name, data type, semantic meaning, collection methodology, and units to ensure standardization and interoperability for data sharing and reuse. An automated data quality control pipeline is implemented to execute daily at 02:00 and perform comprehensive validation checks in order to include data completeness, accuracy and consistency. The system can automatically detect and flag anomalies while triggering three-tiered alerts (email/SMS/phone) to notify researchers for timely corrective actions. Thus, it is important to maintain database reliability. A rigorous dual-verification protocol requires all data to be independently reviewed by at least two researchers prior to archiving. Any questionable entries are undergoing thorough investigation include potential reacquisition when it is necessary to guarantee data integrity and trustworthiness.
The EEG preprocessing has employed a combined ASR (Automatic Artifact Repair) and ICA (Independent Component Analysis) denoising approach, where ASR can automatically detect and correct artifacts (e.g., ocular/muscular artifacts) while ICA isolates and removes noise components independent of neural activity, which can collectively enhance signal-to-noise ratio. Morlet wavelet transforms are applied for time-frequency analysis across 1–100 Hz, which can decompose EEG signals into temporally-resolved spectral components to visualize dynamic neural oscillations and establish foundations for functional brain analysis. PLI (Phase Lag Index)-weighted brain network construction is implemented to examine interregional functional connectivity patterns and network topology, with PLI metrics quantifying phase synchronization directionality between brain areas. This network approach elucidates information exchange and cooperative mechanisms among cortical regions during VR immersion and provide mechanistic insights into how immersive experiences modulate cognitive brain functions.
The study can systematically investigate users’ cognitive behaviors in virtual environments through multimodal data analysis. First, researchers have constructed an eye-tracking scanpath model based on Hidden Markov Models to identify three characteristic visual exploration patterns (hidden states = 3, BIC = 152.3) by analyzing fixation coordinates (x,y), saccade amplitude (°), and dwell time (ms). Second, through using a density-peak-based dynamic ROI partitioning algorithm, researchers have divided the virtual scene into seven areas of interest (diameter ≥2° visual angle) and calculated metrics including first fixation latency (M = 320 ms, SD = 45) and total fixation duration (M = 1.2 s, SD = 0.3), which can reveal significant differences in visual attention allocation across Tulou architectural features (F(6,105) = 8.72, p < 0.001, η2 = 0.33). Joint ICA (JIVE) analysis of multimodal data showed: significant positive correlation between EEG θ-band power (4–7 Hz) and fixation duration (r = 0.61, p < 0.001, FDR-corrected); head rotation angles predicted ROI transition probability (β = 0.47, p = 0.002); and significant covariation patterns between questionnaire scores and eye-movement metrics (p < 0.001, Cohen’s f = 0.42). With synchronized multimodal data acquisition (error < ±12 ms), these findings provide reliable evidence for understanding the cognitive mechanisms underlying immersive experiences.
Results
Validity verification
The study systematically validates an immersive Fujian Tulou cultural experience framework through a multidimensional verification system. Researchers could propose an innovative “Dual-Cycle Cultural Cognition Model” that extends the traditional Technology Acceptance Model (TAM) both theoretically and empirically. The theoretical framework establishes a dual “inner-outer cycle” mechanism. The inner cycle utilizes computer vision for real-time cultural symbol recognition (achieving 82.3% accuracy in identifying Tulou couplets), which triggers significant EEG θ-band power enhancement (β = 0.54, p < 0.001). The outer cycle feeds user-generated rammed-earth construction data (89.7% operational accuracy) back to intangible cultural heritage practitioners to form a dynamic “experience-optimization-reexperience” feedback loop (expert content validity index CVI = 0.88). Empirical results from the integrated UTAUT2-TAM structural equation model demonstrate that performance expectancy and effort expectancy could explain 68% of behavioral intention variance (R2 = 0.68, 95% CI[0.63,0.73]). The model significantly outperforms traditional Digital Heritage Evaluation Scale benchmarks (t(215) = 6.72, p < 0.001, Cohen’s d = 0.92) and newly incorporated perceived usefulness (α = 0.91) and ease of use (α = 0.89) constructs with substantially improving model predictability (Δχ2(2) = 24.3, p < 0.001).
Interpretation of validity results
The superior performance of the integrated model suggests that traditional technology acceptance factors, when contextualized within cultural heritage frameworks, provide enhanced explanatory power for user engagement in metaverse environments. The high EEG θ-band correlation with cultural symbol recognition indicates that immersive technologies may enhance cultural memory encoding through neural synchronization mechanisms.
Empirical findings
Participants have demonstrated significantly higher scores in spatial presence (M = 6.8 vs 5.2; t(150) = 9.24, p < 0.001, d = 1.52) and engagement (M = 6.5 vs 5.2; t(150) = 7.86, p < 0.001, d = 1.29) compared with traditional VR experiences. Item Response Theory analysis shows that the VR group have achieved a 23% improvement in cultural symbol recognition accuracy versus 15% in control (χ2(1) = 8.92, p = 0.003, φ = 0.21). The newly developed “cultural FOMO” metric also demonstrates strong correlation with sharing intention (r = 0.71, 95% CI[0.65,0.76]), with explanatory power significantly surpassing traditional subjective norm pathways (Steiger’s Z = 4.12, p < 0.001). Multigroup analysis reveals experience-dependent effects. Novice users (β = 0.41, SE = 0.07) and experienced users (β = 0.62, SE = 0.05) follow distinct cognitive pathways (t(58) = 3.27, p = 0.002, d = 0.85). The validity verification framework is illustrated in Fig. 2.
Interpretation of behavioral findings
The introduction of “cultural FOMO” is based on established behavioral theories in heritage studies, particularly extending Scannell & Gifford’s (2010) place attachment framework to digital environments. This construction captures the anxiety users experience when they fear exclusion from meaningful cultural experiences. It is clear that there exists a novel manifestation of digital heritage engagement. The strong correlation with sharing intention suggests cultural FOMO operates as a significant driver of digital word-of-mouth in heritage contexts.
Metric optimization
The study has implemented systematic innovations in indicator optimization through introducing behavioral observation indicators (cultural symbol recognition accuracy and spatial relationship comprehension), incorporating psychological metrics (cultural identity α = 0.88, aesthetic experience α = 0.85), and expanding behavioral indicators (sharing intention, revisit intention). Confirmatory factor analysis demonstrates the optimized indicator system exhibits excellent psychometric properties such as discriminant validity (AVE = 0.72, 95% CI[0.68,0.76]), composite reliability (CR = 0.91), and internal consistency (α = 0.89, 95% CI[0.86,0.92]). Independent samples t-tests show that the VR group’s cultural symbol recognition accuracy (M = 82.3%, SD = 6.7) significantly exceeds the traditional media group (M = 59.1%, SD = 8.2; t(59) = 4.32, p < 0.001, Cohen’s d = 1.12). The indicator optimization framework is illustrated in Fig. 3.
Model modification
Model optimization through modification index analysis has identified three statistically significant error-correlated paths, which markedly improve model fit (Δχ2 = 38.72, Δdf = 3, p < 0.001; RMSEA decreased by 0.02). Robustness testing based on 5000 bootstrap resamples confirms that all key path coefficients exclude zero in 95% CIs. The cultural knowledge acquisition to spatial cognition path remains stable (0.65–0.71 range, p < 0.001), and the presence of the emotional engagement path shows the largest effect (β = 0.54, 95% CI[0.49,0.59], Cohen’s f2 = 0.35). The final model exhibits excellent fit indices (CFI = 0.92; RMSEA = 0.04, 90% CI[0.03,0.05]; SRMR = 0.03). The model modification process is illustrated in Fig. 4.
The model optimization process demonstrates significant improvement in fit through three added paths (Δχ2 = 38.72*). The final model shows excellent fit indices (CFI = 0.92, RMSEA = 0.04, SRMR = 0.03). Path analysis reveals a strengthened “culture-space” association (0.65→0.71*) and a notable “presence-emotion” effect (B = 0.54). Bootstrap validation with 5000 resamples confirms model stability, as key paths’ confidence intervals exclude zero.
Cross-group validation
Multigroup analyses used to stratify participants into novice (n = 32) and experienced users (n = 28) establishes measurement in variance (ΔCFI < 0.01). Path analysis reveals significant between-group differences in the ease of-use→cognition pathway (βnovice = 0.41 vs βexperienced = 0.28; t(58) = 2.15, p = 0.036, Cohen’s d = 0.55). Novice users derive greater cognitive benefits from system usability (32% enhancement; t(58) = 2.91, p = 0.005, d = 0.75), while experienced users exhibit stronger behavioral intention (18% increase; t(58) = 2.17, p = 0.034, d = 0.56). These empirically validated differences (Fig. 5) could provide compelling evidence for implementing experience-adapted design strategies in VR development.
Discussion
The expert review process has employed a modified Delphi method with three iterative rounds. The aim is to ensure the professionalism and accuracy of cultural representation. Twelve intangible cultural heritage (ICH) practitioners are invited. They consist of five Tulou architectural technique inheritors, four Hakka folk culture experts, and three traditional craftsmanship masters. These experts try to possess extensive practical experience and profound academic accomplishments in various Tulou-related fields. They can offer a comprehensive professional evaluation of cultural representation in the VR experience from multiple perspectives. A detailed scoring rubric with 38 assessment items is designed in order to cover multiple dimensions such as architectural features, craft details, and folk activities. For architectural features, evaluation items include wall thickness, door/window styles, and roof pitch of Tulou buildings. For craft details, the assessment focuses on rammed-earth technique, force control, and timber structure mortise-tenon joints. For folk activities, items cover Hakka wedding customs and ritual ceremonies. The quantitative evaluation is conducted by using a 5-point Likert scale (1 = strongly disagree, 5 = strongly agree) so as to ensure objectivity and comparability of assessment results.
For items with significant disagreements in the first-round evaluation (such as the force feedback intensity of rammed-earth techniques), experts are organized for on-site VR experience sessions and in-depth discussions. The experts have personally tested the rammed-earth simulation module in the VR system and evaluated different force feedback levels provided by the haptic devices. The detailed exchanges are used to compare these with actual traditional craftsmanship experiences. Based on expert feedback, some key cultural representation details are optimized, such as adjusting the weight perception of ramming hammers and refining earthen wall texture details. Specifically, the hammer’s weight perception is increased from 8 kg to 10 kg to better match real tools, while wall textures are remodeled with enhanced mapping to accurately represent traditional rammed-earth characteristics. Additionally, two interaction design logics are improved to align virtual construction processes with actual craftsmanship standards, which include revising the operation sequence and motion requirements during wall ramming to maintain consistency with traditional workflows.
The finalized Content Validity Index (CVI) reaches 0.88 (significantly exceeding the 0.75 threshold in a one-sample t-test: t[11] = 5.42, p < 0.001, Cohen’s d = 1.56), while the Kendall’s W coefficient of 0.85 (χ2[11] = 93.5, p < 0.001) demonstrates strong expert consensus. In evaluating the rammed-earth craft simulation system, a binomial test confirms that 10 out of 12 experts awarding the maximum 5-point score is significantly above chance level (p = 0.013), with the remaining 2 assigning 4 points (mean = 4.83 ± 0.39), which collectively indicates exceptional endorsement of cultural authenticity. Through iterative reviews, researchers have established 25 core standards for cultural representation and showed excellent internal consistency across dimensions (content accuracy α = 0.91, interaction logic α = 0.89, visual fidelity α = 0.93). One-way ANOVA reveals that the finalized standards can significantly reduce cultural representation errors by 42% compared with initial prototypes (F(2,33) = 12.7, p < 0.001, η2 = 0.44), with the optimized rammed-earth simulation particularly noteworthy for achieving 92% expert endorsement (95% CI[85%,97%]) as authentically replicating traditional techniques. The detailed experimental procedure data are shown in Fig. 6.
Against the backdrop of increasing cultural globalization, the digital preservation and innovative transmission of cultural heritage are faced with significant challenges. The study takes Fujian Tulou as a case study and tries to explore the effectiveness and optimization pathways of VR technology in cultural heritage dissemination through the construction of a Cultural Visualizability (CV) assessment model and a metaverse virtual exhibition hall, so as to validate it via multimodal experiments. The core contribution of this research lies in its explicit improvement upon existing evaluation frameworks for digital cultural heritage. First, diverging from traditional studies may emphasize qualitative descriptions or single-user experience metrics. The proposed CV assessment model establishes a scientific framework for evaluating digital heritage effects by quantifying relationships between technical parameters (T), cultural salience (S), and implementation costs (Ct). Experimental data reveal that the VR group achieves 92% spatial cognition accuracy, with significantly higher cultural detail retention rates compared with traditional video groups so as to validate the CV model’s effectiveness in optimizing cultural visualization strategies. Secondly, the Fujian Tulou metaverse virtual exhibition leverages VR’s immersive interactivity through multimodal data fusion (EEG, eye-tracking, behavioral analysis) and is LOD-optimized to balance high realism with seamless experience.
The experimental results demonstrate that the VR group significantly outperforms the traditional video group in both cultural comprehension (85 vs. 65) and cultural interest level (4.5 vs. 3.0), which may confirm VR’s advantages in enhancing cultural communication depth and user engagement. Furthermore, the study has validated the scientific rigor of the theoretical framework through structural equation modeling (CFI = 0.92) and the accuracy of cultural representation via intangible cultural heritage inheritors’ evaluation (CVI = 0.88). Therefore, this research not only demonstrates the effectiveness of VR technology through rigorous experimentation but also provides a set of quantifiable assessment methods and a decision-making tool to balance the efficacy and cost.
Based on these findings, there are some clear pathways for future research. Firstly, the success of the CV model and the rich multimodal dataset (e.g., EEG, eye-tracking, behavioral logs) provide a foundational training ground for AI-driven dynamic cultural narratives. Future work can leverage these data to develop models so as to generate personalized and context-aware storytelling within the virtual Tulou. Also, it is important to adapt narratives in real-time based on user behavior and demonstrated cognitive engagement. Secondly, the cognitive pathways can identify that the potent ‘cultural FOMO’ metric in this study underscores the critical need for cross-cultural validation. The models and frameworks developed here should be tested with diverse international user groups to investigate how cultural background modulates perception, engagement, and learning outcomes so as to ensure their global applicability and effectiveness. This progression naturally propels VR heritage research from the ‘technology verification’ stage toward a phase of refined, adaptive, and culturally-aware design and evaluation.
Data availability
The data generated and analyzed during this study are included in this published article.
Code availability
No custom code is developed. All analyses are performed using existing commercial and open-source software without source code modifications.
References
Zhong, H., Wang, L. & Zhang, H. The application of virtual reality technology in the digital preservation of cultural heritage. Comput. Sci. Inf. Syst. 18, 535–551 (2021).
Hajirasouli, A., Banihashemi, S., Kumarasuriyar, A., Talebi, S. & Tabadkani, A. Virtual reality-based digitisation for endangered heritage sites: theoretical framework and application. J. Cult. Herit. 49, 140–151 (2021).
Fokides, E. & Antonopoulos, P. Development and testing of a model for explaining learning and learning-related factors in immersive virtual reality. Comput. Educ. X Real. 4, 100048 (2024).
Holcombe, A. Attending to Moving Objects (Cambridge Univ. Press, 2023).
Banfi, F. et al. Interactive and immersive digital representation for virtual museum: VR and AR for semantic enrichment of Museo Nazionale Romano, Antiquarium di Lucrezia Romana and Antiquarium di Villa Dei Quintili. ISPRS Int. J. Geo Inf. 12, 28 (2023).
Harun, N. Z. & Sri, Y. M. 360° virtual tour of the traditional Malay house as an effort for cultural heritage preservation. IOP Conf. Ser. Earth Environ. Sci. 764, 012010 (2021).
Mosbeh, R. Factors influencing success of virtual reality in Tunisia’s cultural heritage tourism sites: a UTAUT2 approach extension. Virtual Real. 29, 26 (2025).
Khorasani, S., Syiem, B. V., Nawaz, S., Knibbe, J. & Velloso, E. Hands-on or hands-off: deciphering the impact of interactivity on embodied learning in VR. Comput. Educ. X Real. 3, 100037 (2023).
Gao, L. et al. Investigating the effectiveness of virtual reality for culture learning. Int. J. Hum. Comput. Interact. 37, 1771–1781 (2021).
Anastasovitis, E. et al. Enhanced inclusion through advanced immersion in cultural heritage: a holistic framework in virtual museology. Electronics 13, 1–17 (2024).
Ye, D. et al. Investigating the impact of virtual tourism on travel intention during the post-COVID-19 era: evidence from China. Univ. Access Inf. Soc. 23, 1507–1523 (2024).
Liu, Z. Regeneration of Liangzhu culture: multimedia exhibition, simulated restoration, innovative cultural products, nearby area integration, virtual reality and augmented reality. Humanit. Soc. Sci. Commun. https://doi.org/10.1057/s41599-023-01865-x (2023).
Klingenberg, S., Bosse, R., Mayer, R. E. & Makransky, G. Does embodiment in virtual reality boost learning transfer? Testing an immersion-interactivity framework. Educ. Psychol. Rev. 36, 1–39 (2024).
Laine, J., Rastas, E., Seitamaa, A., Hakkarainen, K. & Korhonen, T. Immersive virtual reality for complex skills training: content analysis of experienced challenges. Virtual Real. https://doi.org/10.1007/s10055-024-00955-8 (2024).
Makransky, G. & Mayer, R. E. Benefits of taking a virtual field trip in immersive virtual reality: evidence for the immersion principle in multimedia learning. Educ. Psychol. Rev. 34, 1771–1798 (2022).
Xinhua News Agency. Put on VR to tour Dunhuang: “exploring Dunhuang” digital exhibition gains popularity. https://baijiahao.baidu.com/s?id=1805277950350574266 (2024).
Bianconi, F., Filippucci, M., Cornacchini, F., Meschini, M. & Mommi, C. Cultural heritage and virtual reality: application for visualization of historical 3D reproduction. Int Arch. Photogramm. Remote Sens Spat. Inf. Sci. 48, 203–210 (2023).
Iacono, S. et al. Virtual reality in cultural heritage: a setup for Balzi Rossi Museum. Appl Sci. https://doi.org/10.3390/app14093562 (2024).
Pei, X., Fu, S. & Jiang, T. An empirical study on user experience evaluation of VR interface in digital museums. Data Inf. Manag. 7, 100057 (2023).
Soccini, A. M. & Anna, M. M. Review of towards a standard approach for the design of a both physical and virtual museum. In 2021 IEEE International Conference on Artificial Intelligence and Virtual Reality (AIVR) 106–108 (IEEE, Taichung, 2021).
Oncioiu, I. & Priescu, I. The use of virtual reality in tourism destinations as a tool to develop tourist behavior perspective. Sustainability 14, 4191 (2022).
Margetis, G., Apostolakis, K. C., Ntoa, S., Papagiannakis, G. & Stephanidis, C. X-reality museums: unifying the virtual and real world towards realistic virtual museums. Appl Sci. 11, 338 (2021).
Qiu, T. et al. Continuance intention toward VR games of intangible cultural heritage: a stimulus-organism-response perspective. Virtual Real. 28, 149 (2024).
Kalving, M., Paananen, S., Seppälä, J., Colley, A. & Häkkilä, J. Comparing VR and desktop 360 video museum tours. In Proc. 21st International Conference on Mobile and Ubiquitous Multimedia 134–145 (ACM, New York, 2022).
Gong, Q. et al. User experience model and design strategies for virtual reality-based cultural heritage exhibition. Virtual Real. 28, 1–21 (2024).
Hargitai, R. Measuring immersion, involvement, and attention focusing tendencies in the mediated environment: the applicability of the immersive tendencies questionnaire. Front. Psychol. 13, 1–10 (2022).
Paulauskas, L. et al. Reconstruction of industrial and historical heritage for cultural enrichment using virtual and augmented reality. Technologies 11, 36 (2023).
Liu, S. M. Study on Road Landscape of Hekeng Village in Nanjing Hakka Tulou Settlement (South China University of Technology, Guangzhou, 2022).
Zheng, F. & Liu, X. Y. Application of virtual reality technology in Fujian Tulou cultural communication. Art. Mark. 28, 68–99 (2025).
Wang, Z. T. & Wu, S. D. Analysis on protection and development strategies of Fujian Tulou culture under rural revitalization. China Archit. Decor. 23, 102–123 (2023).
Chen, X. Y. Power strategy of Fujian Tulou space. Fujian Hist. Rec. 4, 138–167 (2021).
Wang, J., Hong, P. & Zhu, Z. W. Spatial dimension of rural revitalization: A case study based on Fujian Tulou restoration. J. Huazhong Agric. Univ. Soc. Sci. Ed. 4, 156–185 (2021).
Chen, H. Investigation of Pinghe Tulou resources and strategies for next-step protection and utilization. Identif. Apprec. Cult. Relics 20, 187–202 (2024).
Li, M. & Liu, N. Inheritance and application research of Tujia culture in architectural design. Beauty & Times (City Edition) 10, 132–189 (2024).
Chen, Q. Exploration on inheritance and development of traditional culture in architectural design. Urban Archit. Space 2, 132–189 (2024).
Ou, X. Analysis of architectural design paradigm under the background of digital technology. Low Carbon World 3, 167–202 (2023).
Spallone, R. et al. 3D modelling and virtual reality for museum heritage presentation: contextualisation of sculpture from the Tang era. Int. Arch. Photogramm. Remote Sens Spat. Inf. Sci. ISPRS Arch. 48, 413–420 (2024).
Innocente, C., Ulrich, L., Moos, S. & Vezzetti, E. A framework study on the use of immersive XR technologies in the cultural heritage domain. J. Cult. Herit. 62, 268–283 (2023).
Arayaphan, W., Sirasakmol, O., Nadee, W. & Puritat, K. Enhancing intrinsic motivation of librarian students using virtual reality for education in the context of culture heritage museums. TEM J. 11, 620–630 (2022).
Arayaphan, W., Intawong, K. & Puritat, K. Digitalization of ancient fabric using virtual reality technology at the Wieng Yong house museum: the fabricVR project. Digit Appl. Archaeol. Cult. Herit. 26, e00233 (2022).
Baradaran, R., Farzan, J. E., Boyd, J. R., Eiserman, R. M. & Levy, B. Museum beyond physical walls: an exploration of virtual reality-enhanced experience in an exhibition-like space. Virtual Real. https://doi.org/10.1007/s10055-022-00643-5 (2022).
Li, J. & Lv, C. Exploring user acceptance of online virtual reality exhibition technologies: a case study of Liangzhu Museum. PLoS ONE 19, 1–25 (2024).
Mimnaugh, K. J. et al. Virtual reality sickness reduces attention during immersive experiences. IEEE Trans. Vis. Comput Graph 29, 4394–4404 (2023).
Barbera, R. et al. A case study for the design and implementation of immersive experiences in support of Sicilian cultural heritage. In Lecture Notes in Computer Science 174–185 (Springer, Cham, 2022).
Bekele, M. K. Clouds-based collaborative and multi-modal mixed reality for virtual heritage. Heritage 4, 1447–1459 (2021).
Komianos, V. Immersive applications in museums: an analysis of the use of XR technologies and the provided functionality based on systematic literature review. JOIV Int. J. Inform. Visual. 6, 60–73 (2022).
Berg, K., Larsson, M. & Lindh, F. An Immersive Experience using 360° Video and Google Cardboard (Chalmers Univ. Technol., 2024).
Buragohain, D., Meng, Y. & Deng, C. Digitalizing cultural heritage through metaverse applications: challenges, opportunities, and strategies. Herit. Sci. 12, 1–17 (2024).
Martirosov, S., Bureš, M. & Zítka, T. Cyber sickness in low-immersive, semi-immersive, and fully immersive virtual reality. Virtual Real. 26, 15–32 (2022).
Suhaeni, T., Brien, A., Andrianto, T. & TahaMannaa, D. M. Predicting future halal tourist behavior: incorporating holistic tourist experience and virtual reality experience. Int. J. Appl. Bus. Res 2022, 16–31 (2022).
Rodriguez-Garcia, B., Guillen-Sanz, H., Checa, D. & Bustillo, A. A systematic review of virtual 3D reconstructions of cultural heritage in immersive virtual reality. Multimed. Tools Appl 83, 89743–89793 (2024).
Okanovic, V. et al. Interaction in extended reality applications for cultural heritage. Appl. Sci. 12, 1241 (2022).
Condorelli, F. & Sara, M. Integration of 3D modelling with photogrammetry applied on historical images for cultural heritage. VITRUVIO Int. J. Archit. Technol. Sustain. 8, 58–69 (2023).
De Canio, F., Martinelli, E., Peruzzini, M. & Cavallaro, S. Experiencing a food production site using wearable devices: the indirect impact of immersion and presence in VR tours. Sustainability 14, 3064 (2022).
De Paolis, L. et al. Virtual reality for the enhancement of cultural tangible and intangible heritage: the case study of the castle of Corsano. Digit. Appl. Archaeol. Cult. Herit. 27, e00238 (2022).
Roumana, A., Georgopoulos, A. & Koutsoudis, A. Developing an educational cultural heritage 3d puzzle in a virtual reality environment. Int. Arch. Photogramm. Remote Sens Spat. Inf. Sci. 43, 885–891 (2022).
Chong, H. T. et al. Virtual reality usability and accessibility for cultural heritage practices: challenges mapping and recommendations. Electronics 10, 1–19 (2021).
Chernbumroong, S. et al. Comparing the impact of non-gamified and gamified virtual reality in digital twin virtual museum environments: a case study of Wieng Yong house museum. Heritage 7, 1870–1892 (2024).
Alabau, A. et al. Enriching user-visitor experiences in digital museology: combining social and virtual interaction within a metaverse environment. Appl. Sci. 14, 3769 (2024).
Tsita, C. et al. A virtual reality museum to reinforce the interpretation of contemporary art and increase the educational value of user experience. Heritage 6, 4134–4172 (2023).
Kim, M. J. & Kang, Y. Older adults’ user experience of virtual tourism: exploring presence and experiential value with respect to age difference. Virtual Real. 27, 2967–2987 (2023).
Wang, X., Hidayat, R., Wardi, B. & Ghazali, R. B. Evaluation of virtual reality cultural heritage experience at Guilin Museum, China: a conceptual paper. Malays. J. Soc. Sci. Humit. 10, e003111 (2025).
Othman, M. K., Nogoibaeva, A., Leong, L. S. & Barawi, M. H. Usability evaluation of a virtual reality smartphone app for a living museum. Univers Access Inf. Soc. 21, 995–1012 (2022).
Lee, L. H. et al. All one needs to know about metaverse: a complete survey on technological singularity, virtual ecosystem, and research agenda. J. Latex Clas. Files 14, 1–66 (2021).
Barszcz, M., Dziedzic, K., Skublewska-Paszkowska, M. & Powroznik, P. 3D scanning digital models for virtual museums. Comput. Animat. Virtual Worlds 34, e2154 (2023).
Oladokun, B. D., Ajani, Y. A., Ukaegbu, B. C. N. & Oloniruha, E. A. Cultural preservation through immersive technology: the metaverse as a pathway to the revitalization of indigenous heritage. J. Virtual Cult. Herit. 15, 88–105 (2023).
Innocente, C. et al. A metaverse platform for preserving and promoting intangible cultural heritage. Appl. Sci. 14, 3426 (2024).
Hutson, J. Art and Culture in the Multiverse of Metaverses: Immersion, Presence, and Interactivity in the Digital Age (Springer, Cham, 2024).
Poulopoulos, V. & Wallace, M. Digital technologies and the role of data in cultural heritage: the past, the present, and the future. Big Data Cogn. Comput. 6, 73 (2022).
Pietroni, E. & Ferdani, D. Virtual restoration and virtual reconstruction in cultural heritage: terminology, methodologies, visual representation techniques and cognitive models. Information 12, 167 (2021).
Balakrishnan, J., Dwivedi, Y. K., Malik, F. T. & Baabdullah, A. M. Role of smart tourism technology in heritage tourism development. J. Sustain. Tour. 31, 2506–2525 (2023).
Baker, J., Nam, K. & Dutt, C. S. A user experience perspective on heritage tourism in the metaverse: empirical evidence and design dilemmas for VR. Inf. Technol. Tour. 25, 265–306 (2023).
Buhalis, D. & Karatay, N. Mixed reality (MR) for generation Z in cultural heritage tourism towards metaverse. In Information and Communication Technologies in Tourism 2022 (eds Stienmetz J. L. et al.) 16–27 (Springer, Cham, 2022).
Fan, X., Jiang, X. & Deng, N. Immersive technology: A meta-analysis of augmented/virtual reality applications and their impact on tourism experience. Tour. Manag. 91, 104534 (2022).
Ioannidis, S. & Kontis, A. P. Metaverse for tourists and tourism destinations. Inf. Technol. Tour. 25, 483–506 (2023).
Wang, Y., Siau, K. L. & Wang, L. Metaverse and Human-Computer Interaction: A Technology Framework for 3D Virtual Worlds. In HCI International 2022 - Late Breaking Papers. Lecture Notes in Computer Science Vol. 13518 (eds Chen, J. Y. C. et al.) 213–221 (Springer, Cham, 2022).
Zhang, X. et al. Metaverse for cultural heritages. Electronics 11, 3730 (2022).
Dayoub, B., Yang, P., Omran, S., Zhang, Q. & Dayoub, A. Digital silk roads: leveraging the metaverse for cultural tourism within the belt and road initiative framework. Electronics 13, 2306 (2024).
Suh, A. How users cognitively appraise and emotionally experience the metaverse: focusing on social virtual reality. Inf. Technol. People 37, 1613–1641 (2024).
Go, H. & Kang, M. Metaverse tourism for sustainable tourism development: tourism agenda 2030. Tour. Rev. 78, 381–394 (2023).
Pastel, S. et al. Comparison of gaze accuracy and precision in real-world and virtual reality. Virtual Real. 25, 175–189 (2021).
Acknowledgements
We sincerely thank the volunteers and staff who contributed to this research. This work was supported by the General Project of Humanities and Social Sciences Research of the Ministry of Education (2024) [Grant No. 24YJAZH088], the Ministry of Culture and Tourism Art Research Project for the Year 2025 [Grant No. 25DB003], the Fujian Provincial Social Science Foundation Project [Grant No. FJ2024C124], and the Philosophy and Social Sciences Research Project of Fujian Provincial Education System [Grant No. JAS23208]. We also acknowledge the institutional support from Xiamen University Tan Kah Kee College, Beijing Normal University, and Huzhou Normal University.
Author information
Authors and Affiliations
Contributions
Miss Na Yu has led the conceptualization, design, and development of the VR museum. She has conducted and finished participant data collection and result analyses. Also, she tries to draft the manuscript and supervise the whole research. On the other hand, Mr. Yin Jing offers his help with proofreading and editing the manuscript. Similarly, it is necessary to appreciate him to provide some critical feedback on the methodology, co-supervise the research, and review the manuscript. In addition, Mr. Jun Bu also participates in data analyses and statistical validation by contributing expertise in VR technology and reviewing technical aspects with hard work in data interpretation and manuscript revision. Finally, it is absolutely necessary to appreciate all authors for their meticulous review and approval of the final draft.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Na, Y., Jing, Y. & Bu, J. Immersive VR technology validation for Fujian Tulou heritage. npj Herit. Sci. 14, 42 (2026). https://doi.org/10.1038/s40494-025-02177-w
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s40494-025-02177-w








