Abstract
This paper describes a dataset containing field measurement data from classrooms in Belgium. The dataset comprises of survey responses, indicating occupants’ satisfaction with the indoor environmental quality (IEQ) (i.e., thermal, indoor air quality, visual, acoustic and overall IEQ conditions), measurements of the indoor environmental conditions in the classroom during heating and intermediate season, and contextual variables like occupant location in classroom. Data were collected in three educational case studies: three classrooms in a secondary school (Case 1), seven primary school classrooms (Case 2), and a university lecture room (Case 3). The full dataset consists of 6834 satisfaction assessments collected from 321 unique occupants in classrooms, merged with corresponding measurements of the classroom IEQ. The dataset benefits the research community since a longitudinal dataset of this size on occupants’ satisfaction in classrooms is unavailable at this moment. The dataset can serve as a training or validation dataset for novel comfort models, a basis for statistical analyses to establish a causal model or to examine inter- and intra-individual differences in occupant satisfaction.
Similar content being viewed by others
Background & Summary
Providing a satisfactory indoor environmental quality (IEQ) is essential for safeguarding occupants’ health1, productivity2,3 and overall satisfaction4. In the case of educational buildings, an unsatisfactory classroom IEQ can lead to learning loss among pupils and students5,6,7. However, guaranteeing high occupant satisfaction levels is challenging due the complex combination of variables influencing the satisfaction. These variables can be environmental, personal and contextual8. In addition, currently used comfort models like Fanger’s predicted mean vote – predicted percentage dissatisfied (PMV-PPD) model9 and the adaptive thermal comfort model10,11, often poorly predict the actual satisfaction of the individual occupant as these models predict the average comfort preference of a large group of occupants rather than predicting the unique preference of an individual occupant. In addition, these comfort models are constructed on training data from adults, while many of the occupants in classrooms are minors12. Therefore, further research is needed to improve the understanding of occupant satisfaction with IEQ in classrooms and to improve the accuracy of predictive comfort models13. Personal comfort models are a promising modelling approach for enhancing the accuracy of predicting occupant preferences with regard to the IEQ. Personal comfort modelling is an approach which aims to predict the individual preferences of occupants, rather than relying on the average preference of a large group14,15. The integration of personal comfort models in the HVAC control system has the potential to increase occupant satisfaction and reduce energy use by the HVAC system16. A personal comfort model is trained using occupant feedback (e.g., thermal satisfaction, sensation or preference) and one or more environmental, physiological or behaviour-related factors14,17,18,19,20. Therefore, a longitudinal dataset obtained through repeated measures data collection is required to construct a personal comfort model. In these types of data collection campaigns, occupants must evaluate their perception of satisfaction multiple times throughout the study. The study duration can vary from several days to several months. The benefit of a longitudinal dataset is that it can be used to study the impact of variables that change over time (e.g., IEQ-related variables such as temperature and CO2-concentation) on occupants’ satisfaction. Many of the public datasets such as, ASHRAE Global Thermal Comfort Database21 and the other longitudinal datasets17,22 were limited to evaluating occupants’ perceptions of the thermal environment and not of the other IEQ conditions, such as the indoor air quality (IAQ), visual and acoustical conditions. A dataset constructed by Jayathissa et al.23 extended the scope by include assessments of occupants’ perception of the thermal, acoustic and visual conditions of an indoor space. However, an evaluation of occupants’ satisfaction with the IAQ conditions was missing. Therefore, there is a lack of publicly available longitudinal datasets for classrooms that evaluate occupants’ satisfaction with all four IEQ domains.
Added value and novelty of the current dataset
The current paper describes the construction of a longitudinal dataset on occupants’ satisfaction with the IEQ in three educational case studies. The dataset was designed to meet the following requirements:
-
1.
Assessments of occupants’ satisfaction with the IEQ conditions: The dataset was not limited to occupants’ satisfaction with the thermal environment, but also included satisfaction assessments on the IAQ, acoustic and visual conditions.
-
2.
Simultaneous monitoring of the IEQ-related, personal and contextual variables: Measurements of various IEQ-related (e.g., temperature, relative humidity, CO2,…), personal (e.g., clothing level, age, gender,…) and contextual variables (e.g., moment of assessment, window opening, occupants’ seating location,…) were performed in order to obtain a holistic overview of the potential influencing variables on occupants’ satisfaction.
-
3.
Obtained from a field study in classrooms: The dataset represented occupants’ actual response to the IEQ conditions in classrooms.
-
4.
Longitudinal measurement campaign: The dataset resulted from a longitudinal measurement campaign in which multiple assessments (e.g., 10–20 assessments per person) were collected from the occupants. Consequently, sufficient data was available to determine occupants’ unique preferences and train personal comfort models.
The dataset consists of 6834 survey responses on occupants’ satisfaction with the IEQ from 321 different occupants (i.e., children, teenagers and adults), merged with corresponding measurements of the classroom IEQ. All classrooms were located in the region of Flanders (Belgium). The research community can use the presented dataset as:
-
A validation dataset to evaluate novel comfort models.
-
A training dataset for personal comfort models since the dataset comprises of multiple votes per occupant.
-
A basis for a statistical study which aims to establish a causal model for occupant satisfaction in classrooms or to study inter- and intra-individual differences in satisfaction.
-
An evaluation dataset to assess the impact of new heating, ventilation and air-conditioning (HVAC) control systems on occupants’ satisfaction with IEQ. For example, the dataset can be the basis for an agent-based model representing occupant satisfaction responses to HVAC control actions in a building energy simulation study.
The relevance of the current dataset can be summarised in the following three points:
-
Longitudinal feedback for personal comfort modelling: Many existing datasets collect only a few comfort votes per person. Our dataset stems from longitudinal measurement campaigns where participants were frequently asked about their IEQ satisfaction. This repeated feedback is essential for developing and validating personal comfort models, which aim to predict unique occupant preferences rather than group averages.
-
Comprehensive evaluation of occupants’ satisfaction with IEQ: Unlike most datasets that focus solely on thermal comfort, our dataset includes evaluations of satisfaction with thermal, IAQ, acoustic, and visual conditions, enabling holistic modelling of occupants’ satisfaction with IEQ.
-
Underrepresented Belgian context: The ASHRAE global thermal comfort database21 contains only 85 datapoints from Belgium, all collected in residential buildings. Moreover, only 597 datapoints (0.5% of the ASHRAE dataset) originate from classrooms in a climate zone similar to Flanders (Belgium). Our dataset addresses this gap by providing an extensive dataset from classrooms in this underrepresented region, offering a valuable resource for studies on IEQ in educational settings within a mixed-humid climate.
Methods
This section describes the data collection and dataset construction approach. The data were collected during three case studies: three classrooms in a secondary school (Case 1), seven primary school classrooms (Case 2), and a test lecture room at a university campus (Case 3). The three case studies varied in type of occupants: children under 12 years old in case 2, teenagers in case 1 and young adults in case 3. As a result, the satisfaction of a wide range of occupants in educational buildings was studied. Firstly, the data collection approach comprising of environmental monitoring and surveys, assessing occupants’ satisfaction, are described. Secondly, it delves into the monitoring period, building and system characteristics, participants sample, monitoring equipment and survey distribution of each of the three case studies. Afterwards, the data preparation process is specified (e.g. data cleaning, variable creation and data merging).
Data collection approach
Longitudinal measurements, i.e., repeated measures of occupant satisfaction with IEQ, were performed to collect data suitable for (1) determining the main influencing variables of occupants’ satisfaction with IEQ via statistical analysis, and (2) construction of predictive satisfaction models. The measurement campaigns consisted of simultaneously monitoring the indoor environmental conditions in the classroom and assessing occupants’ satisfaction with IEQ via online surveys.
Survey structure and content
Occupants’ satisfaction was assessed via online surveys using the software Qualtrics24. The surveys could be accessed by the participants via a QR-code or weblink. A two-step approach was used to survey occupants’ satisfaction in the case study buildings. Firstly, a retrospective survey was distributed to gain more detailed insights on the occupants’ background (e.g. demographics, typical behaviour and occupant satisfaction during past period). Secondly, shorter right-here-right-now (RHRN) surveys were completed regularly by the participants in order to assess their immediate thermal and IAQ satisfaction over a longer period. The RHRN survey started by asking participants’ unique identifier which was used to track occupants’ responses throughout the data collection campaigns. After the data collection, the participants’ identifiers were pseudonymised to protect their privacy. Occupants’ satisfaction was assessed on a five-point rating scale (see Fig. 1A). This five-point rating scale was used as it resembled a commonly used customer scale to review products and services (e.g., Google Maps25, Trustpilot26, Bol27). Consequently, the IEQ satisfaction assessment became clear and recognisable for participants, which limited their time investment in the survey. The midpoint did not represent a neutral point and was labelled as ‘satisfied’ to participants. As a result, occupant responses could be categorised into ‘dissatisfaction’ and ‘satisfaction’ without neglecting neutral responses or assuming neutral responses resembled satisfaction. All four domains of the IEQ (i.e., IAQ, thermal, acoustic and visual comfort) and the overall IEQ were assessed by the participants. Scores <3 were categorised as dissatisfied, and scores ≥ 3 as satisfied. In case of a score lower than 3, a follow-up question appeared in which the participant could indicate the reason for his/her dissatisfaction.
Depending on the case study, different questions were posed before or after the satisfaction assessments. These questions asked about personal and contextual factors (e.g., clothing level, location in the classroom or window opening) or aimed to assess occupants’ study engagement (Case 3) or survey fatigue (Case 2). Figure 1B shows the structure of the RHRN survey distributed in a secondary school (Case 1). A detailed overview of the content and structure of the surveys used in each case study can be found in the survey contents file at the data repository28.
Informed consent procedure
At the start of each data collection campaign an informed consent procedure was performed. The procedure started with informing the participants were about the content, aim and usage of the surveys. Afterwards, their consent to participate with the data collection campaign was asked at the start of the first survey they completed. In addition, the participants consented to their data being used for scientific purposes and publication, provided that their privacy and anonymity were protected. Additional measures were taken in case the participants were minors. An informed consent form was provided to a parent or legal guardian of the participants which were minors. In case the participant was younger than 16 years, the form had to be signed by a parent or legal guardian. In case the participant was 16 years or older, the parent or legal guardian was not required to return a signed version of the form. However, the form stated that parents or legal guardians could object to their child’s participation via the contact details of the main researcher provided. The data handling and informed consent procedure used during the data collection campaigns was approved by the KU Leuven Social and Societal Ethics Committee under file number G-2021-3074.
Overview collected variables
The measurement campaigns in the different case studies consisted of both continuous IEQ-monitoring and regular satisfaction assessments via RHRN surveys. The number of collected variables during the measurement campaigns differed between the cases. Additional thermal and IAQ instruments were used during the data collection campaign in the test lecture room (case 3). Due to the larger number of classrooms monitored in the secondary and primary school case studies, it was not practically feasible to use these devices in case studies 1 and 2. Consequently, the impact of variables such as air velocity, total volatile organic compounds (TVOC) and particulate matter (PM) concentration on occupant satisfaction can only be studied in case 3. An overview of the collected variables and survey characteristics are given in Tables 1 and 2, respectively. Note that in Table 2 a differentiation is made between “Thermal sensation” and “Thermal comfort”. The first referred to an evaluation of occupants’ thermal sensation on the ASHRAE 7-point thermal sensation scale (i.e., ranging from cold to hot) which is related to the PMV-PPD model29. The latter resembled a simplified 3-point scale evaluating occupants’ thermal comfort perception (i.e., ‘uncomfortably cool’, ‘comfortable’ and ‘uncomfortably warm’). More details on the survey questions and scale can be found in the survey contents file at the data repository28.
Selection and description of case studies
Data collection campaigns were performed in multiple classrooms of educational buildings in Flanders (Belgium). The classrooms were located in climate zone 4 A following the ASHRAE climate classification30, representing a mixed-humid climate. The data collection campaigns were divided into three main case studies, i.e., three classrooms in a secondary school (Case 1), seven primary school classrooms (Case 2), and a test lecture room at KU Leuven Ghent Campus (Case 3). The three case studies were selected to cover a comprehensive range of classrooms. Consequently, the case studies varied in occupant (i.e., occupant age and education level), building (i.e., older and more recent buildings) and system (i.e., with and without ventilation system) characteristics. In addition, the selection was based on the practical feasibility to perform longitudinal measurement campaigns (e.g., motivation of occupants to participate and possibility to place environmental measurement devices).
The measurement duration in the cases varied from six weeks to six months. This period was sufficiently long to establish a useful dataset for further analysis. The measurement durations in the presented dataset is in line or longer than existing studies with durations of two weeks23, three months12 and six months17.
Case 1: Secondary school
Monitoring period
21/04/2021 – 11/06/2021.
This period represents intermediate season conditions in Belgium.
Building and system characteristics
A secondary school in Haacht (Belgium) was selected as a first case study for this research. In total three classrooms were selected, which will be referred to as classrooms A, B and C (Fig. 2). Classroom C was located in an older school building dating back to 1961. This older building lacked a mechanical ventilation system, leading to classroom C being ventilated through manual opening of the windows. Hydronic radiators in the room provided heating for classroom C. Classrooms A and B were situated on the top floor of a new school building that finished construction in 2019. This recent school building consisted of 4 floors, basement included, and was equipped with a balanced mechanical ventilation system. This ventilation system had a total ventilation rate of 11500 m³/h and a rotary heat recovery with an efficiency of 81%. The ventilation flow rate per classroom was 690 m³/h and 25 m³/h per person. A constant air flow rate was provided by the ventilation system from 7 h until 18 h and was fully disabled outside these operation times. Classrooms A and B were each equipped with two inlet air openings. The extraction opening for classrooms A and B was situated in the hallway. Classrooms A and B had a similar layout and positioning of the inlet air openings Furthermore, both classrooms were equipped with a floor heating system. The heating setpoint in classrooms A and B was set to 21.5 °C. None of the three classrooms was equipped with a cooling system. All classrooms were located at least 120 m away from a regional road. A detailed overview of the classroom properties is provided in Table 3.
Participants sample
In total 65 people participated in the survey campaign, of which 63 were students aged 14–17 years old. The two remaining participants were female teachers in the age category of 35–40 years old. The same group of occupants was questioned in the same classroom. Table 4 gives an overview of the demographics of the participating pupils.
Monitoring equipment
Three IEQ measurement devices (Renson Sense31) were placed in each classroom at different locations, i.e., in the front, middle and back of the classroom and at a height of approximately 1.5 m. The different measurement points made it possible to determine local IEQ conditions in the classroom and to detect whether any of the devices measured incorrectly. The devices measured the following IEQ-variables with a 5-minute timestep: temperature, relative humidity, light intensity, CO2-concentration and sound level. The detailed properties of the measuring device can be found in Table 5. Furthermore, the outdoor air temperature was measured with a timestep of 10 minutes by a temperature sensor mounted on the North façade of the most recent school building at a height of 6 m.
Survey distribution
A survey planning was designed, based on the course schedules of the students, which proposed multiple moments to complete the survey. The same planning was used each week, since the course schedule remained unchanged during the monitoring period. We distinguished two types of completion moments, i.e., at the start and end of a course. The number of planned survey completion moments per classroom is show in Table 6. The number of voting moments per week in classroom C is lower compared to classrooms A and B since the classroom C was less occupied by the selected group of students. The participants completed the online survey via their personal smartphone. A QR-code linking to the online survey was distributed in the classrooms to simplify the accessibility of the survey.
Case 2: Primary schools
The data collection campaign in case 2 was performed as part of a master thesis32.
Monitoring period
21/10/2021 – 14/12/2021.
This period represents heating season conditions in Belgium.
Data collection was paused from 01/11/2021 until 07/11/2021 due to a school holiday.
Building and system characteristics
Seven classrooms from five different primary schools in Flanders (Belgium) were monitored for six weeks from October until December 2021. Each of these classrooms was occupied by the same group of pupils and teachers. Three of the classrooms (i.e. A, B and C) were identical and located in the same school. More detailed information on the classrooms is shown in Table 7.
Participants sample
A total of 127 pupils and 8 teachers participated at the data collection campaigns in the primary schools. Only one out of eight teachers were male. The age of the teachers varied from 23 to 52 years and was on average 32.1 years. Table 8 gives an overview of the gender and average age of the pupils per classroom.
Monitoring equipment
The classroom IEQ conditions were monitored using a HOBO-MX, HOBO U12-012 and Netatmo smart weather station. The three measurement devices were placed at opposite sides of each classroom and at different heights ranging from 0.8 to 1.5 m. The sampling frequency was set to 5 minutes. Table 9 gives an overview of the measured IEQ variables with the measurement accuracy per monitoring device.
During the monitoring period, all classrooms were equipped with an additional CO2-monitoring device to inform the pupils and teachers about the IAQ condition in the classroom. The monitoring device was cased as a bird in order to make it interesting for pupils (Fig. 3). The monitoring device had two functions, i.e., changing eye colours and whistling. The eye colour changed from green to blue and from blue to red at a CO2-level of 800 ppm and 1200 ppm respectively. The device was set to whistle at 1000 ppm. The whistling function was only activated from the fourth week of the monitoring campaigns.
Example of the CO2 monitoring device44 that present during the data collection campaigns in the primary school classrooms.
Survey distribution
A survey completion schedule was designed and sent to the participating schools. The participating classes were asked to complete the RHRN surveys five times a day for two days a week (i.e., Monday or Tuesday and Thursday or Friday). Of the five completion moments for the RHRN survey, three assessed the satisfaction upon entering the classroom (i.e., start of the day, after the morning break, after afternoon break), and two assessed the satisfaction when pupils and teachers were in the classroom for some time (i.e., before the morning break, end of the day). The two survey completion moments, i.e., upon entering and before leaving the classroom, were used to assess participants’ satisfaction before and after adapting to the indoor environment. Two survey versions were distributed, i.e., a pupil and a teacher version. The pupil-version comprised of simpler language to make the survey more understandable for young pupils. The pupils completed the online survey via laptops of the schools that were available in the classrooms.
Furthermore, pupils’ motivation for filling in the survey and their interest in the study were assessed at the end of a school day. From the fourth week of the monitoring campaign, the RHRN survey was extended with two questions. One question asked about the whistling of the CO2-monitoring device. In a second question, the pupils could indicate which actions he/she or the teacher took to improve the IAQ in the classroom.
Case 3: Test lecture room (TLR) – KU Leuven Ghent campus
Monitoring period
30/11/2021 – 25/05/2022.
This period represented heating and intermediate season conditions in Belgium.
Measurements ran over three different experiment campaigns. The first ran from 30/11/2021 to 20/01/2022. The second campaign was performed during a full academic semester from 14/02/2022 to 19/05/2022. The last experiment campaign consisted of six consecutive study sessions on Tuesday afternoons from 13:30–17:30 from 19/04/2022 to 25/05/2022.
Building and system characteristics
Case study three was situated in a nearly-zero energy (nZEB) educational building, located in Ghent, Belgium. The educational building33 comprised of two lecture rooms used by university and college students, a staircase and a technical room. The two lecture rooms were situated above each other and were identical in dimensions and in the positioning of ventilation openings. The lecture rooms differed in envelope composition. The lecture room on the 1st floor had a brick external wall with exterior insulation, an air tightness of 0.41 h−1 at 50 Pa, while the 2nd floor lecture room had a lightweight timber frame external wall with the same U-value (0.15 W/m2.K) and an airtightness of 0.29 h−1 at 50 Pa. The roof and floor have U-values of 0.14 and 0.15 W/m2.K respectively. The building had triple-glazed windows, with a U-value of 0.65 W/m2.K, on the southwest and northeast façades, having a g-value of 0.52, a window-to-wall ratio (WWR) of 27% and a window-to-floor ratio (WFR) of 13%. The windows were equipped with internal and external solar shading consisting of screens. The external screens were located on the southwest windows and closed automatically when the incident solar radiation exceeds 250 W/m², however, this control could be manually overruled by the users.
The building was equipped with a variable air volume (VAV)-based all-air HVAC system. The ventilation flow rate and supply air temperature were demand-controlled based on a room temperature and CO2-setpoint. The all-air system had a maximum air flow rate of 2200 m³/h per lecture room leading to 27.5 m³/h per person at a maximum occupancy of 80 occupants. The air was supplied to the space via a displacement ventilation system (four diffusers in the corners) and exhausted via a grid at the back of the lecture room situated at a height of 2 m. The lecture rooms had a floor area of 140 m² and a volume of 380 m³.
The building was cooled by three techniques of ventilative cooling: (1) natural night ventilation with the windows serving as supply and exhaust openings, (2) modular bypass in the air handling unit (AHU), (3) indirect evaporative cooling (IEC) with a maximum capacity of 13.1 kW. As for heating, the air was pre-heated by air-to-air heat recovery via two crossflow plate heat exchangers connected in series with an efficiency of 78%. Furthermore, heating coils with a capacity of 7.9 kW were installed in the supply ducts of each lecture room. The two lecture rooms served as a living lab for the Building Physics and Sustainable Design research group at KU Leuven. The test lecture rooms were thermally insulated from each other. The lower lecture room was used as the case study classroom. We referred to the classroom as the test lecture room (TLR). An inside view of the TLR is shown in Fig. 4. More information on the TLR can be found in the building descriptive paper33.
During the experimental campaigns, the room temperature and CO2 setpoints were varied in order to expose the participants to different conditions. The room temperature setpoint was changed between 18 °C to 26 °C. Originally, the classroom used the setpoint of 23 °C. Furthermore, two different CO2-setpoints were set, i.e., 500 ppm (assuming maximum ventilation flow rate during occupancy) and 1000 ppm.
Participants sample
The group of participants varied between the three experiment campaigns. During the first experimental campaign, the participants comprised of a group of international students who had courses in the TLR classroom for a full semester. Thirty-nine of these students participated in the data collection campaign. Students were mostly a whole day in the classroom for five days a week. The second experimental campaign consisted of two groups of Belgian bachelor students of which 65 participated in the data collection campaign. The third experimental campaign included 17 Belgian volunteers who were recruited via a flyer and poster campaign at the university campus. The 17 volunteers participated at six study session where they could work or study while being present in the TLR. An overview of the participants’ gender and age is given in Table 10.
Monitoring equipment
The TLR classroom was equipped with multiple sensors and a weather station to measure both indoor and outdoor environmental conditions with a timestep of 1 minute. Moreover, an IAQ sensor (Ethera NEMo34) was placed in the middle of the classroom that had a sampling frequency of 10 minutes. Furthermore, a thermal comfort meter (Testo 48035) was placed in the classroom to monitor additional properties of the thermal environment in the classroom (i.e., air velocity and mean radiant temperature) with a timestep of 1 minute. The measured variables and their accuracies are shown in Table 11.
Survey distribution
An English version of the survey was provided for the international participants. The survey completion schedule differed between the experimental campaigns. During the first experimental campaign, a survey completion schedule was proposed to the participants based on their courses in the TLR. Participants of the second experimental campaign were asked to complete the survey upon entering and leaving the TLR. Consequently, the survey completion frequency depended on how often the students had courses in the TLR, i.e., one to four times a week. The third experimental campaign aimed to maximise the number of survey responses per occupants. Therefore, the participants of the third experimental campaign were asked to complete the RHRN survey upon entering and leaving the TLR when attending a study session. In addition, a 30-minute timer was active during the study sessions which informed participants to complete the RHRN survey when the timer finished. Participants completed the online survey using their personal smartphone or laptop. To facilitate access to the online survey, QR codes linking to the survey were distributed to participants.
The RHRN surveys also comprised of questions assessing the study engagement of the participants. In addition, participants of the second and third experimental campaign completed two cognitive tests to evaluate their memory and processing speed. Results of the study engagement and cognitive performance of the participants can be found in a previous publication36.
Dataset construction
The collected survey data, including the satisfaction assessments, was combined with IEQ, outdoor climate and HVAC system-related variables into one dataset. The responses from the RHRN survey were merged with the nearest environmental measurement in time of the IEQ sensors or weather station. The data was merged based on the nearest measurement in time since the RHRN survey evaluated occupants’ satisfaction with the IEQ conditions at the time of completing the survey. After the data was merged, a data cleaning process was performed which comprised of deleting incomplete survey responses from the dataset. A survey response was incomplete when an assessment of the occupant’s satisfaction was lacking. No further data cleaning was performed, but the authors recommend checking the time between the survey response and a measured environmental variable. In cases where the time difference exceeds a chosen threshold (e.g., 15 or 30 minutes), the value of the environmental variable should be set to NaN. As a result, only corresponding environmental conditions are considered in future analyses on the datasets.
Data Records
Summary
All data records have been uploaded to KU Leuven research data repository (RDR) and include four datasets files, a parameter description file, a survey description file and metadata28. A total of 6834 satisfaction assessments from 321 occupants were collected throughout the data collection campaigns in the three case study classrooms after data cleaning. Figure 5 gives an overview of the number of collected satisfaction assessments and participants per case study
Contents
This section describes the different files in our datasets.
FullDataset_SecondarySchool.csv
This datafile consists of 834 rows in which each row represents an occupant survey response with the measured environmental conditions in the classroom for case 1. The file format is a UTF-8 comma-separated csv-file. This datafile comprises of 86 columns each representing a different parameter (e.g., timestamp, survey response, environmental measurement, …). A detailed overview and definition of each parameter can be found in the “Dataset_ParameterDescription.xlsx”-file.
FullDataset_PrimarySchools.csv
This datafile consists of 5101 rows in which each row represents an occupant survey response with the measured environmental conditions in the classroom for case 2. The file format is a UTF-8 comma-separated csv-file. This datafile comprises of 86 columns each representing a different parameter (e.g., timestamp, survey response, environmental measurement, …). A detailed overview and definition of each parameter can be found in the “Dataset_ParameterDescription.xlsx”-file.
FullDataset_TestLectureRoom.csv
This datafile consists of 899 rows in which each row represents an occupant survey response with the measured environmental conditions in the classroom for case 3. The file format is a UTF-8 comma-separated csv-file. This datafile comprises of 145 columns each representing a different parameter (e.g., timestamp, survey response, environmental measurement, …). A detailed overview and definition of each parameter can be found in the “Dataset_ParameterDescription.xlsx”-file.
FullDataset_CombinedSimple.csv
This datafile consists of 6834 rows in which each row represents an occupant survey response with the measured environmental conditions in the classroom for all cases. The file format is a UTF-8 comma-separated csv-file. This datafile only includes a set of key variables that were measured in all three cases (e.g., occupant satisfaction scores, room temperature, relative humidity,…). Therefore, the datafile comprises of only 59 columns each representing a different parameter (e.g., timestamp, survey response, environmental measurement, …). A detailed overview and definition of each parameter can be found in the “Dataset_ParameterDescription.xlsx”-file.
Dataset_ParameterDescription.xlsx
This file provides a description of all the parameters (i.e., column headers) in the four datafiles: “FullDataset_SecondarySchool.csv”, “FullDataset_PrimarySchools.csv”, “FullDataset_TestLectureRoom.csv” and “FullDataset_CombinedSimple.csv”. The datafile has an xlsx-format and comprises of four tabs, i.e., one for each datafile.
OccupantSurveys.pdf
This file provides a description of the occupant survey structure and content used in each of the three cases. The document provides an overview of all survey questions and response scales in the original survey language of Dutch and translated into English. The file has a pdf-format.
README.txt
This file “README.txt” provides the necessary metadata for the dataset. The file has a txt-format.
Technical Validation
Survey validity
The survey questions and scales were tested via pilot tests for comprehensibility to ensure that the surveys assessed what they were intended to assess. The pilot tests were performed at the start of the data collection campaigns. During these pilot tests, the surveys ended with an optional open text field where participants could provide feedback on the survey content. Participant feedback was checked and used to adapt the survey and increase its comprehensibility. The pilot tests did not indicate any misunderstanding of the survey contents. In addition, the surveys that were distributed to minors (i.e., in secondary and primary schools) were reviewed by teachers to assess the comprehensibility of the surveys for the target age group of participants.
Measurement accuracy
The devices and sensors used to measure the environmental conditions are described in the Methods section. Multiple measurement devices were placed in the monitored classrooms to determine local IEQ conditions in the classroom and to detect whether any of the devices measured incorrectly. In addition, the measurement devices were positioned in locations that replicated the indoor environmental conditions experienced by the participants.
Since different measurement devices were used over the different case studies, differences in sampling frequencies of the measurements occurred (i.e., one, five and ten minutes). During the dataset construction, the occupants’ survey response is merged with the environmental measurement that is nearest in time (see Dataset construction section). Consequently, the difference in sampling frequencies is mitigated in the final dataset.
Potential data losses were indicated in the dataset by NA. In case 1, no environmental measurements were missing. In case 2, 999 lighting level measurements were missing due to a malfunction of the lighting measurement device in one of the monitored classrooms. In case 3, 29 datapoints lack a measurement of the indoor temperature, relative humidity and CO2-concentrations due to a short malfunction of the building monitoring system. In addition, 37 datapoints lack a measurement of TVOC- and PM concentrations. Moreover, air velocity was not measured for 63 datapoints. Finally, the lighting and sound level measurements (487 and 581, respectively) were missing since these variables were not measured for the full data collection campaign.
Occupant satisfaction assessments
Figure 6 shows the dissatisfaction rates per case study and per evaluated IEQ domain. The dissatisfaction rate was the percentage of satisfaction assessments with a score lower than three on the five-point rating scale. The results show that occupants were often dissatisfied with the thermal and IAQ conditions in their classrooms. However, differences between the case studies were present. In the secondary school, thermal and IAQ dissatisfaction rates were around 20% making it the highest of all evaluated domains. In contrast, occupants in the primary schools were mostly dissatisfied with the acoustical conditions (16%) in their classroom followed by the IAQ conditions (14%). In the test lecture room (case 3), the occupants showed the highest dissatisfaction with the thermal (13%) and visual environment (9%). A more detailed distribution of the thermal and IAQ satisfaction scores are shown per case in Figs. 7 and 8. These figures show that the peak in thermal and IAQ satisfaction scores was at 3 for the secondary school case while it was at 4 for the TLR case. The primary school case had a similar frequency of satisfaction score of 3, 4 and 5, thus not showing a clear peak as in the other two case studies.
The satisfaction assessments showcase an imbalance in the dataset, i.e., a larger proportion of assessments indicate satisfaction than dissatisfaction. However, imbalanced datasets are common in field data collection campaigns and are representative for real-life situations. Previous studies evaluating predictive models using field measurement data also faced imbalance, e.g., evaluated predictive thermal comfort models on more imbalanced data, e.g., “No change” (58%), “Cooler” (35%), “Warmer” (7%)17 and “No change” (70%), “Want cooler” (17%), “Want warmer” (13%)37. In buildings without specific issues (e.g. HVAC system malfunctions), it is expected that a majority of votes will indicate comfortable conditions.
Measured environmental conditions
A selection of three key environmental variables is investigated in more detail, i.e., room temperature, relative humidity and CO2-concentration. The descriptive statistic of each variable is visualised per case study. In addition, outlier detection was performed on for each set of environmental variables. A datapoint was considered an outlier when its value fell outside the range between the third quartile plus 1.5 times the interquartile range and the first quartile minus 1.5 times the interquartile range.
Figure 9 gives an overview of the values for room temperature in the datasets for all three cases. For case 1 (i.e., secondary school), the descriptive statistics shows that the measurement data comprised mostly of indoor temperature which tended to be high (i.e., average temperature of 23.9 °C). 37 measured values for room temperature were classified as an outlier as they exceeded a temperature of 26.8 °C. The maximum value measured for room temperature in case 1 was 28.7 °C. These results suggest that a short period of overheating occurred during the measurement campaign in case 1. On the contrary, the room temperature in measured in the primary school classrooms (case 2) was lower with an average temperature of 18.9 °C. 32 outliers were detected for room temperature as they were lower than 14 °C. These low room temperatures occurred due to a short malfunction of the heating system in one of the monitored classrooms. Case 3 comprised of room temperature measurements ranging from cooler to warmer, i.e., 17.9 °C to 27.2 °C. The average room temperature measured in case 3 was 22.8 °C, and no outliers were detected.
Figure 10 gives an overview of the values for relative humidity in the datasets for all three cases. Low relative humidities were measured in case 1 with a minimum value of 19.6% and an average value of 35.4%. No values for relative humidity were categorised as outlier in case 1. On the contrary, eight data points were determined outliers in case 2 as they exceeded the value of 63.5%. The values measured for relative humidity were higher in case 2 with an average value of 45.9%. In case 3, an average relative humidity value of 39.4% was measured. Additionally, five values were identified as outliers due to exceeding a threshold of 58.1%.
Figure 11 gives an overview of the values for CO2-concentration in the datasets for all three cases. In case 1, an average CO2-concentration of 720 ppm was measured which reached a maximum value of 1243 ppm. No values for CO2-concentration were detected as outlier in case 1. On average, the measured CO2-concentrations were lower with a mean value of 616 ppm. A total of 202 CO₂ concentration values exceeded 980 ppm in case 2 and were therefore categorised as outliers. The average of the measured CO2-concentrations in case 3 was 652 ppm. 100 values were categorised as outlier as they exceeded a CO2-concentartion of 881 ppm. Many of these outliers were measured during a brief period when that the demand-controlled ventilation (DCV) system was not operational during the data collection campaign. During this malfunction, the CO2-concentration reached a maximum value of 2201 ppm.
Usage Notes
COVID-19 measures
The data collection campaigns in the secondary (case 1) and primary schools (case 2) were performed when COVID-19 measures were still active in Belgian schools. Consequently, occupants were required to wear face masks during the data collection campaigns. Wearing a face mask could have impacted occupants’ perception of the IEQ. Few studies have investigated the impact of face masks on human perception of the IEQ. One such study, by Lang et al.38, showed that occupants’ thermal comfort, thermal acceptability and perceived air quality decreased over time when present in an office space with a room temperature and relative humidity of around 23 °C and 40%, respectively, while wearing face masks. Another study by Tang et al.39 found minor difference in thermal comfort between students with and without face masks in a university library. The authors quantified that the preferred operative temperature of occupants with a face mask was 0.5 °C lower than occupants without a face masks39. On the contrary, a third study did not find a significant impact of wearing a face mask on occupants’ thermal comfort that were present in climate chamber conditioned at 24 °C, 26 °C and 30 °C40. To conclude, robust and conclusive evidence on the impact of face masks on occupants’ IEQ perception is still missing.
In addition to the obligation to wear face masks, schools in cases 1 and 2 were advised to open windows as much as possible as part of the measures to combat the spread of the COVID-19 virus. As a result, the window opening behaviour differed from what was expected during heating season with at least one or more windows being opened during occupancy. The impact of the window opening will be reflected in the measured indoor conditions, such as a lower indoor temperature and CO2-concentration. Consequently, the conditions during these measurement campaigns deviated from what would be expected under normal conditions (i.e., in the absence of active regulations to reduce the risk of respiratory infection).
The active COVID-19 measures may lead to atypical conditions. However, given the proven effectiveness of face masks in reducing the transmission of respiratory diseases, their use can be expected to become a standard preventive measure in future scenarios such as seasonal flu outbreaks or emerging epidemics. Future researchers using the dataset should be aware of the contextual conditions of the data collected in the secondary and primary school classrooms due to the active COVID-19 measures. Specifically, findings based on these case studies may face limitations in generalising to non-pandemic conditions. Nevertheless, COVID-19 measures can be treated as a confounding contextual factor when analysing this dataset alongside non-pandemic data.
Outdoor climate conditions
The current dataset is limited to measurements performed in educational buildings located in the region of Flanders (Belgium). As a result, all measurements were collected under the same climate condition, ASHRAE climate zone 4 A (mixed-humid climate)30. The climatic homogeneity of the current datasets limits the generalisability of findings to other regions with different climate zones. Therefore, if users aim to develop and validate a multi-climate comfort model, the current dataset should be supplemented with additional data from different climate zones.
Data availability
The dataset is available at KU Leuven Research Data Repository: https://doi.org/10.48804/CZQNUU.
Code availability
Additional variables were calculated and added to the published datasets. The air enthalpy (EA) and running mean outdoor temperature (Trm) were added using the relevant functions in the “pythermalcomfort” package41 for Python. EA was added to the datasets because previous studies have shown it relationship with occupants’ IAQ perception42,43. Trm was included in the datasets since it is part of the adaptive thermal comfort model11. The “pythermalcomfort” package41 is an open-source toolkit to compute thermal comfort indices in Python.
The study did not use any custom code to generate data.
References
Lee, J. Y., Wargocki, P., Chan, Y. H., Chen, L. & Tham, K. W. Indoor environmental quality, occupant satisfaction, and acute building-related health symptoms in Green Mark-certified compared with non-certified office buildings. Indoor Air 29, 112–129 (2019).
Wargocki, P., Wyon, D. P., Baik, Y. K., Clausen, G. & Fanger, P. O. Perceived Air Quality, Sick Building Syndrome (SBS) Symptoms and Productivity in an Office with Two Different Pollution Loads. Indoor Air 9, 165–179 (1999).
Lan, L., Wargocki, P. & Lian, Z. Quantitative measurement of productivity loss due to thermal discomfort. Energy Build. 43, 1057–1062 (2011).
Kim, J. & de Dear, R. Nonlinear relationships between individual IEQ factors and overall workspace satisfaction. Build. Environ. 49, 33–40 (2012).
Wargocki, P., Porras-Salazar, J. A. & Contreras-Espinoza, S. The relationship between classroom temperature and children’s performance in school. Build. Environ. 157, 197–204 (2019).
Wargocki, P., Porras-Salazar, J. A., Contreras-Espinoza, S. & Bahnfleth, W. The relationships between classroom air quality and children’s performance in school. Build. Environ. 173 (2020).
Brink, H. W., Loomans, M. G. L. C., Mobach, M. P. & Kort, H. S. M. Classrooms’ indoor environmental conditions affecting the academic achievement of students and teachers in higher education: A systematic literature review. Indoor Air 31, 405–425 (2021).
Chinazzo, G. et al. Quality criteria for multi-domain studies in the indoor environment: Critical review towards research guidelines and recommendations. Build. Environ. 226 (2022).
Fanger, P. O. Thermal Comfort: Analysis and Applications in Environmental Engineering. (1970).
de Dear, R. J. & Brager, G. S. Developing an adaptive model of thermal comfort and preference. ASHRAE Trans. 104, 145–167 (1998).
Nicol, J. F. & Humphreys, M. A. Adaptive thermal comfort and sustainable thermal standards for buildings. Energy Build. 34, 563–572 (2002).
Babich, F., Torriani, G., Corona, J. & Lara-ibeas, I. Comparison of indoor air quality and thermal comfort standards and variations in exceedance for school buildings. J. Build. Eng. 71, 106405 (2023).
Bluyssen, P. M. Health, comfort and performance of children in classrooms – New directions for research. Indoor Built Environ. 26, 1040–1050 (2017).
Kim, J., Schiavon, S. & Brager, G. Personal comfort models – A new paradigm in thermal comfort for occupant-centric environmental control. Build. Environ. 132, 114–124 (2018).
A Martins, L., Soebarto, V. & Williamson, T. A systematic review of personal thermal comfort models. Build. Environ. 207, 108502 (2022).
Xie, J., Li, H., Li, C., Zhang, J. & Luo, M. Review on occupant-centric thermal comfort sensing, predicting, and controlling. Energy Build. 226, 110392 (2020).
Tartarini, F., Schiavon, S., Quintana, M. & Miller, C. Personal comfort models based on a 6‐month experiment using environmental parameters and data from wearables. Indoor Air 32, 1–16 (2022).
Bogatu, D.-I., Kazanci, O. B., Watanabe, F., Kaneko, Y. & Olesen, B. W. Personal comfort model for automatic control of personal comfort systems. in In Press.: CLIMA 2022 1–8 (2022).
Kim, J., Zhou, Y., Schiavon, S., Raftery, P. & Brager, G. Personal comfort models: Predicting individuals’ thermal preference using occupant heating and cooling behavior and machine learning. Build. Environ. 129, 96–106 (2018).
Arakawa Martins, L., Soebarto, V. & Williamson, T. Performance evaluation of personal thermal comfort models for older people based on skin temperature, health perception, behavioural and environmental variables. J. Build. Eng. 51 (2022).
Földváry Ličina, V. et al. Development of the ASHRAE Global Thermal Comfort Database II. Build. Environ. 142, 502–512 (2018).
Gao, N., Marschall, M., Burry, J., Watkins, S. & Salim, F. D. Understanding occupants’ behaviour, engagement, emotion, and comfort indoors with heterogeneous sensors and wearables. Sci. DATA 9 (2022).
Jayathissa, P., Quintana, M., Abdelrahman, M. & Miller, C. Humans-as-a-sensor for buildings—intensive longitudinal indoor comfort models. Buildings 10, 1–22 (2020).
Qualtrics. Qualtrics. https://www.qualtrics.com/ (2005).
Google. Google Maps. www.google.be/maps (2024).
Trustpilot A/S. Trustpilot. https://nl.trustpilot.com/ (2024).
Bol. Bol.com. https://www.bol.com/be/nl/ (2024).
Carton, Q., Kolarik, J. & Breesch, H. Data on occupant satisfaction with the indoor environmental quality in classrooms. at https://doi.org/10.48804/CZQNUU (2025).
ISO 7730. Ergonomie van de thermische omgeving - Analytische bepaling en interpretatie van thermische behaaglijkheid door berekening van de PMV- en PPD-waarden en door criteria voor de plaatselijke thermische behaaglijkheid (2005).
ASHRAE. ANSI/ASHRAE Standard 169-2013 Climatic Data for Building Design Standards. ASHRAE Standard vol. 8400 (2013).
Renson. Sense. https://www.renson.eu/gd-gb/producten-zoeken/ventilatie/mechanische-ventilatie/units/sense (2021).
Vanden Broeck, S. & Van Roy, V. Evaluatie van de gebruikerstevredenheid met het binnenklimaat in Vlaamse lagere scholen. (KU Leuven - Faculty of Engineering Technology, 2022).
Breesch, H., Wauman, B., Klein, R. & Versele, A. Design of a new nZEB test school building. REHVA J 53, 17–20 (2016).
Ethera. Ethera Catalogue. http://www.ethera-labs.com/wp-content/uploads/2020/01/Catalogue_ethera_en_ld.pdf (2017).
Testo. testo 480 - High-end klimaatmeter. https://www.testo.com/nl-NL/testo-480/p/0563-4800 (2017).
Carton, Q., De Coninck, S., Kolarik, J. & Breesch, H. Assessing the effect of a classroom IEQ on student satisfaction, engagement and performance. E3S Web Conf. 396, 01052 (2023).
Kim, J. et al. Occupant comfort and behavior: High-resolution data from a 6-month field study of personal comfort systems with 37 real office workers. Build. Environ. 148, 348–360 (2019).
Lang, X., Vasquez, N. G., Liu, W., Wyon, D. P. & Wargocki, P. Effects of wearing masks indoors on the cognitive performance and physiological and subjective responses of healthy young adults. Build. Environ. 252, 111248 (2024).
Tang, T. et al. Investigation of the effects of face masks on thermal comfort in Guangzhou, China. Build. Environ. 214, 108932 (2022).
Lang, X., Liu, W., Wyon, D. P. & Wargocki, P. Wearing a mask reduces cognitive performance at neutral temperatures: Evidence from climate chamber experiments on heat acclimatized subjects in China. Build. Environ. 277, 112949 (2025).
Tartarini, F. & Schiavon, S. pythermalcomfort: A Python package for thermal comfort research. SoftwareX 12, 4–8 (2020).
Fang, L., Clausen, G. & Fanger, P. O. Impact of Temperature and Humidity on the Perception of Indoor Air Quality. Indoor Air 8, 80–90 (1998).
Piasecki, M., Kostyrko, K., Fedorczak-Cisak, M. & Nowak, K. Air enthalpy as an IAQ indicator in hot and humid environment—experimental evaluation. Energies 13 (2020).
Selectron. Little lilly. https://www.selectron.be/lilly (2021).
Acknowledgements
The authors would like to thank all the participants in the data collection campaigns. This work has been supported by the Flemish Agency for Innovation and Entrepreneurship (VLAIO) in the Flux50 project Towards Smart Ventilation in Mid-Sized Buildings (HBC.2020.2520). Furthermore, the authors acknowledge the support received of the Renson chair: “Data-driven servitization in creating healthy spaces” during the measurements in case 1, the citizens’ science project “BurgerSTEM” during the measurements in case 2, and the PWO project ‘Take cAIR’ during the measurements in case 3. The data handling and informed consent protocol used in this study has been approved by the KU Leuven Social and Societal Ethics Committee under file number G-2021-3074.
Author information
Authors and Affiliations
Contributions
Quinten Carton designed the methodology, performed the data collection campaigns, constructed the datasets, prepared and edited the paper draft. Jakub Kolarik designed the methodology, supervised the project, reviewed and edited the paper draft. Hilde Breesch designed the methodology, supervised the project, reviewed and edited the paper draft, and acquired funding.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Carton, Q., Kolarik, J. & Breesch, H. A dataset on occupant satisfaction with the indoor environmental quality in Belgian classrooms. Sci Data 13, 229 (2026). https://doi.org/10.1038/s41597-026-06545-4
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-026-06545-4













