Introduction

In many countries, children and youth are not meeting global physical activity guidelines,1 a finding that is particularly striking in older children. Recent meta-analyses highlight the considerable challenges faced in substantially increasing physical activity in children, whether via community-, school- or family-based approaches.2, 3, 4, 5 Despite these challenges, schools remain an opportune environment for intervention simply because most children attend school regularly and thus provide a captive audience. Although comprehensive school activity programmes may be more successful,6, 7 a lack of large, high-quality studies5, 8 creates uncertainty regarding the effectiveness of different approaches.9, 10 Given the lack of effect seen in many studies,5 investigation of alternative approaches, particularly those that involve little to no additional burden for school staff or the curriculum, is warranted. One unexplored approach is to try and increase physical activity by involving some risk and challenge.

Risky play holds an evolutionary role that allows children to test their limits within a known environment, and has multiple demonstrated benefits for development.11, 12, 13 Not exposing children to gradually increasing risk and challenge over time is thought to lead to an inability to assess real danger over the longer term.11, 14 However, concern about accidents and litigation influences what is perceived as acceptable play within the school playground, with many schools limiting rough-and-tumble play (for example, physical wrestling games or contact sports) or tree climbing.15, 16

Whether altering the school environment to provide greater opportunities for children to experience appropriate levels of risk and challenge during outdoor active play is a good way of increasing physical activity in children has only been examined in short-term studies.17, 18 The aim of our 2-year cluster randomised controlled trial (PLAY) was to determine whether providing greater opportunities for risk and challenge in primary schools increased physical activity and reduced relative body weight over the long term. A secondary aim considered the effect of the intervention on how children interacted with one another which forms the basis of a separate paper.19

Materials and methods

PLAY was a cluster randomised controlled trial in 16 primary schools in the Otago (n=8) and Auckland (n=8) regions of New Zealand. Schools were randomised to intervention (provided with funds to assist with altering school play spaces) or control (asked to not change anything in their school play spaces) conditions for 2 years (1-year intervention, 1-year follow-up). Ethical approval was obtained from the University of Otago (reference 10/077) and Auckland University of Technology (reference 10/95) Ethics Committees.

Recruitment of schools and students

State primary schools (years 1–8 that are fully funded by the state and co-educational) with at least 150 pupils, and a school decile ranking of 1–6 were eligible. New Zealand schools are ranked into deciles from 1 to 10, where 1 indicates the 10% of schools with the highest proportion of pupils from low socioeconomic areas and decile 10 indicates the 10% of schools with the lowest proportion.20 Eleven schools met these criteria within the Otago region and 31 in Waitakere City (within the Auckland region). Waitakere City (population: 186 447) and the Otago region (population: 193 803) are similar in population size,21, 22 but are culturally quite different.23 Eleven schools were approached in Otago and 10 in Auckland and recruitment stopped once 16 schools (eight in each region) provided informed consent to participate (November 2010 to March 2011). Pairs of schools were created by matching for region, school roll and decile ranking, and were randomly assigned to intervention or control by tossing a coin (Supplementary Table 1).

Although all children in intervention schools were exposed to the intervention, only children in school years 2 and 4 were invited to participate in outcome assessments. These years were chosen pragmatically to cover a wide age range (typically ages 6–9) and enable 2-year outcomes to be collected. There were no exclusion criteria. Information sheets and consent forms were sent home with all children in these year groups from each school, and signed consent was obtained from parents. Baseline data were collected from March to December 2011 and 2-year data collection was completed in December 2013.

Outcome assessments: schools

At baseline and 1 year a qualified playworker24, 25 undertook an evaluation of the play environment in each school according to a schedule of seven items rating opportunities for (i) risk and challenge, (ii) engagement with natural elements, (iii) ability to actively manipulate and change the play environment (for example, loose parts), (iv) wheeled play (for example, bicycles and skateboards), (v) ball games, (vi) children to socialise, and (vii) quality of independent access (no restrictions on the ability to access all parts of the school). Each item was scored from 1 (very poor) to 5 (excellent) and an overall score was determined for the whole school play environment (maximum score of 35). At each time point, principals completed an 18-item questionnaire assessing physical activity policies within their school (break time, using physical activity as a punishment, promotion of community activities, adequacy and availability of facilities during school/after hours, enjoyment and promotion of physical activity regardless of skill level, amount and quality of physical education, and safety issues).26 Principals indicated whether the policies were fully in place (score of 3), partially in place (2), under development (1), or not in place (0). At 2 years, exit interviews were undertaken to explore the teachers’ thoughts about PLAY and the associated outcomes for their school.27

Development of schools’ playground action plans

The researchers, playworker and school community worked together to develop a playground action plan that met the needs of each school community. Following baseline evaluations of their play space, each intervention school was provided with a list of tailored suggestions for improvements. This was specific to each school but could include the addition of more interactive play equipment, and alterations to school rules and policies that may limit risk-taking during play (for example, no tree climbing, separation of older and younger children into physically separate play areas), with all alterations meeting playground safety standards. The research team then met with each school community to finalise the plan. Although intervention schools were provided with initial start-up funds of NZD$15 000, the majority of recommendations involved no to little cost, such as leaving trees that had been cut down in pieces or letting the grass grow long to encourage imaginative play, re-purposing real-estate signs for sledding down hills, purchase of raincoats and gumboots to allow outside play when wet, and using plastic piping and sand for water play. Developing each school’s playground action plan from assessment through to implementation and completion took ~1 year and Supplementary Table 2 illustrates the changes made by schools.

Outcome assessments: children

Measurements were obtained at baseline, 1 year (2–3 months after changes to the intervention school play environments) and 2 years (follow-up) by researchers blinded to group allocation. Children’s height, weight and waist circumference were measured in light clothing without shoes using standard techniques during school time. Height was measured using a portable stadiometer (seca 213, Hamburg, Germany) and weight using electronic scales (seca 813, Seca, Seca 213, Hamburg, Germany). Waist circumference was measured at the umbilicus using a non-elastic tape (Lufkin 2 m Executive Thinline w606PM, Apex Tool Group, Sparks, MD, USA). All measures were obtained in duplicate with a third undertaken if the first and second measures did not fall within the maximum allowable difference (0.5 cm for height, 0.5 kg for weight and 1.0 cm for waist), and the mean taken from the two closest measurements. Body mass index (BMI) z-scores were calculated using World Health Organisation reference data.28

All children wore an accelerometer (ActiGraph GT3X, Actigraph Corp, Pensacola, FL, USA) 24 h a day for 7 days, positioned over the right hip (accelerometers were removed for bathing, showering and water-based activities). Accelerometers were initialised using ActiLife (version 6, Actigraph Corp) in uniaxial mode using 15 s epochs. Data were cleaned and scored using an automated script developed in MATLAB (MathWorks, Natick, MA, USA) that removes the appropriate sleep period for each day for each child individually, to avoid sleep being misclassified as sedentary time.29 A day was considered valid if there were at least eight valid awake hours.30 Non-wear time (awake hours only) was defined as at least 20 min of consecutive zeros.30 Participants were excluded from the analysis if fewer than three valid days of wear were obtained. Activity intensities were calculated using the Evenson cut-points developed for children aged 5–8 years.31 The specific start, end, break and lunch times for each school were used within the MATLAB programme to obtain filtered activity information for particular time periods (school day, break time, lunch time and ‘whole day’, which excludes the sleep period).

Statistical analysis

Assuming a mean (s.d.) of 100 (35) for minutes of moderate-to-vigorous physical activity (MVPA),32 intraclass correlation of 0.04 and correlation between repeat measures of 0.4,33 our study has 80% power (5% level of significance) to detect a difference in MVPA of 15 min with 97 children in each group. Assuming corresponding figures of 0.7 (0.85), 0.03 and 0.85 for BMI z-score,34 our study has 80% power (P<0.05) to detect a difference of 0.2 in BMI z-score with 106 participants per group. Accounting for clustering (2.56 for MVPA and 2.17 for BMI z-score) and 25% drop-out increases our required numbers to 331 per group.

The data were analysed according to modified intention to treat following the Consolidated Standards of Reporting Trials statement, which has been extended for cluster randomised trials.35, 36 Data were analysed using Stata 13.1 (StataCorp LP, College Station, TX, USA).

School-level data

Because pairs of schools were matched before randomisation by area and decile, paired t-tests were used to compare scores of the control and intervention schools for each of the seven items assessed during the play space evaluations that were undertaken at baseline and 1 year. The 18-item activity policy scores were summed and expressed as a percentage of the total maximum score (54). Regression analysis adjusting for the first measure was used to compare the final policy measure.

Student-level data

Because schools (not students) are the primary sampling unit, generalised estimating equations with an exchangeable correlation matrix were used to analyse the data. This type of model takes into account the correlational structure within each school and estimates population averaged models. Robust s.e.’s (based on the sandwich estimator) were used to calculate the confidence intervals and P-values. The models included the baseline variable and an interaction term between time and group so that differences at both year 1 and year 2 could be estimated. The models also included terms for sex, age at the initial assessment and time difference between assessments as well as terms for the pairs to take account of the randomisation procedure. Analysis was undertaken using modified intention to treat. That is, participants were included in the analysis if they had at least one repeat measure at 1 and/or 2 years for the relevant outcome.

The accelerometry data are presented for the whole day and specific periods of interest (for example, break times) as counts per minute (CPM, estimate of overall activity) and minutes of MVPA. Minutes of MVPA are adjusted for total time awake (as this differs within and between children) and time spent in less intense activity (sedentary and light activity) using the logarithm of the ratio of the proportions of both variables and fitted using the Stata procedure fmlogit, which fits a fractional multinomial logit model by quasi maximum likelihood.37 The estimate for proportion of MVPA during each time period (for example, school time) was multiplied by the number of minutes during that time period based on the means of all observations (for example, for school time this was 360 min). The difference between estimates is therefore presented in minutes. As the observation period for each child was different (depending on the child’s sleep duration and the start, finish and break times for each school), the estimated differences in proportion were scaled so that they applied to 360, 30, 60 and 800 min for the school day, break time, lunch time and the whole day (that is, mean total valid minutes, excluding sleep time), respectively.

The missing data were imputed using chained equations assuming that the data were missing at random.38, 39 Data for all the outcomes, CPM, BMI and BMI z-score were imputed in the same model, which also included terms for sex, age, group, school, school decile and accounted for the clustered nature of the study. Thirty data sets were imputed using a burn in length of 750.

Results

All eight intervention schools made substantial changes to their school play spaces, which promoted greater risk and challenge through building new areas (hills and tunnels), adding more dynamic equipment (for example, a Kompan ‘Supernova’ that requires co-ordination and teamwork to use; Supplementary Figure 1), or via relaxing the rules (letting children ride their bicycles or scooters on school grounds, allowing children to climb trees or play outside in the rain). Children were able to engage more with natural elements, create imaginative play opportunities with loose parts (carrying tree stumps around to form a spaceship or a jumping game where the ground cannot be touched) or long grass (places to hide, roll around in, throw mown grass at each other). Some schools also ensured greater equity in access for all ages, or made different aspects of the playground more attractive (Supplementary Table 2). These changes to the school play spaces were evaluated by the playworker at 1 year with a higher score indicating a better play space (Table 1). At 1 year, intervention schools provided more opportunities than control schools as indicated by a significantly higher overall summed score from the seven components (mean difference: 4.50 (95% confidence interval: 1.82 to 7.18, P=0.005)) as well as for the individual components of interest including risk and challenge (1.00 (0.11 to 1.89, P=0.033)) and wheeled play (1.25 (0.18 to 2.32, P=0.028)). Apart from safety surfacing, control schools did not make any substantial changes to their play environment. Nor were school policies relating to physical activity different between intervention and control schools; mean (s.d.) scores at 2 years were 76.4% (10.6) for control schools and 76.2% (10.4) for intervention schools (P=0.568).

Table 1 Play space evaluation scores and the differences between intervention and control schools at 1 year

The eight intervention schools were all positive about their experience in the PLAY Study. Interviewees reported many benefits resulting from intervention and these included their belief that children were fitter and more active, and that the altered play environments provided many more play opportunities as well as being more dynamic. These findings are discussed in detail in a separate paper.27

Figure 1 describes the flow of children throughout the study. A total of 840 were measured at baseline and 630 of these children remained at 2 years. Final analyses were undertaken in children for whom we had baseline and at least one follow-up measure (n=704 for MVPA and n=743 for BMI z-score). Only four children (0.5%) withdrew from the study, but a further 180 children were lost to follow-up (21.4% of the 840 baseline participants) from moving out of the area. Similar characteristics were observed between groups at baseline (Table 2). At baseline, 2.7% of children were underweight, 73.4% normal weight, 16.0% overweight and 7.9% obese.28

Figure 1
figure 1

Flow of participants in the PLAY study by group. aOtago (n=11), Auckland (n=10); bOtago (n=8), Auckland (n=8); cOtago (n=4), Auckland (n=4); dOtago (n=4), Auckland (n=4); erandomly excluded from consents received because of lack of accelerometers.

Table 2 Characteristics of the study population at baseline

No significant differences were observed between control and intervention groups in break time or lunch time activity at either 1 or 2 years, or in overall activity (CPM) across the school day (for example, 0900–1500 hours) or whole day (time the child woke up until they went to sleep) at 1 year (Table 3). However, at 2 years, CPM were significantly lower in the intervention group compared with the control group during the school day (adjusted difference, 95% confidence interval: −63, −107 to −20, P=0.005). A similar effect was seen for minutes of MVPA across the school day (−3.3 min, −6.1 to −0.6), P=0.017) and whole day (−3.8, −7.1 to −0.5, P=0.022). The small sample of schools limited school-specific analysis but examination of change in adjusted CPM during school time at 1 and 2 years indicated differences with no particular pattern for intervention or control schools (Supplementary Figure 2). Imputation of missing data did not influence the estimates for physical activity. For example, the imputed estimates for CPM during the whole day were −6 (95% confidence interval: −32, 20) at 1 year and −28 (−53, −2) at 2 years compared with −6 (−37, 24) and −28 (−66, 10) in the original analysis.

Table 3 Total physical activity and MVPA at different times of the day

Table 4 demonstrates that no statistically significant differences were observed at either 1 or 2 years for BMI, BMI z-score, waist circumference or waist-to-height ratio. As was observed for physical activity, imputed estimates did not alter our conclusions; estimates at 1 and 2 years for BMI z-score were −0.02 (−0.08, 0.04) and 0.02 (−0.04, 0.08) compared with −0.02 (−0.06, 0.02) and 0.02 (−0.04, 0.07) in the original analysis.

Table 4 Characteristics of the study population at each time point and adjusted differences in anthropometric outcome variables at 1 and 2 years

Discussion

Our findings demonstrate that schools were successful in altering their play spaces to encourage greater opportunities for risk and challenge in children’s play. Proposed changes were well received by schools, with all noting positive outcomes for children including improvements in behaviour and greater participation in physical activity.27 However, these anecdotal views of school staff were not supported by objective evidence from accelerometry, with similar levels of total activity and MVPA observed between intervention and control children across the study. We also found no effect of increasing risk and challenge in the school playground on anthropometric outcomes, which is perhaps not surprising given we did not affect physical activity or include a dietary component to our intervention.

It is difficult to compare our findings with other published studies because even though risk is an important component of play,11 studies investigating the role of risk and challenge in promoting physical activity are scarce. The only relatively comparable intervention is the Sydney Playground Project, which educated parents and teachers about the benefits of risk-taking in play.18 Although these sessions changed teacher and parental attitudes to risky play,40 the actual intervention of incorporating loose parts (that is, equipment with no obvious play value) had little effect on physical activity itself,18 similar to our findings. Although these authors reported a significant effect of the 13-week intervention on MVPA at break time (P=0.006), at only 1.8 min, this difference seems unlikely to be clinically significant given the errors inherent in estimation of MVPA, even when using objective measures such as accelerometry.5 Furthermore, this difference at break time did not translate to differences over the whole day, a compensatory effect that has been observed in other physical activity interventions in children.2, 41 It is also possible that the short-term nature of the Sydney intervention produced a novelty effect; although follow-up measurements were undertaken at 2 years, these were only undertaken in 16 children from one school, and unsurprisingly were not significant at this time. Our findings and those of the Sydney project should not be taken to mean that changing the school playground to include more risk and challenge is not beneficial for children or, indeed, schools in a wider context. Interviews with intervention schools clearly showed they believed the intervention to have been beneficial for their school community.27 It is feasible that use of accelerometry to assess physical activity may obscure real changes in the type or quality of play that intervention children reportedly engaged in such as building huts, climbing trees and imaginative play. Although these activities are designed to increase risk and challenge, they may not necessarily increase MVPA and accelerometers are not particularly effective at accurately measuring the upper-body movement involved in these behaviours.42 Alternative measures of activity assessment in children such as the System for Observing Play and Leisure Activity in Youth43 also classify physical activity into intensities and do not measure type or quality of play itself.

Alternatively, the lack of intervention benefit we observed may have been because our population was already sufficiently active (baseline mean MVPA of 67 min), leaving little opportunity for increases in MVPA. Although choice of physical activity cutoff can substantially affect levels of MVPA,44 CPM—which reflect overall activity and are independent of intensity cutoffs—also did not increase in our study, suggesting that choice of arbitrary intensity cut-point is not relevant. Certainly, the literature is clear that increasing physical activity has proven to be extremely challenging in children,2, 3, 4 whether referring to family-based3 interventions or those targeting the school environment45, 46 or curriculum.47 Why it is so difficult to substantially increase physical activity in children is not clear, but increasing evidence suggests that Comprehensive School Physical Activity Programs may offer the best solution,6 with several studies demonstrating their efficacy.7, 48, 49 Although schools in New Zealand are required to include many aspects of Comprehensive School Physical Activity Programs as part of the curriculum,50 analysis of our policy questionnaire illustrated that schools could still improve in this area, with overall scores of 76%.

The main strengths of the PLAY study were the extended follow-up over 2 years, which allowed us to evaluate potential sustainability, and the high retention rate (84–88% for the two main outcomes). All previous play interventions have been of short duration lasting a few weeks to months.18, 51, 52 Because of the inclusion of two different regions within New Zealand with markedly different ethnic distributions, we had a sample that reflected the wider New Zealand population.53 Because of the strong association between socioeconomic status and body size in children54 and a perceived greater need, we intentionally recruited less advantaged schools. Given the financial constraints within such communities, many of the recommendations made to each school involved zero to minimal cost changes including pipes for water play, boxes, tarpaulins and logs, which most schools, regardless of income, would have the capacity to incorporate. Although we did provide a reasonable level of funding to each school, most of this was spent on a single item of fixed equipment (Kompan ‘Supernova’), which represented only a small proportion of the overall changes made within each intervention play environment. Very few children withdrew from the study (n=4 of the baseline sample) and although a further 21.4% moved away from the area, this level of ‘transience’ or ‘churn’ reflects national data in New Zealand (15.7% nationwide55 and 22% in South Auckland schools).56 Finally, we measured physical activity objectively, and for a much longer period that is typically examined with most similar studies being limited by duration, size and/or measurement methods.

Our study also has some limitations. Because this was a community intervention in a real-world setting, each school developed their own plan with input from the research team that took variable amounts of time for implementation. Although this individualised approach should be viewed as a strength, it also meant that changes were being implemented in an ongoing manner as opposed to them all occurring together such that there was no single ideal time to measure changes in physical activity. For this reason, we chose to avoid measuring immediately after the majority of redevelopments to avoid any observable differences occurring due to novelty. It was also not possible to blind the play space evaluator to the school’s condition because of their involvement in assisting schools to develop their playground action plans. Some accelerometry data were also missing (4.4% at baseline, 6.3% at 1 year and 6.4% at 2 years), because the devices were not always worn as requested or malfunctions occurred. However, the results from imputation analyses showed that missing data did not change the estimates and made no difference to our findings.

In conclusion, although schools embraced the concept of introducing more risk and challenge to the school play environment and made numerous effective changes, these changes did not translate into significant benefits to activity or weight. It is possible that interventions such as ours will not influence physical activity to the extent that is required to get children more physically active and increase MVPA, but will instead improve the play experiences of children. Studies investigating children’s play would therefore benefit from the development of a tool which is able to more definitively assess how children play and any associated benefits. However, our findings also highlight the continued difficulties faced in effectively increasing physical activity in children over the long term. As current methods focusing solely on physical activity are only minimally successful at best, attempts to reduce excess body weight in children need to incorporate both physical activity and diet.