Long-term follow-up reveals non-utility of nonsurgical management in moderate adolescent idiopathic scoliosis: a comprehensive meta-analysis

Article information

Asian Spine J. 2025;19(4):638-651
Publication date (electronic) : 2025 June 24
doi : https://doi.org/10.31616/asj.2024.0358
1Spine Unit, University of Navarra Clinic, Madrid, Spain
2Vithas Internacional, Madrid, Spain
3Spine Surgery Section, Araba University Hospital, Vitoria, Spain
4Neuropediatrics Unit, San Carlos Clinic Hospital, Madrid, Spain
5Faculty of Medicine, Alexandria University, Alexandria, Egypt
6Mediterranean Observatory for Clinical and Health Research (OMEDICS), Valencia, Spain
7Institute for Research on Musculoskeletal Disorders, Valencia Catholic University, Valencia, Spain
Corresponding author: Ibrahim Khalil, Faculty of Medicine, Alexandria University, Alexandria 5372066, Egypt, Tel: +20-1141579270, E-mail: ibrahim.ibrahim1908@alexmed.edu.eg; ietfeiheh2016@gmail.com
Received 2024 September 2; Revised 2024 October 22; Accepted 2024 November 13.

Abstract

This systematic review and meta-analysis aimed to evaluate the outcomes of conservative management of moderate adolescent idiopathic scoliosis (AIS), focusing on long-term curve behavior, surgical rates, patient-reported outcomes, and the influence of follow-up duration. A comprehensive literature search was conducted adhering to PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines. Statistical analyses, using Review Manager, encompassed mean differences, risk ratios, pooled incidences, and random-effects models. Heterogeneity was assessed using the I2 statistic. The outcome measures were radiographic curve progression, percentage of patients with significant (>5°) progression, surgery rates, sagittal profile changes, back pain rates, quality of life scales, and functional outcomes. Thirteen studies involving 1,492 patients with AIS curves within 30°–45°, treated conservatively, with a minimum 10-year follow-up, were included. At 20+ years of follow-up (mean age, 34.5 years), curves worsened by an average of −5.69° (95% confidence interval [CI], −11.66 to 0.29). At 25+ years of follow-up (mean age, 49.8 years), curves worsened by an average of −14.32° (95% CI, −20.14 to −8.50). The incidence of significant progression was 35.68% (95% CI, 22.85 to 48.50). The surgery rate was 14.20% (95% CI, 0.87 to 27.53). Sagittal alignment (thoracic kyphosis and lumbar lordosis) remained within normal ranges at the final follow-up, with no significant changes from baseline. Back pain rate was 63.35% (95% CI, 38.36 to 88.34). These findings highlight the alarming incidence of curve progression and pain in conservatively managed AIS patients. A critical re-evaluation of conservative versus operative indications is imperative to mitigate long-term impacts and improve outcomes for this population.

Graphical Abstract

Introduction

Adolescent idiopathic scoliosis (AIS), the most prevalent type of scoliosis, affects 2%–3% of adolescents worldwide, although its underlying causes remain unclear [1]. AIS is characterized by a lateral curvature of the spine exceeding 10°, typically emerging during puberty without an identifiable medical cause [1]. Clinical guidelines recommend bracing for moderate curves between 25°–30° and 40°–45° [2]. Bracing aims to prevent curve progression through the application of external force. However, its effects are complex, with studies showing post-treatment loss of correction and improvement. Its efficacy also depends on wear time, with greater adherence linked to superior radiographic outcomes [3].

The long-term efficacy of bracing and the impact of factors including patient age, curve flexibility, skeletal maturity, and adherence to wear protocols, require further investigation. Untreated AIS can have psychological consequences [4]. While bracing aims to avoid surgical intervention, its influence on curve behavior has not been fully elucidated. Studies have yielded mixed results, with some studies reporting decreased correction over time, while others have documented improvements [4]. Treatment strategies vary based on curve severity. Mild curves with minimal progression risk may require only observation after bracing, whereas severe curves often necessitate surgical intervention for spinal alignment. Although bracing seeks to correct spinal curvature, it is not without limitations and uncertainties. Bracing imposes restrictions on movement and positional freedom, and its long-term impact after removal remains unclear.

According to Gabos et al. [5], braces impose positional limitations, and the durability of correction after removal remains uncertain. Discomfort is a common complication, with incidence rates reaching up to 36% in various spinal regions [6,7]. Notably, the long-term impact of bracing on curve behavior is still poorly understood, as acknowledged by Aulisa et al. [8]. Furthermore, Danielsson et al. [9] in 2010 also highlighted the significant knowledge gaps in understanding the natural history of AIS after bracing and its quality-of-life implications. The current literature is characterized by predominantly short-term follow-up periods, whereas the long-term effects of bracing on curvature behavior and patient well-being remain understudied. Moreover, patient-reported outcome measures (PROMs) that capture satisfaction and perspective over extended periods are scarce. The marked heterogeneity in individual study results underscores the need for a well-powered meta-analysis to synthesize multicenter data, providing insight into critical issues such as long-term efficacy, pain impact, and natural history.

Previous reviews of bracing efficacy have significant limitations. A systematic review conducted in 2008 only provided descriptive comparisons of pre-post means across studies, omitting meta-analyses, long-term follow-up assessments, quality-of-life evaluations, or adherence to contemporary reporting standards such as Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) [10]. Another meta-analysis of 18 studies was restricted by its focus on skeletal maturity, resulting in follow-up periods as brief as 2–5 years, which is insufficient to address the durability of effects over decades [11].

Heterogeneous and predominantly short-term results from previous studies underscore the need for a rigorous meta-analysis synthesizing multicenter evidence. This review aims to systematically evaluate bracing effects, assessing curve progression using radiographic outcomes and patient-centered measures of functioning and well-being over an extended follow-up period of ≥10 years.

Materials and Methods

Eligibility criteria

This meta-analysis was registered in PROSPERO (CRD42024510983) and adhered to PRISMA guidelines [12]. The inclusion criteria were formulated using the PICOS (Population, Intervention, Comparison, and Outcome) framework. The population included patients with AIS having curves within the range of 30º–45º, treated nonsurgically, and followed up for ≥10 years. The 30º threshold was chosen based on evidence suggesting progression risk above this level [13]. Intervention/exposure and comparator were not applicable, as this review aimed to evaluate pre-post outcomes. Outcomes of interest included radiological progression, quality of life, and functionality. Both comparative studies and case series were eligible, focusing on patients meeting the specified inclusion criteria.

Studies were excluded based on the following criteria: patients with other types of scoliosis (syndromic, congenital, or neuromuscular), non-adolescent or adult populations, case reports, letters to the editor, duplicate or overlapping data, and incomplete or missing data.

Information sources and literature search methods

A comprehensive literature search was performed in PubMed, Embase, Scopus, and the Cochrane Collaboration Library databases. The search strategies employed medical subject headings and keywords related to “adolescent idiopathic scoliosis,” “treatment,” “bracing,” “spinal fusion,” “surgery,” and other relevant terms (Supplement 1). The reference lists of included papers were also manually screened to identify any additional studies. Article selection was conducted independently by two reviewers. Any discrepancies in study selection between reviewers were resolved by consensus. A third reviewer was available to arbitrate any persisting disagreements. No restrictions were placed on the publication date or language.

Data extraction and data items

Two reviewers independently extracted data from the selected studies. Any inconsistencies in data extraction between the two reviewers were resolved by consensus. Any disagreement was resolved by involving a third reviewer. The baseline characteristics extracted from the included articles comprised study region, study design, follow-up duration (years), treatment, sample size, sex distribution, and age at final follow-up. The details of bracing protocols including brace type, usage duration, and indication criteria were also collected. The main variables were mean curve progression, significant progression (≥5º), surgical requirement, sagittal plane progression (specifically, thoracic kyphosis [TK] and lumbar lordosis [LL]), and PROMs. The PROMs that could be analyzed were the number of patients with back pain, as well as scores from the following standardized questionnaires: General Function Score (GFS), Oswestry Disability Index (ODI), EuroQol-5D (EQ-5D), Visual Analog Scale, Scoliosis Research Society-22 (SRS-22), and Short Form-36 (SF-36) at the end of follow-up.

The GFS assesses physical functionality through 17 items, evaluating common activities such as stair navigation, sitting, standing, walking, and dressing, scored on a 0–2 scale and calculated as a percentage where 0% represents no disability and 100% represents maximal disability [14]. The ODI measures back pain-related impairment as a percentage, categorizing disability levels as follows: 0%–20% (minimal); 21%–40% (moderate, impacting strenuous activities); 41%–60% (severe, hindering most daily functions); 61%–80% (crippling, disability due to back pain); and 81%–100% (bed-bound with minimal symptom relief) [15]. The EQ-5D, SRS-22r, and SF-36 questionnaires assess generic and disease-specific quality of life. The EQ-5D yields a score from 0 to 1, with values near 1 reflecting high quality and 0 reflecting low quality [16]. The SRS-22r is scored on a scale of 1 to 5, with 1 representing low quality and 5 high quality [17]. The SF-36 interprets health status results from 0 to 100, where 100 signifies high quality or functionality and 0 indicates low quality [18].

Assessment of risk of bias in the included studies

The Methodological Index for Non-Randomized Studies (MINORS) criteria were used to evaluate the quality and risk of bias in the included studies (Table 1) [4,6,7,9,1928]. This 12-tem tool assesses methodological conduct and presentation, with scores ranging from 0–2, yielding a maximum score of 24 for comparative studies and 16 for single-group designs, indicating higher quality. The MINORS scores were categorized into four quality levels: very low, low, fair, and high. For non-comparative studies, scores of 0–4 corresponded to very low quality, 5–7 corresponded to low quality, 8–12 corresponded to fair quality, and ≥13 corresponded to high quality. For comparative studies, scores of 0–6 corresponded to very low quality, 7–10 corresponded to low quality, 11–15 corresponded to fair quality, and ≥16 correspond to high quality [19]. Two reviewers independently evaluated these scores. Any discrepancies in assessment between the two reviewers were resolved by consensus.

Assessment of the quality of studies through MINORS

Assessment of results

Meta-analyses were conducted using Review Manager ver. 5.4 software (Cochrane, London, UK). Continuous outcomes were pooled using mean differences (MDs) with 95% confidence intervals (CIs) [29]. Curve progression was analyzed using a pre-post intervention design, comparing initial and final timepoints. For dichotomous outcomes where standard error (SE) was not reported, it was calculated from incidence using the formula: SE=√I (1-i)/n and 95% CI=I±1.96×SE; where I=incidence [30]. Pooled incidences and 95% CIs were then calculated using random-effects and fixed-effect models, as appropriate. Heterogeneity among studies was assessed using the chi-square test and I2 statistic, with I2 values of >25%, 50%, and 75% reflecting low, moderate, and high heterogeneity, respectively. A fixed-effects model was utilized when there was no significant heterogeneity, otherwise a random-effects model was applied. Variables with insufficient data for meta-analysis were summarized qualitatively through narrative synthesis. Missing data were handled according to the Cochrane Handbook [31].

Risk of bias across the studies

Publication bias was assessed by examining funnel plot asymmetry. Using Review Manager software (Cochrane), individual funnel plots were generated for each outcome. These plots graphically illustrated the relationship between intervention effect estimates (x-axis) and their precision or SE (y-axis). In the absence of bias, the funnel plot assumes a symmetrical inverted shape, with the most precise studies (typically larger trials) clustered at the top, while those with less precision (smaller trials) are scattered toward the bottom, with increasing asymmetry as sample sizes decrease. Visual inspection of funnel plots enabled evaluation of publication bias, where the absence of smaller, less precise studies reporting non-significant or negative results can cause the funnel to skew, indicating potential publication bias.

Assessment of quality of evidence

The quality of evidence for each outcome was assessed using the Grading of Recommendations Assessment Development and Evaluation (GRADE) system [32]. Two authors (G.M. and I.K.) independently evaluated the evidence, following the GRADE profile.

In this system, the quality of studies is initially graded as high and can be downgraded due to (1) inconsistency, (2) risk of bias, (3) indirectness, (4) imprecision, and (5) publication bias. The GRADE system classifies the quality of evidence into four categories: high, moderate, low, and very low.

Additional analyses

Subgroup analyses were planned to explore differences based on the duration of follow-up. For the follow-up subgroup analysis, selecting a cutoff point that appropriately dichotomizes studies into balanced groups for comparison is important. Upon inclusion of all eligible studies, the distribution of follow-up durations was assessed. A natural break was identified at approximately the 20-year mark, where half of the studies had follow-ups below this time point and half extended beyond it. Therefore, using the threshold of 20 years ensured subgroups of comparable sizes for statistical analysis, reducing the risk of one group dominating and producing biased results due to an imbalance in sample distributions.

Results

Study selection

A total of 277 studies were retrieved through a search of electronic databases. After screening titles and abstracts, 251 studies were excluded due to duplication, non-relevance, or ineligibility. The remaining 26 studies underwent full-text review, resulting in the exclusion of 15 additional studies. These exclusions were made due to incorrect focus, incomplete data, duplicate studies with different follow-up periods (with the longest follow-up selected), or curve magnitudes outside the specified range of 30º–45º. This resulted in 11 studies. After screening the reference lists of the individual studies, two additional studies that met the inclusion criteria were also included. Finally, 13 studies were included in the meta-analysis (Fig. 1) [4,6,7,9,20,3239]. To avoid duplication and ensure independent observations, studies reporting on the same patient sample as an already included publication were excluded if they offered shorter follow-ups or additional variables that did not provide significant new evidence. This approach prevented overlapping study cohorts from introducing unwarranted duplication into the analyses.

Fig. 1

Study selection flow diagram (Preferred Reporting Items for Systematic Reviews and Meta-analyses).

Study characteristics

The baseline characteristics of the included studies are shown in Table 2 [4,6,7,9,2028]. All 13 studies included in this review had a retrospective design. These studies enrolled a total of 1,492 patients, with the follow-up periods ranging from 12 to 42 years. The average curves included at the initial time point of the studies ranged from 30° to 39°. Age and sex distributions are shown in Table 2.

Baseline data of 13 included studies

The bracing protocols employed in studies that reported this information are detailed in Supplement 2 [4,6,7,9,21,23,24,26,28,40]. Study quality was rated as high in 9/13 studies and fair in 4/13 studies (Table 1).

Curve progression and surgery

The pooled mean progression across the included studies was −1.76 (95% CI, −4.97 to 1.46; studies=11; I2=92%) with no significant differences observed pre- and post-follow-up (Fig. 2) [4,6,7,9,20,21,2326,28]. However, this estimate may be conservative, as patients with significant curve progression often undergo surgery, thereby preventing further deterioration. Consequently, the reported values likely underestimate the true progression, as surgical intervention effectively truncates the curve’s potential worsening. Ascani et al. [22] only reported the mean progression at the final follow-up, showing a progression of 11.6° at the final follow-up. Studies with follow-up duration of <20 years had less progression than those with >20 years of follow-up: (MD, 0.94; 95% CI, −3.58 to 5.47; studies=5; I2=95%) vs. (MD, −5.69; 95% CI, −11.66 to 0.29; studies=6; I2=87%). Analysis of studies with over 25 years of follow-up revealed significant curve progression at final follow-up (MD, −14.32; 95% CI, −20.14 to −8.50; studies=2; I2=0%).

Fig. 2

Forest plot showing mean curve progression before and after conservative treatment. SD, standard deviation; IV, inverse variance; CI, confidence interval; df, degree of freedom.

The percentage of patients considered as having significant progression (≥5°) was 35.68 (95% CI, 22.85 to 48.50; studies=6) (Fig. 3) [6,9,21,23,25,27]. Studies with <20 years of follow-up had a lower incidence of significant progression compared to those with over 20 years of follow-up: 32.89% (95% CI, 0.00 to 74.44) vs. 37.07% (95% CI, 15.25 to 58.90). On analysis of studies with >25 years of follow-up, the incidence of significant progression was 46.70% (95% CI, 46.52 to 46.88).

Fig. 3

Forest plot illustrating the percentage of patients with significant progression (≥5°). SE, standard error; IV, inverse variance; CI, confidence interval; df, degree of freedom.

The percentage of patients requiring surgery at final follow-up was 14.20% (95% CI, 0.87 to 27.53; studies=2) (Fig. 4) [26,27]. The small number of studies prevented subgroup analysis. These studies did not provide the justification and indications for surgical intervention.

Fig. 4

Forest plot demonstrating the percentage of patients requiring surgery at the end of follow-up. SE, standard error; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Sagittal profile

TK increased at the end of follow-up but not significantly (MD, −2.18; 95% CI, −5.26 to 0.91; participants=219; studies=3; I2=65%) (Fig. 5A) [20,23,24]. In all studies, the final TK remained within the normal range (20.9°–36.4°). LL decreased at the end of the follow-up but there were no significant differences (MD, 3.11; 95% CI, −3.18 to 9.39; participants=220; studies=3; I2=80%) (Fig. 5B) [20,23,24]. Similarly, all curves were within the normal range (44.0º–54.6º), except in the study by Simony et al. [20] (38.9º). The small number of studies providing this data prevented subgroup analysis.

Fig. 5

Forest plots showing thoracic kyphosis (A) and lumbar lordosis (B) at the end of the follow-up period. SD, standard deviation; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Quality of life and functionality

The percentage of patients with global back pain at the end of follow-up was 63.35 (95% CI, 38.36 to 88.34; studies=2) (Fig. 6) [6,7]. Table 3 shows the final mean scores of different quality-of-life scales in studies reporting this data [7,20,24,25,27,28]. Analysis of the cutoff points revealed that GFS lacked defined thresholds to establish the degree of improvement. The ODI values ranged between 7.0 and 11.8 points, indicating minimal disability. The EQ-5D demonstrated high quality in two studies and acceptable quality in one, suggesting a favorable overall quality of life. Regarding the SRS-22 scores, no cutoff values were established. For the SF-36, studies reported these variables had a mean age range of 39–41 years [7,20]. According to the literature, patients in this age group presented a quality of life between the 10th and 20th percentile for the physical component summary, indicating significantly impaired physical health (scale: 0=worse quality of life; 100=better quality of life). The mental component summary values fell between the 30th and 40th percentile.

Fig. 6

Forest plot showing the percentage of patients with pain at the end of the follow-up period. SE, standard error; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Quality of life and functionality assessed by different scalesa)

Publication bias

Fig. 7 presents the funnel plots assessing publication bias. Evidence of publication bias was found for two primary variables: curve progression (Fig. 7A) and percentage of patients with significant progression (Fig. 7B). This was supported by the asymmetry and clustering of studies in the upper part of the funnel.

Fig. 7

Funnel plot assessing publication bias for curve progression (A) and percentage of patients with significant progression (B). SE, standard error; MD, mean difference.

Quality of evidence

The certainty of evidence for the outcomes assessed in this meta-analysis varied. For curve progression and surgery, the evidence was of low certainty due to the significant risk of bias and substantial heterogeneity among studies. Similarly, the evidence was rated as very low for outcomes related to curve progression over 25 years and the percentage of patients requiring surgery. This downgrading was driven by serious concerns regarding inconsistency and variability in the results. The evidence for TK was of moderate certainty, reflecting some variability but a lack of statistical significance in the findings. In contrast, the evidence for LL was of low certainty, affected by substantial heterogeneity and a lack of significant change. Finally, the evidence for global back pain was of very low certainty, primarily due to high heterogeneity and differences in study designs and populations (Supplement 3).

Discussion

The curve progression and surgery rates in the studies included in this review varied depending on the follow-up duration. Longer follow-up periods were associated with an increased risk of progression over time. Studies with shorter follow-up durations (<20 years) reported less curve progression and lower surgical requirements than longer-term studies. However, studies with follow-up periods exceeding 25 years found a significant curve worsening in the final evaluation. The analysis revealed mixed results for quality of life and back pain outcomes. While some studies reported minimal disability and fair quality of life using the ODI, EQ-5D, and SF-36 scales, respectively, these patient-centered outcomes were assessed in only a limited number of studies. Bracing did not yield significant changes in sagittal profile, as measured by TK and LL. Both TK and LL remained within normal ranges at the end of treatment across the included studies.

Studies with follow-up periods of less than 20 years showed minimal curve progression, suggesting that the effects of conservative treatments might not be fully apparent in shorter study durations. Notably, when follow-up exceeded 25 years, significant progression was observed, with curves initially ranging from 30° to 45° progressing on average by 14°. By the end of 25 years, these curves reached between 44° and 59°, underscoring the potential need for reconsidering treatment strategies as patients age. This long-term progression underscores the need for extended monitoring beyond skeletal maturity to enable a comprehensive assessment of the efficacy of conservative therapies and better characterize their long-term impact on spinal health and quality of life. Given the increased life expectancy, it is imperative to continue ongoing vigilance and further long-term follow-up. These findings indicate that while initial treatments may delay progression, their effectiveness may wane over time, underscoring the need to evaluate long-term functional outcomes and patient satisfaction. Furthermore, similar brace protocol durations across studies, with limited variance, indicate a standardized approach to conservative treatments during the study period. However, the long-term impact of these treatments, particularly concerning patient functionality, quality of life, and satisfaction, requires further investigation and longer follow-up assessment.

Although our inclusion criteria specified curves measuring 30°–45º, based on the average curve measurements reported in the studies, individual patient data within these studies may have included curves slightly below 30º. Notably, there is no clear consensus regarding the impact of initial curve size on outcomes. For instance, Aulisa et al. [4] found no significant differences in outcomes based on the initial curve size. In contrast, Weinstein et al. [33] reported that curves exceeding 30° are more likely to progress. Meanwhile, Pellios et al. [41] observed that curves under 30° typically exhibit minimal progression. Our review revealed a critical knowledge gap in the literature, with remarkably few studies investigating the proportion of curves progressing beyond clinically meaningful thresholds, such as an increase of 5° or 6°. This metric is crucial to understanding the real-world impact of curve progression, as changes in average Cobb angle can be skewed by outliers. Unfortunately, only six of the 13 reviewed studies (less than 50%) provided this crucial data.

Our meta-analysis revealed that low skeletal maturity, assessed through various methods (Risser score, Sanders score, chronological age, menarche status, and triradiate cartilage status), consistently predicted brace treatment failure across six studies. This findings contrasts with non-meta-analytical literature reviews, such as that by Hawary et al. [13]. Additionally, several studies have identified an initial Cobb angle greater than 30°–40° as a risk factor for curve progression. However, the long-term implications of surgical interventions are crucial. For instance, procedures utilizing Cotrel-Dubousset instrumentation are associated with a significant long-term revision rate of 47.5% [36], highlighting some controversies surrounding this surgical method. These findings underscore the need for an in-depth evaluation of the impact of skeletal maturity on long-term outcomes. Our meta-analysis could not fully address this aspect due to insufficient data or lack of categorization of results based on this factor.

The existing literature lacks comprehensive information on complementary conservative strategies such as physiotherapy exercises. Although some studies have outlined brace protocols, none have thoroughly explicated complementary nonsurgical treatments that may impact outcomes. Furthermore, the included studies revealed that bracing does not significantly alter or correct sagittal alignment. This raises concerns about the potential risk of flatback syndrome without comprehensive multiplanar correction.

Studies have also not reported the specific causes necessitating surgery. It is reasonable to assume that progression drove many surgical indications, but quality-of-life impacts may also have factored during surgical decision-making. Capturing this context would enhance the comprehension of when and why treatment fails in different patients.

The correlation between brace adherence and stabilization of spinal conditions is well documented, with studies demonstrating less progression in patients wearing braces for over 20 hours daily [40,4,43]. However, the reported compliance rates, ranging from 54% to 70%, suggest that discomfort and psychological factors (such as body image concerns), which are often underreported, significantly limit adherence [44]. Suboptimal compliance presents a major challenge for achieving long-term stability after skeletal maturation. To address this issue, there is an ongoing debate and research into alternative therapies to enhance patient engagement and compliance. Innovative approaches, such as real-time feedback through sensor monitoring, more collaborative care models incorporating psychological support, and psychosocial interventions, are under exploration. Furthermore, researchers have suggested designing novel braces that enhance comfort and customization to improve adherence. While surgery is sometimes considered for non-compliant cases [37], the focus of our study and subsequent discussions should be on optimizing conservative treatment strategies to improve long-term outcomes for all patients, particularly by enhancing compliance through these innovative approaches.

Several studies have highlighted the importance of capturing PROMs to gain a comprehensive understanding of treatment impact. The study by Danielsson et al. [9], which included PROMs but only evaluated them at skeletal maturity, revealed that curves under 35° showed minimal differences from healthy controls. However, untreated or braced AIS patients reported significantly higher pain levels compared to controls. Danielsson et al. [9] also estimated that 70% of patients underwent unnecessary bracing, while surgery was avoided only in one in 10 cases (10%). Research on long-term PROMs for AIS patients during pregnancy is scarce, despite the majority being female. Most studies have focused on comparing conservative and surgical treatments, yielding no significant differences between the two approaches. Danielsson et al. [7] also reported greater functional limitations in braced AIS versus controls. SF-36 and ODI have emerged as reliable quality-of-life measures. Our findings indicate that patients with braced AIS experience impaired quality of life, as measured by SF-36, compared to the general population. Conversely, the ODI indicated minimal disability. Few studies have consistently evaluated the relationships between curve progression and patient-centered outcomes using PROMs. Although some studies have found no correlation at the individual level, capturing these metrics remains important for a comprehensive understanding of treatment impact.

Pulmonary function is a vital, yet frequently neglected aspect of evaluating AIS treatment outcomes. Despite this oversight, studies addressing this dimension have yielded crucial insights into the broader implications of both conservative and surgical interventions. Misterka et al. [21] found that vital capacity remained stable at the final follow-up in their study cohort; however, these findings provide only a limited window into true long-term respiratory outcomes [45].

A knowledge gap persists regarding the long-term effects of bracing on disc health. Given that AIS primarily affects adolescents during active growth, with a lengthy life expectancy ahead, understanding spinal degeneration risks is crucial. While bracing aims to mitigate progression by modulating vertebral biomechanics, the potential effects on intervertebral discs warrant investigation. Whether conservative options influence this outcome is unknown [46].

The studies reviewed in our analysis exhibited a strong element of selection bias. Frequently, patients exhibiting significant worsening of their scoliotic curves while undergoing bracing quickly transition to surgical interventions, which excludes them from the cohort analyzed for long-term curve progression under bracing alone. This practice likely skews the data, leading to an under-representation of the true extent of curve progression associated with bracing alone. Owing to this selection bias, the outcomes reported in these studies might reflect a best-case scenario rather than a comprehensive depiction of natural disease progression under nonsurgical management. It is likely that including more severe cases that progress despite bracing would significantly increase the reported average curve progression. However, withholding surgical intervention from deteriorating patients solely to observe natural disease progression raises profound ethical concerns. This selection bias undermines the validity of study outcomes, making it challenging to accurately evaluate the long-term efficacy of bracing as a standalone treatment.

Some limitations of this study should be acknowledged. The small sample size hindered robust subgroup analyses and adjusted modeling approaches. Furthermore, the retrospective nature of the data introduces potential bias compared to prospective designs. Furthermore, there was high heterogeneity (I2=92%) among the included studies regarding curve progression outcomes. Many studies focused on mean Cobb angle changes rather than reporting the clinically relevant proportion of patients progressing over specific thresholds, such as 5° or 6°. Future studies should prioritize this outcome to enable a direct comparison. Additionally, important factors such as skeletal maturity, curve location, and side were not adequately assessed. Furthermore, the exact timing of patient follow-ups was not consistently reported. Moreover, approximately half of the studies did not report data on curve progression, limiting the power of the meta-analysis. Furthermore, critical patient-reported factors were generally lacking and the determinants of low compliance were under-researched. The psychosocial impact of bracing and quality of life metrics were infrequently reported despite their essential role in enabling in-depth characterization of the treatment burden. Another major limitation of this study was the lack of baseline and very long-term PROMs, which preempts a comprehensive evaluation of the impact and effectiveness of conservative treatments on quality of life over time. Cost analyses that consider health spending and absenteeism related to nonsurgical care would provide valuable insights for decision-makers. Pulmonary function and the risk of spinal degeneration have rarely been addressed, preventing the evaluation of their systemic implications. Moreover, the rationale behind surgical decisions is often unclear, highlighting the need for distinguishing between surgeries driven by curve progression and those aimed at improving quality of life. Clarifying these distinctions would enhance clinical understanding, allowing for more informed treatment choices. Overall, follow-up tended to end at skeletal maturity, without a long-term assessment of adult outcomes. While individual studies offer preliminary insights, the presence of high unexplained heterogeneity and inadvertent omissions hinders conclusive comparisons of optimal management strategies across diverse presentations and time frames.

Conclusions

This review indicates that a significant proportion of patients with moderate curves (30°–45º) treated conservatively experienced a progression of their curvatures that surpassed established thresholds, raising concerns that the current treatment algorithms may not adequately address such variations. Long-term monitoring over 25 years revealed that conservative treatment often failed to halt progression, with a significant proportion of the patients ultimately requiring fusion surgery. These findings challenge the efficacy of existing treatment approaches and underscore the need for re-evaluation. Standardized criteria for transitioning from nonsurgical to definitive correction must be reassessed, with a focus on tailoring interventions to individual risk profiles. Surgical interventions with fusion require careful consideration. Additionally, there is suboptimal adherence to conservative options, highlighting the need for optimized approaches to minimize the need for bracing and surgical procedures with definitive fusion. Finally, the exclusion of patients fast-tracked for surgery due to significant curve worsening under bracing alone from original studies may have underestimated true curve progression. A more comprehensive study incorporating these cases could provide valuable insights. Considering factors such as patient selection, quality of life, cost-effectiveness, and technological advancements in fusionless techniques can provide deeper insights informing a personalized and balanced treatment approach.

Key Points

  • Long-term follow-up (>25 years) of conservatively treated adolescent idiopathic scoliosis with moderate curves (30°–45°) showed significant curve progression (average 14.32°).

  • The incidence of significant curve progression (≥5°) was 35.68% across all follow-up periods, increasing to 46.70% at >25 years follow-up.

  • Surgical intervention was required in 14.20% of conservatively managed patients during follow-up.

  • Back pain was reported by 63.35% of patients at final follow-up, despite minimal disability measured by standardized scales.

  • Sagittal alignment parameters (thoracic kyphosis and lumbar lordosis) remained within normal ranges at final follow-up, with no significant changes from baseline.

Notes

Conflict of Interest

No potential conflict of interest relevant to this article was reported.

Author Contributions

Conceptualization: EH, JB, IS, VG, MTSM, IK, GM, CB. Methodology: EH, JB, IS, VG. Data curation: EH, JB, IS, VG. Formal analysis: EH, JB, IS, VG. Visualization: EH, JB, IS, VG. Project administration: EH, JB, IS, VG. Writing–original draft preparation: EH, JB, IS, VG. Writing–review and editing: EH, JB, IS, VG. Supervision: EH, JB, IS, VG. Final approval of the manuscript: all authors.

Supplementary Materials

Supplementary materials can be available from https://doi.org/10.31616/asj.2024.01358.

Supplement 1. PubMed search strategy.

asj-2024-0358-Supplementary-1.pdf

Supplement 2. Brace protocol.

asj-2024-0358-Supplementary-2.pdf

Supplement 3. GRADE summary.

asj-2024-0358-Supplementary-3.pdf

References

1. Weinstein SL. The natural history of adolescent idiopathic scoliosis. J Pediatr Orthop 2019;39(6 Suppl 1):S44–6.
2. Johnson MA, Gohel S, Flynn JM, et al. “Will I Need a Brace?”: likelihood of curve progression to bracing range in adolescent idiopathic scoliosis”. Spine Deform 2022;10:537–42.
3. Aulisa AG, Giordano M, Falciglia F, Marzetti E, Poscia A, Guzzanti V. Correlation between compliance and brace treatment in juvenile and adolescent idiopathic scoliosis: SOSORT 2014 award winner. Scoliosis 2014;9:6.
4. Aulisa AG, Toniolo RM, Falciglia F, Giordano M, Aulisa L. Long-term results after brace treatment with Progressive Action Short Brace in adolescent idiopathic scoliosis. Eur J Phys Rehabil Med 2021;57:406–13.
5. Gabos PG, Bojescul JA, Bowen JR, Keeler K, Rich L. Long-term follow-up of female patients with idiopathic scoliosis treated with the Wilmington orthosis. J Bone Joint Surg Am 2004;86:1891–9.
6. Cochran T, Irstam L, Nachemson A. Long-term anatomic and functional changes in patients with adolescent idiopathic scoliosis treated by Harrington rod fusion. Spine (Phila Pa 1976) 1983;8:576–84.
7. Danielsson AJ, Wiklund I, Pehrsson K, Nachemson AL. Health-related quality of life in patients with adolescent idiopathic scoliosis: a matched follow-up at least 20 years after treatment with brace or surgery. Eur Spine J 2001;10:278–88.
8. Aulisa AG, Guzzanti V, Falciglia F, Galli M, Pizzetti P, Aulisa L. Curve progression after long-term brace treatment in adolescent idiopathic scoliosis: comparative results between over and under 30 Cobb degrees: SOSORT 2017 award winner. Scoliosis Spinal Disord 2017;12:36.
9. Danielsson AJ, Hasserius R, Ohlin A, Nachemson AL. Health-related quality of life in untreated versus brace-treated patients with adolescent idiopathic scoliosis: a long-term follow-up. Spine (Phila Pa 1976) 2010;35:199–205.
10. Maruyama T. Bracing adolescent idiopathic scoliosis: a systematic review of the literature of effective conservative treatment looking for end results 5 years after weaning. Disabil Rehabil 2008;30:786–91.
11. Dolan LA, Weinstein SL. Surgical rates after observation and bracing for adolescent idiopathic scoliosis: an evidence-based review. Spine (Phila Pa 1976) 2007;32(19 Suppl):S91–100.
12. Liberati A, Altman DG, Tetzlaff J, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. PLoS Med 2009;6:e1000100.
13. Hawary RE, Zaaroor-Regev D, Floman Y, Lonner BS, Alkhalife YI, Betz RR. Brace treatment in adolescent idiopathic scoliosis: risk factors for failure: a literature review. Spine J 2019;19:1917–25.
14. Hagg O, Fritzell P, Romberg K, Nordwall A. The General Function Score: a useful tool for measurement of physical disability: validity and reliability. Eur Spine J 2001;10:203–10.
15. Fairbank JC, Pynsent PB. The Oswestry Disability Index. Spine (Phila Pa 1976) 2000;25:2940–52.
16. Rabin R, de Charro F. EQ-5D: a measure of health status from the EuroQol Group. Ann Med 2001;33:337–43.
17. Aghdasi B, Bachmann KR, Clark D, et al. Patient-reported outcomes following surgical intervention for adolescent idiopathic scoliosis: a systematic review and meta-analysis. Clin Spine Surg 2020;33:24–34.
18. Vilagut G, Valderas JM, Ferrer M, Garin O, Lopez-Garcia E, Alonso J. Interpretation of SF-36 and SF-12 questionnaires in Spain: physical and mental components. Med Clin (Barc) 2008;130:726–35.
19. Slim K, Nini E, Forestier D, Kwiatkowski F, Panis Y, Chipponi J. Methodological index for non-randomized studies (minors): development and validation of a new instrument. ANZ J Surg 2003;73:712–6.
20. Simony A, Christensen SB, Carreon LY, Andersen MO. Radiological outcomes in adolescent idiopathic scoliosis patients more than 22 years after treatment. Spine Deform 2015;3:436–9.
21. Misterska E, Glowacki J, Okret A, Laurentowska M, Głowacki M. Back and neck pain and function in females with adolescent idiopathic scoliosis: a follow-up at least 23 years after conservative treatment with a Milwaukee brace. PLoS One 2017;12:e0189358.
22. Ascani E, Bartolozzi P, Logroscino CA, et al. Natural history of untreated idiopathic scoliosis after skeletal maturity. Spine (Phila Pa 1976) 1986;11:784–9.
23. Fang MQ, Wang C, Xiang GH, Lou C, Tian NF, Xu HZ. Long-term effects of the Cheneau brace on coronal and sagittal alignment in adolescent idiopathic scoliosis. J Neurosurg Spine 2015;23:505–9.
24. Farshad M, Gotschi T, Bauer DE, Boni T, Laux CJ, Kabelitz M. Long-term outcome of patients with adolescent idiopathic scoliosis seeking nonoperative treatment after a mean follow-up of 42 years. Spine Deform 2022;10:1331–8.
25. Haefeli M, Elfering A, Kilian R, Min K, Boos N. Nonoperative treatment for adolescent idiopathic scoliosis: a 10- to 60-year follow-up with special reference to health-related quality of life. Spine (Phila Pa 1976) 2006;31:355–67.
26. Montgomery F, Willner S, Appelgren G. Long-term follow-up of patients with adolescent idiopathic scoliosis treated conservatively: an analysis of the clinical value of progression. J Pediatr Orthop 1990;10:48–52.
27. Steen H, Pripp AH, Lange JE, Brox JI. Predictors for long-term curve progression after Boston brace treatment of idiopathic scoliosis. Eur J Phys Rehabil Med 2021;57:101–9.
28. Weigert KP, Nygaard LM, Christensen FB, Hansen ES, Bunger C. Outcome in adolescent idiopathic scoliosis after brace treatment and surgery assessed by means of the Scoliosis Research Society Instrument 24. Eur Spine J 2006;15:1108–17.
29. Wang J, Han B, Hai Y, Su Q, Chen Y. How helpful is the halo-gravity traction in severe spinal deformity patients?: a systematic review and meta-analysis. Eur Spine J 2021;30:3162–71.
30. Allagh KP, Shamanna BR, Murthy GV, et al. Birth prevalence of neural tube defects and orofacial clefts in India: a systematic review and meta-analysis. PLoS One 2015;10:e0118961.
31. Higgins JP, Thomas J, Chandler J, et al. Cochrane handbook for systematic reviews of interventions Hoboken (NJ): John Wiley & Sons; 2019.
32. Guyatt GH, Oxman AD, Vist GE, et al. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ 2008;336:924–6.
33. Weinstein SL, Dolan LA, Spratt KF, Peterson KK, Spoonamore MJ, Ponseti IV. Health and function of patients with untreated idiopathic scoliosis: a 50-year natural history study. JAMA 2003;289:559–67.
34. Nachemson AL, Peterson LE. Effectiveness of treatment with a brace in girls who have adolescent idiopathic scoliosis: a prospective, controlled study based on data from the Brace Study of the Scoliosis Research Society. J Bone Joint Surg Am 1995;77:815–22.
35. Karol LA, Virostek D, Felton K, Jo C, Butler L. The effect of the Risser stage on bracing outcome in adolescent idiopathic scoliosis. J Bone Joint Surg Am 2016;98:1253–9.
36. Mueller FJ, Gluch H. Cotrel-dubousset instrumentation for the correction of adolescent idiopathic scoliosis: long-term results with an unexpected high revision rate. Scoliosis 2012;7:13.
37. Li X, Huo Z, Hu Z, et al. Which interventions may improve bracing compliance in adolescent idiopathic scoliosis?: a systematic review and meta-analysis. PLoS One 2022;17:e0271612.
38. Liu J, Lavie CJ, Park YM, Bagiella E. Geographic variation and trends in prevalence of obesity among US adolescents, 2016–2021. Public Health 2023;223:128–30.
39. Wang Y, Beydoun MA, Min J, Xue H, Kaminsky LA, Cheskin LJ. Has the prevalence of overweight, obesity and central obesity levelled off in the United States?: trends, patterns, disparities, and future projections for the obesity epidemic. Int J Epidemiol 2020;49:810–23.
40. Brox JI, Lange JE, Gunderson RB, Steen H. Good brace compliance reduced curve progression and surgical rates in patients with idiopathic scoliosis. Eur Spine J 2012;21:1957–63.
41. Pellios S, Kenanidis E, Potoupnis M, et al. Curve progression 25 years after bracing for adolescent idiopathic scoliosis: long term comparative results between two matched groups of 18 versus 23 hours daily bracing. Scoliosis Spinal Disord 2016;11:3.
42. Rahman T, Bowen JR, Takemitsu M, Scott C. The association between brace compliance and outcome for patients with idiopathic scoliosis. J Pediatr Orthop 2005;25:420–2.
43. Seifert J, Selle A, Flieger C, Gunther KP. Compliance as a prognostic factor in the treatment of idiopathic scoliosis. Orthopade 2009;38:151–8.
44. Rahimi S, Kiaghadi A, Fallahian N. Effective factors on brace compliance in idiopathic scoliosis: a literature review. Disabil Rehabil Assist Technol 2020;15:917–23.
45. Lorente A, Barrios C, Burgos J, et al. Cardiorespiratory function does not improve 2 years after posterior surgical correction of adolescent idiopathic scoliosis. Spine (Phila Pa 1976) 2017;42:1391–7.
46. Akazawa T, Kotani T, Sakuma T, et al. Spinal fusion on adolescent idiopathic scoliosis patients with the level of L4 or lower can increase lumbar disc degeneration with sagittal imbalance 35 years after surgery. Spine Surg Relat Res 2017;1:72–7.

Article information Continued

Fig. 1

Study selection flow diagram (Preferred Reporting Items for Systematic Reviews and Meta-analyses).

Fig. 2

Forest plot showing mean curve progression before and after conservative treatment. SD, standard deviation; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Fig. 3

Forest plot illustrating the percentage of patients with significant progression (≥5°). SE, standard error; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Fig. 4

Forest plot demonstrating the percentage of patients requiring surgery at the end of follow-up. SE, standard error; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Fig. 5

Forest plots showing thoracic kyphosis (A) and lumbar lordosis (B) at the end of the follow-up period. SD, standard deviation; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Fig. 6

Forest plot showing the percentage of patients with pain at the end of the follow-up period. SE, standard error; IV, inverse variance; CI, confidence interval; df, degree of freedom.

Fig. 7

Funnel plot assessing publication bias for curve progression (A) and percentage of patients with significant progression (B). SE, standard error; MD, mean difference.

Table 1

Assessment of the quality of studies through MINORS

Study Clearly stated aim Conseutive patients Prospective collection data End-points Assessment endpoint Follow-up period Loss less than 5% Study size Adequate control group Contemporary group Baseline control Statistical analyses MINORS
Ascani et al. [22] (1986) 1 2 0 1 1 2 0 2 0 2 0 0 11
Aulisa et al. [4] (2021) 2 2 0 2 2 2 0 2 0 2 0 2 16
Cochran et al. [6] (1983) 1 2 0 1 1 2 0 2 0 2 0 0 11
Danielsson et al. [7] (2001) 2 2 0 2 2 2 2 2 1 2 1 2 20
Danielsson et al. [9] (2010) 2 2 0 2 2 2 1 1 1 2 1 2 18
Fang et al. [23] (2015) 2 2 0 2 2 2 0 1 - - - - 11
Farshad et al. [24] (2022) 2 2 0 2 2 2 2 2 1 2 1 2 20
Haefeli et al. [25] (2006) 2 2 0 2 2 2 2 2 2 2 2 2 22
Misterska et al. [21] (2017) 2 2 0 2 2 2 2 1 0 2 0 2 17
Montgomery et al. [26] (1990) 2 2 0 2 2 2 2 2 - - - - 14
Simony et al. [20] (2015) 2 2 0 2 2 2 0 1 0 2 0 2 15
Steen et al. [27] (2021) 2 2 0 2 2 2 0 2 1 2 1 2 18
Weigert et al. [28] (2006) 2 2 0 2 2 2 2 1 0 2 0 2 17

MINORS, Methodological Index for Non-Randomized Studies.

Table 2

Baseline data of 13 included studies

Study Region Type of study FU (yr) Treatment Skeletal maturity No. of patients Female (%) Age at final FU (yr) Curve initially (º)
Ascani et al. [22] (1986) Italy Retrospective cohort 36.0 Untreated NS 187 151 (80.8) 43.0 30.0–39.0
Aulisa et al. [4] (2021) Italy Retrospective cohort 13.4 Progressive action short brace Risser 0–4 163 163 (100.0) 32.2 30.0
Cochran et al. [6] (1983) Sweden Retrospective cohort 12.0 Milwaukee brace NS 85 81 (95.1) 24.0 30.0
Danielsson et al. [7] (2001) Sweden Retrospective cohort 22.2 Milwaukee brace or Boston brace NS 109 105 (96.3) 39.3 33.2
Danielsson et al. [9] (2010) Sweden Retrospective cohort 16.0 Boston brace Skeletal age of 10 to 15 yr 37 37 (100.0) 32.4 30.5
Fang et al. [23] (2015) China Retrospective series 24.4 Chêneau-type thoracolumbosacral orthosis Risser stage 0–2 32 29 (91.0) NS 30.6
Farshad et al. [24] (2022) Switzerland Retrospective cohort 42.0 Nonoperative NS 20 16 (80.0) 56.5 31.7
Haefeli et al. [25] (2006) Germany Retrospective cohort 23.0 Nonoperative (brace or physiotherapy) NS 121 102 (85.0) 37.8 30.0
Misterska et al. [21] (2017) Poland Retrospective cohort 27.8 Milwaukee brace NS 30 30 (100.0) 41.1 32.2
Montgomery et al. [26] (1990) Sweden Retrospective series ≥10.0 Brace NS 233 NS NS 33.2
Simony et al. [20] (2015) Denmark Retrospective cohort 24.5 Boston brace NS 66 62 (94.0) 41.4 37.5
Steen et al. [27] (2021) Norway Retrospective cohort 23.3 Boston brace Risser sign <3 365 339 (92.9) 23.3 33.6
Weigert et al. [28] (2006) Denmark Retrospective cohort 19.1 Boston brace NS 44 NS NS 33.6

FU, follow-up; NS, not specified.

Table 3

Quality of life and functionality assessed by different scalesa)

Study GFS ODI EQ-5D SRS-22 pain SRS-22 PF SRS-22 MH SRS-22 SI SRS-22 satisfaction SF-36 PCS SF-36 MCS
Steen et al. [27] (2021) (1)b) 6.5 (11.4) 7.7 (11.8) 0.83 (0.20) 4.1 (0.8) 4.1 (0.6) 4.2 (0.6) 3.8 (0.7) 3.8 (1.0) - -
Steen et al. [27] (2021) (2)b) 8.5 (12.3) 11.8 (13.1) 0.75 (0.24) 3.9 (0.9) 4.0 (0.7) 4.0 (0.7) 3.5 (0.7) 3.3 (1.0) - -
Danielsson et al. [7] (2001) 9.2 (12.6) 7.7 (9.0) - - - - - - 49.5 (47.7–51.3) 50.2 (48.2–52.1)
Farshad et al. [24] (2022) - 7.0 (7.9) - - - - - - - -
Haefeli et al. [25] (2006) - 11.4 (0.0–45.0) - - - - - - - -
Simony et al. [20] (2015) - - 0.82 (0.15) 3.73 (0.95) 4.21 (0.82) 3.99 (0.74) 3.60 (0.94) 3.46 (0.91) 47.9 (10.3) 50.38 (11.32)
Weigert et al. [28] (2006) - - 3.94 (0.77) 2.82 (0.95) 2.99 (0.76) 3.32 (0.74) - -

GFS, General Function Score; ODI, Oswestry Disability Index; EQ-5D, EuroQol-5D; SRS-22, Scoliosis Research Society-22; PF, Physical Function; MH, Mental Health; SI, Self-Image; SF-36, Short Form-36; PCS, physical component summary; MCS, mental component summary; (1), compliers; (2), non-compliers; -, not specified.

a)

The studies reported the mean with its standard deviation or confidence intervals.

b)

These data were collected from the study conducted by Brox et al. [40] which is the same study as Steen et al. [27] but included different endpoints.