ABSTRACT
Background: Cervical disc arthroplasty (CDA) is an established treatment for degenerative disc disease with radiculopathy and/or myelopathy. There is, however, little published evidence of its effectiveness to relieve pain and improve function in patients with a primary diagnosis of axial neck pain. Such patients were excluded from all previous Food and Drug Administration clinical trials for CDA. We compare the outcomes of patients who underwent CDA for 3 common cervical conditions from 2003 to 2018.
Methods: Seven hundred and eighty-two CDA patients at a single site were grouped by primary diagnosis: predominant axial neck pain (ANP) (n = 257), predominant radiculopathy (RAD) (n = 331), or a combination of both (ANP + RAD) (n = 195). Mixed models for repeated measures predicted and analyzed scores at all time points, adjusting for diagnosis group, time point, and, if statistically significant, number of operative levels and demographic characteristics. Outcome measures included the Neck Disability Index, numerical pain scales (intensity plus frequency), the Veterans RAND 12 Item Health Survey (VR-12) Physical Component Score, the Mental Component Score, and reoperations. Patients were assessed preoperatively and postoperatively at 6 weeks, 3 months, 6 months, 1 year, and annually thereafter.
Results: At baseline, arm pain scores in the ANP group were statistically lower (P = .0002) than in the RAD and ANP + RAD groups, consistent with preoperative diagnoses. Surgeries included 40.8% 1-level, 41.6% 2-level, 14.7% 3-level, and 2.9% 4-level. For all outcome measures, improvements were statistically significant from baseline to each postoperative time point without statistical difference between the 3 diagnosis groups. In total, 45/782 patients (5.8%) underwent a secondary surgery: 3.5% ANP, 5.8% RAD, and 8.7% ANP + RAD. Days to reoperation did not statistically differ between groups (P = .489).
Conclusions: Appropriately selected patients with predominant axial neck pain treated with CDA may achieve clinical and functional outcomes comparable to patients with a primary diagnosis of radiculopathy or of axial neck pain with concomitant radiculopathy.
Clinical Relevance: This study provides information that should help clinicians decide whether to offer CDA for patients with a primary diagnosis of axial neck pain and to appropriately counsel such patients about expected outcomes.
Level of Evidence: 4.
INTRODUCTION
Of the 328 conditions studied in the Global Burden of Disease 2016 Study, neck pain was ranked the sixth-highest cause of disability.1 From 2006 to 2016, the prevalence of disability caused by neck pain increased by 21.9%.1 Worldwide, nearly half of all individuals will experience an episode of severe neck pain over the course of their lifetimes.2 Most episodes of neck pain will resolve within 2 months, but almost 50% of individuals will continue to have some pain or frequent reoccurrences.3,4 A 2010 study found that, in their search for treatment, chronic neck pain sufferers had on average 21 visits with more than 5 types of providers.5 Similar to back pain, the treatment of neck pain ranges from conservative to surgical care. While the necessity for surgery is evident in cases of neurologic compression, its benefits are not well defined in cases of axial neck pain.6,7
Cervical disc arthroplasty (CDA) is now a well-accepted treatment for radiculopathy and myelopathy, with clinical trial long-term results showing safety and efficacy. However, axial neck pain is a constellation of symptoms without clinically significant radiculopathy or myelopathy, and its surgical treatment is still debated. Patients with this diagnosis were excluded from all previous Food and Drug Administration (FDA) clinical trials for CDA. One review was unable to conclude as to the benefits of anterior cervical discectomy and fusion (ACDF) to treat axial neck pain, while others reported significant improvements after ACDF.8,9 Separately, the Neck Pain Task Force reporting on the decade 2000–2010 found insufficient evidence to support either fusion or arthroplasty for the treatment of neck pain alone.10 More recently, it was reported that preoperative neck pain was a risk factor for persistent neck pain following cervical disc arthroplasty (CDA).11
Given the limited evidence regarding the effectiveness of CDA in relieving pain and improving function in patients with a primary complaint of axial neck pain, the purpose of this study is to document the outcomes of patients suffering from predominant axial neck pain due to degenerative disc disease and treated with CDA in clinical practice. These patients are compared to patients with radiculopathy or a combination of axial neck pain and radiculopathy in whom axial neck pain was not the primary complaint in order to assess whether results of CDA for primary axial neck pain are similar to those in patients for whom device clinical trials have previously established safety and efficacy.
MATERIALS AND METHODS
Patient Sample
Our research team developed and uses the proprietary SPIRITT database for all spine surgery cases. An approved protocol for assessing preoperative and validated patient-reported outcomes (PROs) is used for all patients. These data are gathered in clinic and remotely via Web-based questionnaires and stored in the database. Data entry codes are used in the database to distinguish primary complaint, with additional complaints and/or diagnoses coded separately. The measurements used for cervical disc disease in this study are described below. Following approval by a central institutional review board (Western Investigational Review Board, protocol no. 20080163, study no. SR-2013-02), the clinic database was queried to identify all patients who underwent CDA since 2003 with the relevant diagnosis codes. Patients included in this study were treated for 1 of the following diagnoses: predominant axial neck pain, predominant radiculopathy, or a combination of both axial neck pain and radiculopathy. Diagnosis was made by the attending surgeon based on the history and symptoms reported at the time of the patient's office visit and confirmed by physical examination and radiographic studies. The diagnosis of patients with painful cervical pathology can best be depicted as a continuum, with pure radicular symptoms (arm pain, decreased sensation, weakness) at one extreme and pure neck pain at the other. In clinical practice, very few instances of 1 in the complete absence of the other are observed. Radicular arm pain is the most common complaint seen in patients undergoing surgery for cervical degenerative conditions and serves as the starting point for all patient diagnoses. This study classified patients based on their predominant pain complaint, with many patients reporting a relatively equal combination of arm and neck pain symptoms. On preoperative assessment, the group with predominant neck pain had no or dramatically less arm pain than the other 2 groups. Axial neck pain is ill defined in the literature. For this study, axial neck pain is defined as pain contained within the distribution from the occiput to the C7 spinous process, with or without headaches, that radiates to the trapezius muscle but not to the shoulders and arms, without significant radicular symptoms.
Before surgery, all patients had failed at least 6 weeks of conservative measures, including rest, education, physical therapy or chiropractic care, nonsteroidal anti-inflammatory drugs, and/or injections. Patients with severe radiculopathy unresponsive to initial nonoperative treatment may have gone on to surgery more quickly. For this analysis, the patients were divided into 3 groups: predominant axial neck pain (ANP), predominant radiculopathy (RAD), and axial neck pain with concomitant radiculopathy (ANP + RAD). Those with severe myelopathy, traumatic fracture, or ossification of the posterior longitudinal ligament were excluded because these are different disease processes and generally not considered good candidates for CDA. For the axial neck pain group, operative segments were selected based on high-resolution 3T magnetic resonance imaging with foraminal views. Segments chosen had pathology that correlated with subjective complaints. Structural disc pathology, including annular changes, herniations, degenerations, and osteophytes, were also considered as possibilities for the source of neck pain. Severe facet arthropathy was ruled out by computed tomography.
Over the time period used for this study, there was considerable development and evolution of CDA devices, with at least 7 different devices having received FDA marketing approval following clinical trials. Therefore, a variety of different devices were used in this study population as they became available. Choice of device was not related to the diagnoses included here.
From April 2003 to May 2018, 825 patients underwent CDA at 1 or more levels from C3 to C7, with 782 meeting inclusion criteria for this study: 257 for predominant axial neck pain, 331 for predominant radiculopathy, and 195 for combined axial neck pain with radiculopathy (Figure 1). Demographic, surgical, and outcomes data were retrieved from the database for these patients.
Demographic and Surgical Information
Basic demographic information and surgical characteristics were collected: age, sex, body mass index (BMI), race/ethnicity, surgical levels, operative time, turnover time, intravenous fluids, and estimated blood loss.
PROs
Patients completed validated self-reported measures: Neck Disability Index (NDI),12 Veterans RAND 12 Item Health Survey (VR-12)13 Physical Component Score (PCS) and Mental Component Score (MCS), and numerical rating scales (intensity + frequency, 0–20 scale) for neck and arm pain.14 These measures are those most used in FDA device clinical trials for CDA and are routinely collected on our cervical spine surgery cases. Preoperative and postoperative (6 weeks, 3 months, 6 months, 1 year, and annually thereafter) scores from these PROs questionnaires were retrieved.
Secondary Surgeries
The timing, reason, and type of all secondary surgeries to date were recorded.
STATISTICAL METHODS
Comparisons of demographic characteristics, surgical characteristics, and preoperative PRO scores among the three diagnosis groups were performed using Pearson's chi-square test for categorical data and 1-way analysis of variance with the Student-Newman-Keuls post hoc test for continuous data.
Mixed models for repeated measures (MMRM) statistical analyses were used to assess the effect of diagnosis, time point, and statistically significant covariates (age at surgery, sex, BMI, and number of surgical levels) on each PRO score. MMRM effectively analyze longitudinal data with repeated measures at irregularly timed intervals (eg, weekly, monthly, and yearly) and missing dependent variable data points to produce a score prediction model. Each model then outputs a predicted PRO score for each patient at every time point as long as there are no missing independent variables. Analysis of repeated measures across time using mixed models is common in medical and biological studies and is considered less likely to lead to misinterpretation of results than some of the other methods used to handle missing data.15–17 Additionally, these models use all available data, as there is no need to exclude subjects who do not have data at every interval. Therefore, the mixed model was deemed particularly appropriate for this study, which included more than 5000 data points across multiyear intervals for each outcome measure (Table 1). Although the number of patients with postsurgical follow-up differed at any given time interval, the vast amounts of data overall were sufficient to produce score predictions out to 7 years follow-up. Statistical analysis was performed using SAS 9.4 (SAS Institute Inc, Cary, North Carolina).
RESULTS
Patient Characteristics (Table 2)
The sex distribution was statistically different between the three groups: 47% male in ANP, 60% male in RAD, and 57% male in ANP + RAD (P ≤ .007), as was mean age (ANP = 43.2 ± 10.0 years, RAD = 45.6 ± 9.2, and ANP + RAD = 47.4 ± 9.9; P ≤ .001). In addition, there is a difference in overall distribution of ethnicity between groups (P ≤ .03), most pronounced in the small number of patients in each group who were not White or African American.
Surgeries were performed at up to 4 levels: 40.8% 1-level, 41.6% 2-level, 14.7% 3-level, and 2.9% 4-level. The number of surgical levels was statistically different between groups (P < .001); the majority of surgeries were 1-level in the RAD group (59.7%) and multilevel in the other 2 groups. Although CDA at more than 2 levels is generally considered “off label,” it is not uncommon in clinical practice, so these patients have been included.18 Operative time also differed between the groups, driven primarily by number of operated levels, with the longest mean time for the combined ANP + RAD group.
Preoperative Scores (Table 3)
Preoperatively, NDI score was statistically different (P = .034) between the groups overall and specifically differing between ANP (mean = 55.5) and RAD (mean = 52.1), with ANP + RAD not differing significantly from either of the other 2 groups (mean = 53.1).
Mean arm pain scores were, as expected, also statistically different between the 3 diagnosis groups (P < .001), with post hoc testing finding no statistical differences between the RAD and ANP + RAD groups, in both of which arm pain scores were significantly greater than in the ANP group, corroborating the diagnoses based on clinical presentation.
There were no statistically significant differences between diagnosis groups for the remaining preoperative PRO scores: VR-12 PCS, VR-12 MCS, and neck pain intensity/frequency.
MMRM
For each of the outcome measures, a mixed-effects regression was performed, controlling for significant patient characteristics. That is, age at surgery, sex, BMI, and number of surgical levels were included in the MMRM only if that variable significantly contributed to the prediction of the outcome variable. Figure 2 provides an example of the fit between actual scores and predicted scores for the NDI outcome regression model. The figure shows a good fit between the predicted and actual observed scores. Figure 3 shows the results of the MMRM for all 5 of the outcome measures, showing the mean predicted scores at each time interval for each group.
NDI Score (Figure 3A)
In addition to intercept, diagnosis group, and time point, which are always included, the NDI score mixed model includes number of surgical levels (P = .008) and BMI (P = .015). As the number of surgical levels increases, so does NDI score; similarly, as BMI increases, so does NDI. The mean predicted NDI score for each diagnosis group at each time point portrays a significant improvement from baseline for all 3 groups over time (P < .001 for all time points) after controlling for all other variables in the model. NDI scores did not statistically differ among the 3 diagnosis groups after controlling for all other variables in the model (P = .606). At 6 weeks postoperative, a greater than 15-point improvement in mean NDI score is shown for all 3 diagnosis groups, with continued score improvement at each time point until a slight increase at 7 years postoperative.
Neck Pain (Figure 3B)
The mixed-effects model for neck pain includes intercept, diagnosis group, time point, and sex (P = .015), favoring females. Mean predicted neck pain scores were statistically lower at each follow-up time point compared to baseline (P < .001 for all time points) after controlling for all other variables in the model. There were no statistical differences between diagnosis groups (P < .17). A dramatic initial drop-off of greater than 50% in neck pain scores is seen from baseline to 6 weeks postoperatively, and neck pain scores continued to remain low over time.
Arm Pain (Figure 3C)
In addition to intercept, diagnosis group, and time point, age at surgery (P < .001) and BMI (P = .003) were also included in the mixed model for arm pain. After controlling for all other variables, mean predicted arm pain is statistically different from baseline to follow-up (P < .001 for all time points), demonstrating significant postoperative improvement. Predicted arm pain scores are not statistically different between the 3 diagnosis groups after controlling for all other variables in the model (P = .127). Figure 3C shows a large drop-off in mean postoperative arm pain from baseline to 6 weeks postoperative, and predicted scores remain low out to 7 years.
VR-12 PCS (Figure 3D)
In addition to intercept, diagnosis group, and time point, the VR-12 PCS mixed-effects model includes age at surgery (P < .001), sex (P = .013), and BMI (P < .001). That is, as BMI and age at surgery increase, the PCS decreases, while PCS scores are predicted to be higher for females than for males. After controlling for all other variables in the model, predicted PCS is statistically improved from baseline to follow-up at each time point (P < .001 for all time points) and not statistically different between the three diagnosis groups (P = .349). The mean predicted PCS portrays a steady improvement over time for all diagnosis groups but with a decline at 72 months postoperative.
VR-12 MCS (Figure 3E)
The model for VR-12 MCS adjusts for the intercept, diagnosis group, and time point only. Number of surgical levels, age at surgery, BMI, and sex were tested, but none of these factors contributed significantly to the prediction of MCS. In this model, MCS is statistically improved from baseline to follow-up for each time point (P < .001 for all time points) but not statistically different between the three diagnosis groups (P = .482). The predicted means for each diagnosis group show an initial dramatic predicted MCS improvement from baseline to 6 weeks postoperative and with incremental improvement seen out to 7 years after surgery.
Secondary Surgeries (Table 4)
A total of 45 patients (5.8%) underwent a secondary surgery. Of those, 11 (24.4%) experienced a new injury or trauma, unrelated to their initial symptoms or index treatment, that led to their additional surgery. The proportion of patients undergoing secondary surgeries was not statistically different between the 3 groups, although there was a trend (P = .06) toward a slightly higher rate in the ANP + RAD group at 8.7%. The most frequent types of secondary surgery were revision surgery at the index level (48.9%) and surgery at the adjacent segment (40.0%). Of these 45 patients, 4 (8.9%) also underwent a third cervical surgery (2 RAD and 2 ANP + RAD).
The average time elapsed between the index surgery and the second surgery was 1.7 ± 2.3 years for ANP, 3.2 ± 3.7 years for RAD, and 2.8 ± 2.7 years for ANP + RAD. Secondary surgeries appeared to occur sooner in ANP patients, but the difference in time intervals was not statistically different (P = .489) between the 3 groups. This is possibly due to the small number of patients with secondary surgeries and to large standard deviations.
DISCUSSION
To date, this is the largest known sample with the longest follow-up period of patients suffering from predominant axial neck pain and treated with CDA. Pain and disability levels decreased by about half in the 3 diagnosis groups without favoring 1 diagnosis group over another. The reduction of symptoms is commensurate with the improvement reported in 2 studies of patients suffering from axial neck pain who were treated with ACDF: in 1, reduction in mean NDI score from 58.8 to 30.7 and in mean neck pain from 8.4 to 3.8 was reported for a sample of 87 patients with an average follow-up of 4.4 years.19 In another study, a sample of 38 patients with axial neck pain reported a decrease in mean NDI score from 57.5 to 38.9 and in mean neck pain score from 8.3 to 4.1 at an average of 4.4 years after ACDF.20
The improvements in PROs of ANP patients in the current study were not statistically different from the improvements reported by RAD patients. Similar results were found in a study comparing the outcomes of patients with axial neck pain (n = 41) and patients with radiculopathy (n = 161) who underwent ACDF. After an average follow-up of 3.3 years, the axial neck pain patients reported a decrease in mean NDI score from 32.6 to 19.8 and mean neck pain from 6.9 to 2.9, while the radiculopathy patients reported an NDI decrease from 37.0 to 22.4 and neck pain decrease from 6.8 to 3.3.8
To our knowledge, only 1 study has previously investigated the outcomes of CDA for patients with axial neck pain.11 In that study, 24.4% (14/45) of axial neck pain patients reported persistent neck pain (at 3 months or later) after CDA. Compared to patients with radiculopathy and/or myelopathy, axial neck pain patients were 3 times more likely to experience persistent neck pain after CDA. In contrast, neck pain scores in the current study improved nearly 50% by 6 weeks and remained very low up to 7 years postoperative. The decrease in neck pain scores for ANP patients was not statistically different from the neck pain decrease for RAD and ANP + RAD patients, indicating similarly significant improvement.
The current study is a retrospective analysis of the prospectively collected data of a single treatment group from a single institution. As such, it lacks the strength of prospective, randomized, controlled studies. Other potentially confounding factors that might be related to axial neck pain, such as cervical curvature and disc motility, were not included in the current study, nor were some patient characteristics that could potentially affect PROs, such as precipitating cause of injury and duration of symptoms. Future studies will benefit from consistent criteria for classifying patients as having axial neck pain as the primary complaint. In clinical practice, this is based on the history and symptoms reported at the time of the patient's office visit but confirmed by physical examination and radiographic studies. But rarely is the patient complaint strictly all or none when it comes to neck pain. A consistent and/or more objective approach to defining primary axial neck pain, such as the definition we offer in the “Methods” section, will help in making comparisons across the literature. However, there is a clear need for additional evidence regarding the surgical treatment of axial neck pain, particularly with CDA. For that very reason and despite the nonrandomization, the results of this study provide valuable information about the effectiveness of CDA in relieving pain and improving function in patients with a primary diagnosis of axial neck pain. A major strength of this study is the large number of patients, the largest study of its kind to date, as well as the long-term follow-up using a database of regular assessment with standardized measurement tools. Even though not all subjects had follow-up at every interval, a validated statistical method was used that allows for missing data as well as control of possible confounding factors. This statistical method, MMRM, is particularly well suited to studies with large quantities of data collected over many time intervals, and in the present study, enough data points were available on enough subjects at every interval to provide predicted values out to 7 years.
CONCLUSIONS
The results of this study indicate that appropriately selected patients with predominant axial neck pain treated with CDA may achieve success with clinical and functional outcomes comparable to patients with a primary diagnosis of radiculopathy or of axial neck pain with concomitant radiculopathy. Further studies will provide additional confirmation of the effectiveness of CDA in relieving pain and improving function in patients with a primary diagnosis of axial neck pain.
ACKNOWLEDGMENTS
The authors thank Karen I. Berliner, PhD, for help in manuscript preparation, content suggestions, and editing.
Footnotes
Disclosures: The authors report the following potential or perceived conflicts. M.F.G. reports consulting for Aesculap and Medtronic; stock ownership in Bonovo, International Spine & Orthopedic Institute, LLC, Nocimed, OuroBoros; royalties from Medtronic and RTI. K.M.S. has no conflicts to disclose. F.W.S. reports stock ownership in Nocimed.
- This manuscript is generously published free of charge by ISASS, the International Society for the Advancement of Spine Surgery. Copyright © 2021 ISASS