Given the unsustainable costs of US health care, universal agreement exists among payers, regulatory agencies, and other health care stakeholders that reform must include substantial improvements in the quality, effectiveness, and value of health care delivery. The Institute of Medicine and the American Recovery and Reinvestment Act of 2009 have called for the establishment of prospective registries to capture patient-centered data from real-world practice as a high priority to guide evidence-based reform. As a result, the American Association of Neurological Surgeons launched the National Neurosurgery Quality and Outcomes Database (N2QOD) and began enrolling patients in March 2012 into its initial pilot project: a web-based lumbar spine module. As a nationwide, prospective longitudinal registry utilizing patient-reported outcome instruments, the N2QOD lumbar spine surgery pilot aims to systematically measure and aggregate surgical safety and 1-year postoperative outcome data from approximately 30 neurosurgical practices across the US with the primary aim of demonstrating the feasibility and validity of standardized 1-year outcome measurement from everyday real-world practice. At the end of the pilot year, 1) risk-adjusted modeling will be developed for the safety, quality, and effectiveness of lumbar surgical care (morbidity, readmission, improvements in pain, disability, quality of life, and return to work); 2) data integrity and validation will be demonstrated via internal quality control analyses and auditing; and 3) the feasibility of obtaining a high level of follow-up (~80%) of nationwide 1-year outcome measurement will be established. N2QOD will use only prospective clinical data, will avoid the use of administrative data proxies, and will rely on neurosurgically relevant risk factors for risk adjustment.
Once national benchmarks of quality and effectiveness are accurately established and validated utilizing practice-based data extractors in the pilot year, N2QOD aims to introduce non–full-time employee (FTE)–dependent methodologies such as electronic medical record auto-extraction. N2QOD's non–FTE-dependent methodologies can then be validated against practice-based data extractor–derived measures of safety and effectiveness with the aim of more rapid expansion into the majority of US practice groups. The general overview, methods, and registry design of the N2QOD pilot year (lumbar module) are presented here.
Matthew J. McGirt, Theodore Speroff, Robert S. Dittus, Frank E. Harrell Jr. and Anthony L. Asher
Matthew F. Gornet, J. Kenneth Burkus, Mark E. Shaffrey, Perry J. Argires, Hui Nian and Frank E. Harrell Jr.
This study compared the safety and efficacy of treatment with the PRESTIGE LP cervical disc versus a historical control anterior cervical discectomy and fusion (ACDF).
Prospectively collected PRESTIGE LP data from 20 investigational sites were compared with data from 265 historical control ACDF patients in the initial PRESTIGE Cervical Disc IDE study. The 280 investigational patients with single-level cervical disc disease with radiculopathy and/or myelopathy underwent arthroplasty with a low-profile artificial disc. Key safety/efficacy outcomes included Neck Disability Index (NDI), Neck and Arm Pain Numerical Rating Scale scores, 36-Item Short Form Health Survey (SF-36) score, work status, disc height, range of motion, adverse events (AEs), additional surgeries, and neurological status. Clinical and radiographic evaluations were completed preoperatively, intraoperatively, and at 1.5, 3, 6, 12, and 24 months postoperatively. Predefined Bayesian statistical methods with noninformative priors were used, along with the propensity score technique for controlling confounding factors. Analysis by independent statisticians confirmed initial statistical findings.
The investigational and control groups were mostly similar demographically. There was no significant difference in blood loss (51.0 ml [investigational] vs 57.1 ml [control]) or hospital stay (0.98 days [investigational] vs 0.95 days [control]). The investigational group had a significantly longer operative time (1.49 hours vs 1.38 hours); 95% Bayesian credible interval of the difference was 0.01–0.21 hours. Significant improvements versus preoperative in NDI, neck/arm pain, SF-36, and neurological status were achieved by 1.5 months in both groups and were sustained at 24 months. Patient follow-up at 24 months was 97.1% for the investigational group and 84.0% for the control group. The mean NDI score improvements versus preoperative exceeded 30 points in both groups at 12 and 24 months. SF-36 Mental Component Summary superiority was established (Bayesian probability 0.993). The mean SF-36 PCS scores improved by 14.3 points in the investigational group and by 11.9 points in the control group from baseline to 24 months postoperatively. Neurological success at 24 months was 93.5% in the investigational group and 83.5% in the control group (probability of superiority ~ 1.00). At 24 months, 12.1% of investigational and 15.5% of control patients had an AE classified as device or device/surgical procedure related; 14 (5.0%) investigational and 21 (7.9%) control patients had a second surgery at the index level. The median return-to-work time for the investigational group was 40 days compared with 60 days for the control group (p = 0.020 after adjusting for preoperative work status and propensity score). Following implantation of the PRESTIGE LP device, the mean angular motion was maintained at 12 months (7.9°) and 24 months (7.5°). At 24 months, 90.0% of investigational and 87.7% of control patients were satisfied with the results of surgery. 
PRESTIGE LP superiority on overall success (without disc height success), a composite safety/efficacy end point, was strongly supported with 0.994 Bayesian probability.
This device maintains mean postoperative segmental motion while providing the potential for biomechanical stability. Investigational patients reported significantly improved clinical outcomes compared with baseline, at least noninferior to ACDF, up to 24 months after surgery.
Sharad Goyal, Dheerendra Prasad, Frank Harrell Jr., Julie Matsumoto, Tyvin Rich and Ladislau Steiner
Object. The goal of this study was to evaluate the effectiveness and limitations of gamma knife surgery (GKS) in the treatment of intracranial breast carcinoma lesions.
Methods. A retrospective analysis of the GKS database at the University of Virginia Health System identified 43 patients with a total of 84 lesions who were treated between 1989 and 2000. All patients who received treatment were included in this study. Imaging studies were available in 35 patients with 67 treated lesions.
Results. The overall duration of median survival was 13 months (95% confidence interval [CI] 7–16 months) after radiosurgery. A univariable Cox regression analysis revealed that a single lesion (p = 0.035), a high Karnofsky Performance Scale (KPS) score (p = 0.019), and a high Score Index for Radiosurgery (SIR) in Brain Metastases (p = 0.036) were associated with a significantly lengthened time to local treatment failure. The median duration of survival for patients grouped according to the SIR as low, middle, and high was 3, 8, and 21 months, respectively (p = 0.00033). A multivariable analysis showed that a high KPS score (p = 0.006), a high SIR (p = 0.014), and advanced age (p = 0.038) were predictive of survival. The 1-, 2-, 3-, and 5-year survival rates were 49, 23, 12, and 2%, respectively.
The overall median time to local treatment failure was 10 months (95% CI 6–14 months) after GKS. A univariable analysis demonstrated that a single lesion, a higher KPS score, and a higher SIR were associated with a significantly longer time until local treatment failure. A multivariable analysis showed that a higher KPS score, a higher SIR, and receipt of chemotherapy were associated with a significantly longer time to local treatment failure.
Neuroimaging scores given for the enhancement pattern (ring-enhancing, heterogeneous, and homogeneous signal), amount of necrosis (none, < 50%, and > 50%), and mass effect (none, mild, moderate, and severe) of each treated lesion did not correlate with survival or local treatment failure.
Conclusions. The SIR and the KPS score are prognostic factors in patients whose intracranial breast cancer metastases are treated with GKS. The SIR, which includes the KPS score, patient age, systemic disease status, largest lesion volume, and number of lesions, can be used to identify those patients with breast cancer metastasis who would benefit from GKS better than KPS score alone. The contribution of whole-brain radiation therapy to GKS with regard to local tumor control or survival could not be identified.
Anthony L. Asher, Clinton J. Devin, Brandon McCutcheon, Silky Chotai, Kristin R. Archer, Hui Nian, Frank E. Harrell Jr., Matthew McGirt, Praveen V. Mummaneni, Christopher I. Shaffrey, Kevin Foley, Steven D. Glassman and Mohamad Bydon
In this analysis the authors compare the characteristics of smokers with those of nonsmokers using demographic, socioeconomic, and comorbidity variables. They also investigate which of these characteristics are most strongly associated with smoking status. Finally, the authors investigate whether the association between known patient risk factors and disability outcome is differentially modified by patient smoking status for those who have undergone surgery for lumbar degeneration.
A total of 7547 patients undergoing degenerative lumbar surgery were entered into a prospective multicenter registry (Quality Outcomes Database [QOD]). A retrospective analysis of the prospectively collected data was conducted. Patients were dichotomized as smokers (current smokers) and nonsmokers. A multivariable logistic regression model was fitted for patient smoking status, and variable importance was then measured to identify the patient characteristics most strongly associated with smoking status. Multivariable linear regression models for 12-month Oswestry Disability Index (ODI) scores were fitted in subsets of smokers and nonsmokers to investigate whether differential effects of risk factors by smoking status might be present.
In total, 18% (n = 1365) of patients were smokers and 82% (n = 6182) were nonsmokers. In a multivariable logistic regression analysis, the factors significantly associated with patients’ smoking status were sex (p < 0.0001), age (p < 0.0001), body mass index (p < 0.0001), educational status (p < 0.0001), insurance status (p < 0.001), and employment/occupation (p = 0.0024). Patients with diabetes had lower odds of being a smoker (p = 0.0008), while patients with coronary artery disease had greater odds of being a smoker (p = 0.044). Patients’ propensity for smoking was also significantly associated with higher American Society of Anesthesiologists (ASA) class (p < 0.0001), anterior-alone surgical approach (p = 0.018), greater number of levels (p = 0.0246), decompression only (p = 0.0001), and higher baseline ODI score (p < 0.0001). In a multivariable proportional odds logistic regression model, the adjusted odds ratio of risk factors and direction of improvement in 12-month ODI scores remained similar between the subsets of smokers and nonsmokers.
Using a large, national, multiinstitutional registry, the authors described the profile of patients who undergo lumbar spine surgery and its association with their smoking status. Compared with nonsmokers, smokers were younger, male, nondiabetic, nonobese patients presenting with leg pain more so than back pain, with higher ASA classes, higher disability, less education, more likely to be unemployed, and with Medicaid/uninsured insurance status. Smoking status did not affect the association between these risk factors and 12-month ODI outcome, suggesting that interventions for modifiable risk factors are equally efficacious between smokers and nonsmokers.
Matthew F. Gornet, Todd H. Lanman, J. Kenneth Burkus, Scott D. Hodges, Jeffrey R. McConnell, Randall F. Dryer, Anne G. Copay, Hui Nian and Frank E. Harrell Jr.
The authors compared the efficacy and safety of arthroplasty using the Prestige LP cervical disc with those of anterior cervical discectomy and fusion (ACDF) for the treatment of degenerative disc disease (DDD) at 2 adjacent levels.
Patients from 30 investigational sites were randomized to 1 of 2 groups: investigational patients (209) underwent arthroplasty using a Prestige LP artificial disc, and control patients (188) underwent ACDF with a cortical ring allograft and anterior cervical plate. Patients were evaluated preoperatively, intraoperatively, and at 1.5, 3, 6, 12, and 24 months postoperatively. Efficacy and safety outcomes were measured according to the Neck Disability Index (NDI), Numeric Rating Scales for neck and arm pain, 36-Item Short-Form Health Survey (SF-36), gait abnormality, disc height, range of motion (investigational) or fusion (control), adverse events (AEs), additional surgeries, and neurological status. Treatment was considered an overall success when all 4 of the following criteria were met: 1) NDI score improvement of ≥ 15 points over the preoperative score, 2) maintenance or improvement in neurological status compared with preoperatively, 3) no serious AE caused by the implant or by the implant and surgical procedure, and 4) no additional surgery (supplemental fixation, revision, or nonelective implant removal). Independent statisticians performed Bayesian statistical analyses.
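The four-part definition of overall success above is a composite end point: a patient counts as a success only if every criterion is met. A minimal Python sketch of that rule follows. The class and field names are hypothetical and chosen only for illustration, and the sketch assumes that NDI improvement is computed as the preoperative score minus the follow-up score (lower NDI meaning less disability). It is not the trial's analysis code.

```python
from dataclasses import dataclass

# Threshold taken from criterion 1 in the text (NDI improvement of >= 15 points).
NDI_IMPROVEMENT_THRESHOLD = 15.0


@dataclass
class PatientOutcome:
    """Hypothetical per-patient record; field names are illustrative only."""
    ndi_preop: float                    # preoperative NDI score
    ndi_followup: float                 # NDI score at the follow-up visit
    neuro_maintained_or_improved: bool  # criterion 2
    serious_implant_related_ae: bool    # criterion 3 (must be False)
    additional_surgery: bool            # criterion 4 (must be False)


def overall_success(p: PatientOutcome) -> bool:
    """Return True only when all four criteria are satisfied."""
    # Assumption: lower NDI is better, so improvement = preop - follow-up.
    ndi_success = (p.ndi_preop - p.ndi_followup) >= NDI_IMPROVEMENT_THRESHOLD
    return (
        ndi_success
        and p.neuro_maintained_or_improved
        and not p.serious_implant_related_ae
        and not p.additional_surgery
    )


def success_rate(patients: list[PatientOutcome]) -> float:
    """Proportion of patients meeting the composite end point."""
    if not patients:
        raise ValueError("no patients supplied")
    return sum(overall_success(p) for p in patients) / len(patients)
```

Because the criteria are combined with a logical AND, a single failed component (for example, a second surgery at the index level) makes the patient an overall failure regardless of how much the NDI score improved.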
The 24-month rates of overall success were 81.4% for the investigational group and 69.4% for the control group. The posterior mean for overall success in the investigational group exceeded that in the control group by 0.112 (95% highest posterior density interval = 0.023 to 0.201) with a posterior probability of 1 for noninferiority and 0.993 for superiority, demonstrating the superiority of the investigational group for overall success. Noninferiority of the investigational group was demonstrated for all individual components of overall success and individual effectiveness end points, except for the SF-36 Mental Component Summary. The investigational group was superior to the control group for NDI success. The proportion of patients experiencing any AE was 93.3% (195/209) in the investigational group and 92.0% (173/188) in the control group, which were not statistically different. The rate of patients who reported any serious AE (Grade 3 or 4) was significantly higher in the control group (90 [47.9%] of 188) than in the investigational group (72 [34.4%] of 209) with a posterior probability of superiority of 0.996. Radiographic success was achieved in 51.0% (100/196) of the investigational patients (maintenance of motion without evidence of bridging bone) and 82.1% (119/145) of the control patients (fusion). At 24 months, heterotopic ossification was identified in 27.8% (55/198) of the superior levels and 36.4% (72/198) of the inferior levels of investigational patients.
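To illustrate what a posterior probability of superiority means for a comparison of two proportions, the sketch below applies a simple beta-binomial model to the serious AE counts quoted above (72 of 209 investigational vs 90 of 188 control). It assumes independent uniform Beta(1, 1) priors and no covariate adjustment, which is an assumption made here for illustration; the abstract does not describe the trial's actual Bayesian model, so the result need not match the reported 0.996 exactly.

```python
import random


def prob_rate_lower(events_a: int, n_a: int, events_b: int, n_b: int,
                    draws: int = 100_000, seed: int = 0) -> float:
    """Monte Carlo estimate of P(rate_a < rate_b | data).

    Assumes independent Beta(1, 1) priors, so each posterior is
    Beta(1 + events, 1 + non-events).
    """
    rng = random.Random(seed)
    wins = 0
    for _ in range(draws):
        rate_a = rng.betavariate(1 + events_a, 1 + n_a - events_a)
        rate_b = rng.betavariate(1 + events_b, 1 + n_b - events_b)
        if rate_a < rate_b:
            wins += 1
    return wins / draws


if __name__ == "__main__":
    # Serious AE counts from the text: investigational 72/209, control 90/188.
    p = prob_rate_lower(72, 209, 90, 188)
    print(f"P(investigational serious AE rate < control rate) ~ {p:.3f}")
```

The returned value is the share of posterior draws in which the investigational rate is the lower of the two, which is the quantity a statement such as "posterior probability of superiority" summarizes.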
Arthroplasty with the Prestige LP cervical disc is as effective and safe as ACDF for the treatment of cervical DDD at 2 contiguous levels and is an alternative treatment for intractable radiculopathy or myelopathy at 2 adjacent levels.
Clinical trial registration no.: NCT00637156 (clinicaltrials.gov)
Matthew J. McGirt, Mohamad Bydon, Kristin R. Archer, Clinton J. Devin, Silky Chotai, Scott L. Parker, Hui Nian, Frank E. Harrell Jr., Theodore Speroff, Robert S. Dittus, Sharon E. Philips, Christopher I. Shaffrey, Kevin T. Foley and Anthony L. Asher
Quality and outcomes registry platforms lie at the center of many emerging evidence-driven reform models. Specifically, clinical registry data are progressively informing health care decision-making. In this analysis, the authors used data from a national prospective outcomes registry (the Quality Outcomes Database) to develop a predictive model for 12-month postoperative pain, disability, and quality of life (QOL) in patients undergoing elective lumbar spine surgery.
Included in this analysis were 7618 patients who had completed 12 months of follow-up. The authors prospectively assessed baseline and 12-month patient-reported outcomes (PROs) via telephone interviews. The PROs assessed were those ascertained using the Oswestry Disability Index (ODI), EQ-5D, and numeric rating scale (NRS) for back pain (BP) and leg pain (LP). Variables analyzed for the predictive model included age, gender, body mass index, race, education level, history of prior surgery, smoking status, comorbid conditions, American Society of Anesthesiologists (ASA) score, symptom duration, indication for surgery, number of levels surgically treated, history of fusion surgery, surgical approach, receipt of workers’ compensation, liability insurance, insurance status, and ambulatory ability. To create a predictive model, each 12-month PRO was treated as an ordinal dependent variable and a separate proportional-odds ordinal logistic regression model was fitted for each PRO.
There was a significant improvement in all PROs (p < 0.0001) at 12 months following lumbar spine surgery. The most important predictors of overall disability, QOL, and pain outcomes following lumbar spine surgery were employment status, baseline NRS-BP scores, psychological distress, baseline ODI scores, level of education, workers’ compensation status, symptom duration, race, baseline NRS-LP scores, ASA score, age, predominant symptom, smoking status, and insurance status. The prediction discrimination of the 4 separate novel predictive models was good, with a c-index of 0.69 for ODI, 0.69 for EQ-5D, 0.67 for NRS-BP, and 0.64 for NRS-LP (i.e., good concordance between predicted outcomes and observed outcomes).
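The c-index reported above is the proportion of usable patient pairs in which the patient with the higher predicted outcome also has the higher observed outcome, with tied predictions counted as half. A minimal Python sketch for an uncensored outcome such as a 12-month ODI score is shown below. The function name and the toy values are hypothetical, and this is not the authors' code.

```python
from itertools import combinations


def c_index(predicted: list[float], observed: list[float]) -> float:
    """Concordance index for an uncensored ordinal or continuous outcome.

    Pairs with identical observed outcomes carry no ordering information
    and are skipped; pairs with tied predictions count as half concordant.
    """
    concordant = 0.0
    usable = 0
    for (p_i, y_i), (p_j, y_j) in combinations(zip(predicted, observed), 2):
        if y_i == y_j:
            continue
        usable += 1
        if p_i == p_j:
            concordant += 0.5
        elif (p_i < p_j) == (y_i < y_j):
            concordant += 1.0
    if usable == 0:
        raise ValueError("no pairs with different observed outcomes")
    return concordant / usable


if __name__ == "__main__":
    # Toy values for illustration only, not registry data.
    predicted_odi = [10.0, 20.0, 30.0, 40.0]
    observed_odi = [12.0, 18.0, 35.0, 33.0]
    print(c_index(predicted_odi, observed_odi))
```

A value of 0.5 corresponds to predictions that order patients no better than chance, and 1.0 to perfect ordering. The pairwise loop is quadratic in the number of patients, which is acceptable for a sketch but slow for a registry of several thousand records.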
This study found that preoperative patient-specific factors derived from a prospective national outcomes registry significantly influence PRO measures of treatment effectiveness at 12 months after lumbar surgery. Novel predictive models constructed with these data hold the potential to improve surgical effectiveness and the overall value of spine surgery by optimizing patient selection and identifying important modifiable factors before a surgery even takes place. Furthermore, these models can advance patient-focused care when used as shared decision-making tools during preoperative patient counseling.
Anthony L. Asher, Clinton J. Devin, Kristin R. Archer, Silky Chotai, Scott L. Parker, Mohamad Bydon, Hui Nian, Frank E. Harrell Jr., Theodore Speroff, Robert S. Dittus, Sharon E. Philips, Christopher I. Shaffrey, Kevin T. Foley and Matthew J. McGirt
Current costs associated with spine care are unsustainable. Productivity loss and time away from work for patients who were once gainfully employed contributes greatly to the financial burden experienced by individuals and, more broadly, society. Therefore, it is vital to identify the factors associated with return to work (RTW) after lumbar spine surgery. In this analysis, the authors used data from a national prospective outcomes registry to create a predictive model of patients’ ability to RTW after undergoing lumbar spine surgery for degenerative spine disease.
Data from 4694 patients who underwent elective spine surgery for degenerative lumbar disease, who had been employed preoperatively, and who had completed a 3-month follow-up evaluation, were entered into a prospective, multicenter registry. Patient-reported outcomes—Oswestry Disability Index (ODI), numeric rating scale (NRS) for back pain (BP) and leg pain (LP), and EQ-5D scores—were recorded at baseline and at 3 months postoperatively. The time to RTW was defined as the period between operation and date of returning to work. A multivariable Cox proportional hazards regression model, including an array of preoperative factors, was fitted for RTW. The model performance was measured using the concordance index (c-index).
Eighty-two percent of patients (n = 3855) returned to work within 3 months postoperatively. The risk-adjusted predictors of a lower likelihood of RTW were being preoperatively employed but not working at the time of presentation, manual labor as an occupation, workers’ compensation, liability insurance for disability, higher preoperative ODI score, higher preoperative NRS-BP score, and demographic factors such as female sex, African American race, history of diabetes, and higher American Society of Anesthesiologists score. The likelihood of RTW within 3 months was higher in patients with a higher education level than in those with less than high school–level education. The c-index of the model’s performance was 0.71.
This study presents a novel predictive model for the probability of returning to work after lumbar spine surgery. Spine care providers can use this model to educate patients and encourage them in shared decision-making regarding the RTW outcome. This evidence-based decision support will result in better communication between patients and clinicians and improve postoperative recovery expectations, which will ultimately increase the likelihood of a positive RTW trajectory.
Anthony L. Asher, Silky Chotai, Clinton J. Devin, Theodore Speroff, Frank E. Harrell Jr., Hui Nian, Robert S. Dittus, Praveen V. Mummaneni, John J. Knightly, Steven D. Glassman, Mohamad Bydon, Kristin R. Archer, Kevin T. Foley and Matthew J. McGirt
Prospective longitudinal outcomes registries are at the center of evidence-driven health care reform. Obtaining real-world outcomes data at 12 months can be costly and challenging. In the present study, the authors analyzed whether 3-month outcome measurements sufficiently represent 12-month outcomes for patients with degenerative lumbar disease undergoing surgery.
Data from 3073 patients undergoing elective spine surgery for degenerative lumbar disease were entered into a prospective multicenter registry (N2QOD). Baseline, 3-month, and 12-month follow-up Oswestry Disability Index (ODI) scores were recorded. The absolute difference between actual 12- and 3-month ODI scores was evaluated. Additionally, the authors analyzed the absolute difference between actual 12-month ODI scores and a model-predicted 12-month ODI score (the model used patients' baseline characteristics and actual 3-month scores). The minimal clinically important difference (MCID) for ODI of 12.8 points and the substantial clinical benefit (SCB) for ODI of 18.8 points were used based on previously published values. The concordance rate of achieving MCID and SCB for ODI at 3 and 12 months was computed.
The 3-month ODI scores differed from 12-month scores by an absolute difference of 11.9 ± 10.8, and predictive modeling estimations of 12-month ODI scores differed from actual 12-month scores by a mean (± SD) of 10.7 ± 9.0 points (p = 0.001). Sixty-four percent of patients (n = 1982) achieved an MCID for ODI at 3 months in comparison with 67% of patients (n = 2088) by 12 months; 56% (n = 1731) and 61% (n = 1860) of patients achieved SCB for ODI at 3 months and 12 months, respectively. Almost 20% of patients had ODI scores that varied at least 20 points (the point span of an ODI functional category) between actual 3- and 12-month values. In the aggregate analysis of achieving MCID, 77% of patients were concordant and 23% were discordant in achieving or not achieving MCID at 3 and 12 months. The discordance rates of achieving or not achieving MCID for ODI were in the range of 19% to 27% for all diagnoses and treatments (decompression with and without fusion). The positive and negative predictive values of 3-month ODI to predict 12-month ODI were 86% and 60% for MCID and 82% and 67% for SCB.
Based on their findings, the authors conclude the following: 1) Predictive methods for functional outcome based on early patient experience (i.e., baseline and/or 3-month data) should be used to help evaluate the effectiveness of procedures in patient populations, rather than serving as a proxy for long-term individual patient experience. 2) Prospective longitudinal registries need to span at least 12 months to determine the effectiveness of spine care at the individual patient and practitioner level.