Skip to main content

Adjusting DRG Payment Targets for Low Volume

Considerable discussion has taken place during the Centers for Medicare and Medicaid Services (CMS) resource calls about which process to use when computing episode payment rates for low-volume Diagnosis Related Groups (DRG).  This is an issue because Bundled Payment for Care Improvement (BPCI) applicants are required to participate for all DRG severity levels within a selected episode definition, even though the applicant may have extremely low volumes for one or more of those DRGs.  Low episode volumes cause inaccuracies in calculating average prices and may create inappropriate advantages or disadvantages for the provider if its historical costs were extremely high or low.

The Empirical Bayes Method


To correct for this, CMS is evaluating an approach that has two components;  first, the introduction of geographically-based averages to “correct” for the small sample size of an individual provider, and second, the use of the empirical Bayes statistical technique to create the varying proportions for blending the provider’s historical cost with the average regional cost.  Another approach has apparently been suggested to CMS, which is to use a longer base period of provider history to increase the sample size; however CMS appears to be favoring the empirical Bayes approach.

There are two issues with the Bayes approach.  First, it assumes that use of the geographic average improves the accuracy of the target price over a price that which would be set based solely on the providers historical cost for low-volume DRGs.  This assumption may not be true.  Episode cost variations among hospitals across a geographic region occur for a variety of reasons, most notably the difference in post-discharge care which can be based on the availability of post-acute providers such as rehabilitation hospitals and skilled nursing facilities.  The graph below shows the average cost per episode for DRG 470 (a DRG with a relatively low coefficient of variation) across hospitals having more than 100 episodes of that DRG. The red bar shows the average cost across the hospital referral cluster (HRC). The coefficient of variation across these hospitals is 8%, meaning that substituting the geographic average for the hospital actual average cost could introduce an error (at a 95% confidence level) of almost 16%.  Hospitals whose actual cost was lower than the geographic average would have a built-in advantage of this amount, while hospitals on the other side of the cost curve would have this difference as a built-in disadvantage.



This effect can reverse the incentives for potential participants in the BPCI program.  Without a geographic component of the target episode cost budget, participants would prefer to participate in DRGs in which their costs are higher than other hospitals, since this excessive cost may indicate an opportunity for cost reduction.  However, when a geographic average becomes part of the target payment budget calculation, the opposite incentive is created.  Under that scenario, hospitals would participate in DRGs in which they are a lower cost provider because they will have an automatic cost advantage.  This is contrary to the original concept of the BPCI.

The second problem with this approach is the use of the empirical Bayes technique, as opposed to a more transparent arithmetic approach.  For most participants, empirical Bayes will be a “black box” that cannot be duplicated and therefore assessed for its accuracy and reasonableness.  Applicants may be wary of participating in the program in which rates are set using a process that cannot easily be analyzed or understood.

The “Alternative Method”


An alternative to the empirical Bayes method has been proposed that utilizes a much simpler approach. This approach computes a regional or nationally-based weight from the episode cost of all DRGs, and applies those weights to the average episode cost for each participant across all DRGs to compute the cost for each DRG. In this way it develops an “index number” for the cost of each DRG, similar to that DRG weights use in the inpatient prospective payment system.  The formula for that method appears as follows:
(National cost for each DRG) / (National cost for all DRGs) * Applicant cost for all DRGs = Applicant cost for each DRG

This alternative is a significant improvement over the empirical Bayes method that uses regional cost components because it avoids the issues described above.  It does, however, retain one negative aspect in that it does not utilize the applicant’s historical cost for DRGs that have a sufficient sample size to create reliable pricing.  In our opinion, adjustments should only be used for low-volume DRGs; high-volume DRGs should always be based upon the applicant’s historical cost. Under the alternative method, changes in case mix could create advantages or disadvantages for the applicant, depending on which way the case mix moved.

Another Alternative Method


To avoid this issue, we proposed to CMS a different variation of the calculation, which would only be used to estimate episode cost for low-volume DRGs.  In our method, national ratios are developed between the episode costs of DRGs within the same group, relative to the cost of the highest volume DRG.  For example, the average cost of DRG 469 would be computed on a regional or national basis relative to the cost of DRG 470, which has a much higher volume.  This relationship would be utilized for applicants who do not have sufficient volume in DRG 469 by applying the resulting ratio to the applicant’s cost for DRG 470.
(National average cost for low-volume DRG) / (National average cost for high-volume DRG) * Applicant cost for high-volume DRG = Applicant cost for low-volume DRG

The advantage of this method is that it would retain the applicant’s historical average costs for most of the DRGs, substituting estimated values only for low-volume DRGs. This would retain historical accuracy and avoid the effects of changes in patient mix.

A More Radical Alternative

With the current emphasis on computing weights for individual DRGs within the same DRG family, another payment alternative may have been overlooked.  In this alternative, an average payment rate is computed for all DRGs within the same DRG family (without complications, w/CC and w/MCC), and that average is applied to all cases within that family. It effectively eliminates the differences in complication levels of patients. Under this alternative, participants would create a financial advantage by lowering their case mix, which would occur by having relatively fewer patients in the more complicated DRGs.  Since creating incentives to reduce complications should be a goal of any payment system, this payment system would meet that goal. It would also eliminate the incentive for upcoding, since there would be no increased payment for patients with higher-severity DRGs.  However, it will also create risk for participants whose severity mix increased beyond their control, since payment for more severe cases will be identical to that for less severe cases.  Such a payment system may be considered in future bundled payment initiatives, but given the risk adversity of current applicants it appears to be less likely to be adopted for current payment systems.

Summary

CMS has indicated that the proportion of the target developed from geographic data will be greater for DRGs with smaller sample sizes and larger amounts of variation; however CMS has not yet made a final determination of the methodology. It will be important to monitor the decision on this process, since introduction of geographic adjusters to DRGs in which historical data may provide reliable averages may create some of the errors and biases described above.

--------
To assist applicants in understanding the implications of recent changes to the BPCI structure, DataGen and Singletrack Analytics are preparing a series of articles dealing with some of the changes that are being implemented in the second phase of the application process.  These articles focus on analysis of the data, and on quantitative and financial aspects of the BPCI payment model. This article focuses on adjusting the episode payment targets to account for low volumes.  Subsequent articles will deal with quantification of the risk assumed by BPCI participants due to random variation in patient selection.  Please see our previous articles about the effect of outliers on risk and shared savings and the introduction of new required DRGs.

DataGen has partnered with Singletrack Analytics, to develop an analytical toolkit and consulting assistance for BPCI applicants. Our organizations specialize in payment system analytics and modeling, and assisting provider organizations to quantify and measure risks and returns under various systems of payment.  For more information about these services, contact Gloria Kupferman (DataGen) at GKupferm@hanys.org or Jon Pearce (Singletrack Analytics) at jon@singletrackanalytics.com.  Comments or questions about these articles are welcome.

Popular posts from this blog

Five key components of a strong patient safety culture

In today’s healthcare environment, ensuring patient safety is more than just a priority — it’s a fundamental component of quality care. Establishing a strong patient safety culture within hospitals and health organizations can dramatically reduce errors, increase patient satisfaction and improve overall healthcare outcomes. But what exactly is a patient safety culture, and how can institutions cultivate it effectively?  This blog post explores the five key components that make up a robust patient safety culture, along with insights from the Agency for Healthcare Research and Quality (AHRQ) and The Joint Commission.  What is patient safety culture?  AHRQ defines patient safety culture as how an organization's culture supports and promotes patient safety. This can extend to multiple levels, from individual units to departments to system levels. The AHRQ patient safety culture survey encompasses the shared values, beliefs and norms of healthcare practitioners and staff that...

Community Health Needs Assessment examples: Q&A on CHNA data reporting

Include integral data in your Community Health Needs Assessment examples  Community Health Needs Assessment (CHNA) examples tend to focus on case studies. However, using integral data for your CHNA can provide overall guidance, making it easier to start and complete your final report. In this Q&A, DataGen’s Melissa Bauer, principal healthcare informatics analyst, explains why using data in the CHNA is key and what types of data you should collect.   Q1: What is CHNA data?   A CHNA requires two types of data: primary and secondary. Using these two data streams, organizations can better characterize the community's health. This helps the organization conducting the CHNA best understand their community needs. It also informs them on the best ways to respond to them, providing guidance on where to start and how to evaluate impact and outcomes. Here’s a further explanation of the data found in a CHNA:  Primary data includes community surveys, focus groups, in-dep...