The Role of Data Equity in Achieving Health Equity

Network: Milbank State Leadership Network
Focus Area: State Health Policy Leadership
Topic: Health Equity
Dr. Ninez Ponce

Dr. Paris “AJ” Adkins-Jackson

To celebrate its 100th year, The Milbank Quarterly has published a centennial anniversary issue with 36 articles that consider the future of population health. In this Q&A with Ninez Ponce, PhD, MPP, of the University of California, Los Angeles (UCLA) Center for Health Policy Research, and Paris “AJ” Adkins-Jackson, PhD, MPH, of the Columbia University Mailman School of Public Health, we discuss their contribution to the issue, “Making Communities More Visible: Equity-Centered Data to Achieve Health Equity,” coauthored with Riti Shimkhada, PhD, MPH, of the UCLA Center for Health Policy Research.

Their article argues that structural and systemic biases in health data systems render certain communities “invisible” and excluded from policy decisions, contributing to health inequities. The authors define data equity, discuss policy recommendations from the Biden administration’s Equitable Data Working Group Report, and highlight the need for community-centered data.

What is data equity?

Ponce: Data equity is about putting marginalized voices front and center. Data systems are limited to broad racialized categories and don’t measure the more granular, more precise identities of people from subgroups or intersections among racialized groups, sexual orientation groups, gender identity groups, and disability groups.

Adkins-Jackson: To acknowledge and then address systemic issues, biases, and inequities, you have to acknowledge and address that people are invisible in the data that we use. We group people together, or we might decide not to capture the data because the data points are just too small. Not only is this systemic bias, but it also sets the tone for racist measurement. What we’re asking is to turn that on its head. Acknowledge that these groups exist, and after you acknowledge that they exist, start to capture their experiences with inequity. Then you can address it because you see it.

Please describe the Equitable Data Working Group and what it set out to do.

Ponce: I’ve been in this data equity business for over 30 years, and I’m just thrilled that there’s momentum now led by the federal government. In 2022, the Biden administration’s Equitable Data Working Group released recommendations in the following areas: disaggregated data, underused data, capacity for equity assessment, partnerships across government and research communities, and accountability to the public. Our paper looks at these key priority areas, the selected recommendations, and some of the identified tasks.

In disaggregated data, the monumental change right now is revising the Office of Management and Budget (OMB) Statistical Policy Directive 15, which puts forth the standard racialized group categories. It was last revised in 1997. The five current racialized categories are American Indian or Alaska Native; Asian; Black or African American; Native Hawaiian or Other Pacific Islander; and White. The Latino category is an ethnic overlay over these five racialized groups. One of the proposed revisions is to include Middle Eastern and North African in the minimum set of categories, and to add the Hispanic or Latino identity as one of these racialized groups. There are also suggestions for a more detailed set of categories, in which subgroups are included in forms that elicit self-report of somebody’s identity. We fully endorse more disaggregation of racialized groups.

I think the most important piece is accountability to the public, where you increase transparency in serving marginalized communities, as well as build access, like through data-friendly tools, to make it not so expensive or so hard to get restricted data.

What does disaggregating data mean, and why is data visibility so important?

Ponce: Disaggregating data means unlocking the truths that are behind a lumped identity. The aggregate average masks the pain, or the assets, of the groups underneath this lumped identity. Even though the current OMB Directive 15 mandates that Asians be disaggregated from Native Hawaiians and Pacific Islanders, when the Native Hawaiian and Pacific Islander Data Policy Lab at UCLA looked at COVID data state by state and jurisdiction by jurisdiction, over 20 states were still not disaggregating. Lumping Native Hawaiians and Pacific Islanders with Asians hid the high COVID-19 death rates and case rates for the Native Hawaiian and Pacific Islander group. Disaggregation increases data quality because it gets at a more precise inference of what’s happening with different groups in the US population.

Adkins-Jackson: I always think of social contracts. In my social contract with my country, I pay taxes. And for those taxes, I expect public amenities, I expect access to a good quality life. A lot of those societal functions are based on the data that the government has available for groups. If we’ve made you invisible in the data, how then can you advocate for your communities, for your needs? [Data] has such downstream implications for groups. If you can take my dollar, then you need to tell my story.

Another recommendation from the working group is about developing metrics on racism. Why is it important to develop metrics on racism?

Ponce: You need to have structural racism measures and not just look at the disparities outcomes. That’s why you need to disaggregate. You can’t have health equity without data equity.

Adkins-Jackson: Capturing structural racism and other structural and social determinants of health means capturing the exposure. What health disparities [research] does is capture the result of that exposure. Depending on how you see the relationship between the two, you might think that when you capture the outcome, which is the disease outcome or the health experience, that disparity encapsulates that exposure. And I think that is true, but I also think that it’s important to tell the complete story. If the distinction is not clear, when combined with the non-scientific categories, you end up with the illusion that there’s a biological basis for disparities, which is even more dangerous.

The COVID-19 pandemic highlighted the weaknesses of data collection. Why is it important to collect and combine both place-based and population-level data?

Ponce: You need both to triage and direct public policy resources [during an emergency]. Place-based data are a quick way of deploying these resources. In some states, social vulnerability indices (SVIs), which are multidimensional place-based measures, were used to prioritize where vaccine pop-ups would be and to determine the allocation of resources to certain counties, community groups, and community organizations. But for smaller groups that are dispersed, this geographic targeting misses the risk of groups that may not be in the so-called worst quintile or quartile where we’re targeting resources. Place-based data have value because you can immediately target places, but they must be augmented for populations that do not necessarily live in the areas shaped by the legacy of residential segregation.

Adkins-Jackson: We use easy targets like location as a way to understand an experience for a group of people. I often ask my colleagues what is more important: an exposure at home, an exposure at work, or an exposure going in between? Because for some racialized groups, they’ve never been mostly in their neighborhoods, they’ve always commuted out for work. So, while you’re looking at their neighborhood exposure, they were exposed on the drive to work, and you missed it because you didn’t capture the intersection of all these parts.

Your article closes with the concept of community-centered data and how that can help build community trust. What will it look like when communities have access to their data?

Ponce: If we’re trying to generate knowledge and evidence for populations and communities of interest, particularly marginalized communities, then they must be alongside us every step of the way. It’s not just creating a community advisory board and only asking them to give feedback at the beginning and the end of the project, or reaching out to community groups so they can help spread the word and disseminate our study. Data equity is the process and the product, but the big P is the process.

Adkins-Jackson: My colleagues and I have been joking about whether ChatGPT and other AI resources are going to replace us as data analysts. But AI cannot replace the community, because you won’t understand context, you won’t understand meaning, you won’t understand exposure, you won’t understand impact, you won’t understand anything.

Ponce: We’ve also got to build the data capacity of these communities and community organizations. Let us make sure that the grants that we have invest in the data consumption capacity and data production capacity of organizations. You have to build the infrastructure in these community organizations, and that includes encouraging members of these organizations to be part of our pipeline of academics and data producers.

Adkins-Jackson: Accountability is being so connected to the community that they are parts of your institution and organization. Without them, you can’t even get your basic processes or products completed. And you can’t have data equity without the people at the table.
