We believe in the power of high-quality, disaggregated data describing conditions in our region to drive better decision-making and more powerful campaigns for equity solutions. We provide a variety of trainings and support to help you use our data.
What geographies are available on the Bay Area Equity Atlas?
The Bay Area Equity Atlas draws some of its data from the National Equity Atlas indicators database developed by PolicyLink and PERE, and also includes more than a dozen new indicators derived from local and state data sources as well as unique surveys. The Atlas includes data for the following geographies:
- Region: The Five- and Nine-County Bay Area regions
- County: The nine Bay Area counties (Alameda, Contra Costa, Marin, Napa, San Mateo, San Francisco, Santa Clara, Solano, and Sonoma)
- Sub-county: 40 Consistent Public Use Microdata Areas (CPUMAs)
- Large city: Six large Bay Area cities (Antioch, Fremont, Oakland, San Francisco, San José, and Sunnyvale)
- Other city or town: 95 other Bay Area cities and towns
- Census Designated Place (CDP): 119 unincorporated areas of Bay Area counties identified by the Census for statistical purposes
- State: California
The Atlas also presents data in some maps for the 1,588 census tracts in the region. Our data sources include the Integrated Public Use Microdata Series (IPUMS USA), U.S. Census Bureau, GeoLytics, Inc., California Department of Finance, Association of Bay Area Governments, UC Berkeley Statewide Database, California Department of Education, UC Berkeley Urban Displacement Project, California Fair Housing Task Force, California Department of Justice, GovBuddy, and Zillow Group, Inc.
Why is data missing for certain race/ethnic groups in some regions?
While our equitable growth indicators database incorporates a variety of data sources, much of our analysis is based on a unique dataset created using microdata samples (i.e., “individual-level” data) from the Integrated Public Use Microdata Series (IPUMS USA), and Census and American Community Survey (ACS) summary file data for three points in time: 2000, and the five-year pools of 2006 through 2010 and 2011 through 2015. The IPUMS microdata allows for the tabulation of detailed population characteristics, but because such tabulations are based on samples, they are subject to a margin of error and should be regarded as estimates—particularly in smaller regions and for smaller demographic subgroups. In an effort to avoid reporting highly unreliable estimates, we do not report any estimates that are based on a universe of fewer than 100 individual (unweighted) survey respondents. Similarly, no data from the Census and ACS summary files is reported if based on a universe of less than 100 (weighted). See our summary of data sources and geographies for more information.
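The reporting threshold described above can be sketched in a few lines of Python. This is only an illustration, not the Atlas's actual code: the function name `weighted_share`, the record layout, and the sample values are hypothetical. It computes a weighted share from microdata records and suppresses the result when the universe has fewer than 100 unweighted respondents.

```python
from typing import List, Optional, Tuple

# Threshold stated in the text: fewer than 100 unweighted respondents
# in the universe means the estimate is not reported.
MIN_RESPONDENTS = 100

# Hypothetical record layout: (survey weight, whether the respondent
# falls in the numerator of the indicator).
Record = Tuple[float, bool]


def weighted_share(records: List[Record],
                   min_respondents: int = MIN_RESPONDENTS) -> Optional[float]:
    """Return the weighted share of records in the numerator,
    or None when the unweighted universe is below the threshold."""
    if len(records) < min_respondents:
        return None  # suppressed: too few respondents to report
    total_weight = sum(weight for weight, _ in records)
    if total_weight == 0:
        return None  # nothing to divide by
    numerator = sum(weight for weight, in_numerator in records if in_numerator)
    return numerator / total_weight


# Usage: 100 respondents is enough to report; 99 is not.
enough = [(10.0, True)] * 50 + [(10.0, False)] * 50
too_few = enough[:99]
print(weighted_share(enough))
print(weighted_share(too_few))
```

For summary file data the same check would apply to the weighted universe count rather than to the number of respondents.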
Do you have any data for the LGBTQ community?
We do not currently have any data for the LGBTQ community in the Atlas because this data is not collected in the data sources that we use. As our note on data by gender explains, this is an important gap that needs correcting, and we advocate for including non-binary gender categories in surveys. We will add indicators to the Atlas over time and are looking for data by sexual orientation and gender identity, particularly for the transgender community, for the Nine-County Bay Area. Please share suggestions for data sources with us at email@example.com.
Why does most data for 2010 and 2015 represent 2006-2010 and 2011-2015 averages?
Many of the data points for 2010 and 2015 are based on a pooled sample of five years of annual survey data (2006 through 2010 and 2011 through 2015, respectively) from the American Community Survey. Because a single year of the ACS data only covers about one percent of the U.S. population, five years of ACS data were pooled together to improve statistical reliability and to achieve a sample size that is comparable to that of most data points for 2000, which are based on the “long form” of the decennial census.
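To make the idea of pooling concrete, here is a minimal Python sketch. It assumes one common approach, which may differ from the Atlas's own procedure: the annual samples are stacked into one file and each survey weight is divided by the number of years, so the pooled weights describe an average year rather than the sum of all years. The function names and sample values are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical record layout: (survey weight, value of interest).
Record = Tuple[float, float]


def pool_years(samples_by_year: Dict[int, List[Record]]) -> List[Record]:
    """Stack annual samples and rescale weights by the number of years."""
    n_years = len(samples_by_year)
    pooled: List[Record] = []
    for year in sorted(samples_by_year):
        for weight, value in samples_by_year[year]:
            # Assumption: dividing by n_years keeps the weighted total
            # close to a single year's population.
            pooled.append((weight / n_years, value))
    return pooled


def weighted_mean(records: List[Record]) -> float:
    """Weighted average of the values in the pooled file."""
    total_weight = sum(weight for weight, _ in records)
    return sum(weight * value for weight, value in records) / total_weight


# Usage: the pooled file has more respondents than any single year.
samples = {
    2011: [(100.0, 10.0), (100.0, 30.0)],
    2012: [(200.0, 20.0)],
}
pooled = pool_years(samples)
print(len(pooled), weighted_mean(pooled))
```

The gain is in the number of respondents behind each estimate: with five years stacked, roughly five times as many records support each data point as in a single year.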
How often will the data be updated?
Most of the datasets underlying the indicators in the Bay Area Equity Atlas are updated annually, so most of the indicators will be updated annually as well. We also add new indicators periodically. Once added, they will follow a similar updating schedule. IPUMS 2017 data was released this spring and we plan to update that data by the end of 2019.
Where can I find data by gender?
Data by gender is available for the following indicators: College readiness, Educational attainment, Disconnected youth, Median earnings, Police use of force, Extreme commuting, Housing burden, Business ownership, Business revenue, and Diversity of electeds. Note that these data sources do not provide complete data on gender identity: the surveys only allow for male/female gender identity and do not allow respondents to identify themselves as non-binary or transgender.
Almost all indicators on the Atlas are measures of central tendency (e.g., means and medians) based on survey data and are subject to a margin of error. While we do not report margins of error, we do make efforts to avoid reporting highly unreliable estimates. Unless otherwise noted, for all indicators derived from the Census and ACS summary files, we do not report values that are based on fewer than 100 (weighted) observations in the denominator/universe.
How do you define race/ethnicity?
In the Atlas, categorization of people by race/ethnicity is generally based on individual responses to various census surveys. For most indicators, people are categorized into six mutually exclusive groups based on their response to two separate questions on race and Hispanic origin, plus one more category for all people of color combined. Further detail about each category is listed on our Methodology page.
Can the data be further disaggregated for additional ethnic subgroups?
The short answer is yes—but it depends on the underlying sample size in the survey from which data is being drawn, and in the particular geography for which data is being reported.
The Bay Area Equity Atlas contains data for racial/ethnic subgroups defined by self-reported ancestry in some indicators. The addition of racial/ethnic subgroup data was driven by the lack of easily accessible data describing the socio-economic diversity within the Asian or Pacific Islander community, which has long been subject to a "model minority" stereotype that does not accurately describe the experience of many groups within the community. However, to provide a comprehensive picture of the diversity that exists within each of the major racial/ethnic categories that are included in the Atlas, we disaggregated the data by ancestry for five of them (all except for the Mixed/other category, for which we think a more appropriate disaggregation would be by the various racial/ethnic groups people identify with rather than by ancestry).
The data for racial/ethnic subgroups allows a user, for example, to examine data on equity indicators for the large Southeast Asian population in San José, CA or the large German population in Santa Clara County, CA—groups that are typically buried in the broader “Asian” and “White” Census categories, respectively. It is important to note, however, that due to sample size limitations, the detailed racial/ethnic subgroup data is only available for geographies with large enough populations of the subgroups being disaggregated.
Do you have data for the disabled community?
We don't currently have any data for the disabled community. We know that local advocates often need data to describe their demographic and economic realities. We will try to add more indicators to the Atlas over time when possible.
Which racial/ethnic groups are included in the Atlas?
There are some inherent challenges to examining indicators by race. We use self-reported racial and ethnic identifications from the U.S. Census. When possible, we provide data for the six major racial/ethnic categories within the Census (White, Black/African American, Hispanic/Latino, Asian/Pacific Islander, Native American, and Mixed/other race), creating mutually exclusive categories by grouping everyone who identifies as being of Hispanic origin in the Hispanic/Latino category.
We also present data by nativity and racial/ethnic subgroups defined by ancestry for some indicators when the sample size is large enough (we do not present data when the sample size is less than 100). We sometimes consolidate all people of color into a single category (individuals who do not identify as non-Hispanic white) for specific data points or in cases where the sample size is not large enough to disaggregate the data for major racial/ethnic groups. An additional challenge we face is that the Census historically undercounts people of color—something that is important to recognize but not something we are able to effectively address.
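The grouping logic described above can be sketched as follows. This is an illustration under stated assumptions, not the Atlas's actual code: the input codes (such as `"white"` or `"asian_pacific_islander"`) and the function names are hypothetical, while the category labels follow the six groups named in the text. Anyone who reports Hispanic origin is placed in Hispanic/Latino regardless of race, which is what makes the categories mutually exclusive.

```python
# Hypothetical input codes mapped to the category labels used in the text.
RACE_LABELS = {
    "white": "White",
    "black": "Black/African American",
    "asian_pacific_islander": "Asian/Pacific Islander",
    "native_american": "Native American",
}


def categorize(race: str, is_hispanic: bool) -> str:
    """Assign one of six mutually exclusive race/ethnicity categories."""
    if is_hispanic:
        # Hispanic origin takes precedence over the race response.
        return "Hispanic/Latino"
    # Any other or multiple-race response falls into Mixed/other.
    return RACE_LABELS.get(race, "Mixed/other")


def is_person_of_color(category: str) -> bool:
    """People of color: everyone not categorized as non-Hispanic White."""
    return category != "White"


# Usage with a few hypothetical survey responses.
for race, hispanic in [("white", False), ("white", True), ("two_or_more", False)]:
    category = categorize(race, hispanic)
    print(race, hispanic, category, is_person_of_color(category))
```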
Where can I find data by ancestry?
Data by ancestry is available for the following indicators: Nativity and ancestry, Disconnected youth, Extreme commuting, Housing burden, and Linguistic isolation. The availability of data by ancestry depends on the underlying sample size in the survey from which data is being drawn, and in the particular geography for which data is being reported. For more information about how we categorize people by ancestry, please click here.
Where can I find data on the Native American population?
Who can I contact with technical questions about the data and methodology?
We provide an overview of our overall methodological approach on our Methodology page, as well as a detailed methodological breakdown for each indicator on our People, Place, and Power pages. If you have additional questions about the data or methods, please contact Justin Scoggins, Data Manager at PERE, at firstname.lastname@example.org.
How do I cite the Bay Area Equity Atlas?
We encourage you to use the data and graphics from the Bay Area Equity Atlas, and ask that you cite the Atlas as your source. Here are recommended citations:
Can I download the underlying data?
Yes, you can. Our aim is to democratize data and make it available to you to explore and use. If you click on the downward-facing arrow on each breakdown, you will find the option to download an Excel sheet with the underlying data for the breakdown, along with options to download an image or a PowerPoint presentation. If you would like access to other data, please email us at email@example.com detailing your request and plans for use, and we will try to package it for you.
What are Public Use Microdata Areas (PUMAs)?
Public Use Microdata Areas (PUMAs) are statistical geographic areas defined by the U.S. Census Bureau for the dissemination of Public Use Microdata Sample (PUMS) data, and are the lowest level of geography attached to individual respondents in the PUMS data. They are also used for disseminating American Community Survey (ACS) summary file period estimates. The Atlas uses Consistent Public Use Microdata Areas (referred to as “CPUMAs” on the Atlas), a geography created by the Integrated Public Use Microdata Series (IPUMS USA). The version used on the Atlas is based on the CPUMA0010 variable and is drawn to form, in essence, the geographic lowest common denominator between the 2000 and 2010 PUMAs.
Why does the Atlas focus on race/ethnicity? What about the other dimensions of inequity?
The Atlas provides extensive data about racial equity and inclusion to allow users to examine how well diverse groups can access the resources and opportunities they need to participate and prosper. Race is a social construct, not a biological one, and in an equitable society, there would not be major differences across racial groups. The differences we do see are primarily due to historical and ongoing policies, decisions, and institutional practices that have racially discriminatory impacts, whether intended or unintended.
We recognize that inequities exist across many characteristics in addition to race/ethnicity and nativity, including income, gender, age, ability, sexual orientation, and neighborhood. Unfortunately, because we are working with survey data and seek to provide data for metropolitan regions, we are limited in the extent to which we can disaggregate the data. We will seek to add layers of data to examine other dimensions of inequity as the Atlas evolves.