Skip to main content
Glama
brockwebb

Open Census MCP Server

by brockwebb
1a1824db38a019ef67fefac0bcfa7d53ab0979a8.txt21.3 kB
Using the U.S. Census Bureau’s Statistical Testing Tool One of the most important uses of American Community Survey (ACS) data is to make comparisons between estimates—across different geographic areas, different time periods, or different population subgroups. In addition to determining whether an estimate is comparatively higher or lower, data users also need to account for the sampling error associated with each estimate by conducting statistical testing. All estimates produced from sample surveys have uncertainty associated with them as a result of being based on a sample of the population rather than the full population. Sampling error is the difference between an estimate based on a sample and the corresponding value that would be obtained if the estimate were based on the entire population. Statistical testing shows whether the observed difference between estimates likely represents a true difference that exists within the full population and that the difference is unlikely to have occurred by chance. In such cases where the observed differences between estimates are statistically significant, making statements regarding their differences is valid. In other cases where the observed differences between estimates are not statistically significant, making comparison statements regarding their differences outside of similarity is invalid as it misrepresents the data. The U.S. Census Bureau developed a Statistical Testing Tool to allow all data users to carry out statistical testing with ACS data. The tool is an Excel spreadsheet with tabs that automatically calculate statistical significance when comparing estimates. While the tool was designed for ACS data, it may be used to conduct statistical testing with data from other Census Bureau surveys.1 When using data from other surveys, data users should consult the relevant technical documentation for how to handle special cases. Otherwise, this tool may return a result of “not applicable” when a statistical test could be carried out. This guide provides step-by-step instructions for using the Statistical Testing Tool to determine whether differences between ACS estimates are statistically different. 1. Key Benefits to Using the Statistical Testing Tool To use the Statistical Testing Tool, you first need to obtain estimates and their associated margins of error (MOE) through one of the Census Bureau’s data tools. These estimates and MOEs will then be copied into the Statistical Testing Tool and the results of the statistical test will be displayed. The results will show “Yes”—the estimates are statistically different from each other, “No”—the estimates are not statistically different, or “N/A”—statistical testing is not applicable. The Statistical Testing Tool uses Z-scores to determine statistical significance at the 90 percent confidence level. In the case of a statistically significant result, this means that there is less than a 10 percent chance that the observed difference between the two estimates occurred randomly. The tool does not include an option for multiple comparison adjustments (e.g., Bonferroni test). The Statistical Testing Tool is flexible and designed for data users to paste data directly into a spreadsheet with very little editing or adjustments needed. You can use the Two-Estimates spreadsheet to compare over 3,200 pairs of estimates, and you can use the Multiple Estimates spreadsheet to compare up to 150 estimates with each other. You can also easily change the parameters to modify the significance level (e.g., from 90 percent to 95 percent) or to conduct a test based on Standard Errors instead of MOEs. 1 If you are looking for information on how to use ACS Variance Replicate Estimates or the Public Use Microdata Sample (PUMS) to create margins of error, please see our Variance Replicate Estimate Tables <www.census.gov/programs-surveys/acs/data/variance-tables.html> and PUMS Documentation <www.census.gov/programs-surveys/acs/microdata/documentation.html> pages. 1 2. Accessing the Statistical Testing Tool The Statistical Testing Tool can be found on the Census Bureau’s Website. From the ACS homepage, click on “Guidance for Data Users” (see Figure 2.1). Figure 2.1 Source: U.S. Census Bureau, American Community Survey (ACS), <www.census.gov/programs- surveys/acs>. Then click on “Statistical Testing Tool” (see Figure 2.2). Figure 2.2 Source: U.S. Census Bureau, American Community Survey (ACS), <www.census.gov/programs- surveys/acs/guidance.html>. 2 The Statistical Testing Tool Web page provides some background information and a link to download the tool (see Figure 2.3). Figure 2.3 Source: U.S. Census Bureau, American Community Survey (ACS), <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. 3. Data Requirements to Use the Statistical Testing Tool The data requirements for using the Statistical Testing Tool are minimal. All you need to get started are two or more comparable estimates and their associated margins of error (or standard errors). You can use the tool to conduct statistical tests for counts (such as the number of people or households), means, medians, percentages, and other types of estimates. While the Statistical Testing Tool is designed for data obtained through data.census.gov, you can also use Census Bureau data from other sources, such as the Application Programming Interface (API) and My Congressional District. The Statistical Testing Tool can be used to compare two estimates to each other—for example, comparing change across time for the same geography, comparing two geographies to each other, or comparing two population groups. The tool can also be used to compare many estimates to each other, such as comparing estimates for all counties in a state to each other. This comparison can be useful when trying to rank or order data across many geographies or groups. Data users may wish to compare more than 150 estimates to each other. For example, they may wish to find the county with the highest or lowest estimate for a characteristic. One way to handle this is to use the two statistical testing tabs together. For example, to find the highest county estimate, first sort the estimates. Then compare the top 150 counties using the “Stat_Test_for_Multiple_Est” tab to find the one with the highest estimate. To ensure that this is indeed the highest estimate, copy all county estimates into the first estimate column in the “Stat_Test_for_Two_Est” tab. In the second estimate column, put the 3 highest county estimate for all of the rows. This will ensure that the statistical results hold for all counties and not just the top 150. In addition, this method may be repeated for the 2nd highest estimate, 3rd highest estimate, etc. 4. Data Comparability When using the Statistical Testing Tool, it is important to only compare estimates that are measured in equivalent ways. Be sure to review survey documentation to ensure that the estimates are comparable. Variables may change over time, for example, and new question wording or response categories are occasionally introduced. Geographic boundaries can also change. For example, congressional districts, metro areas, counties and cities occasionally redraw boundaries based on changes in governing structures. Many statistical areas (like census tracts and block groups) are updated once per decade to reflect the most recent decennial census. The Census Bureau’s Comparing ACS Data Web page provides annually updated information about the year-to-year comparability of ACS data across different subjects. With each ACS data release, the Census Bureau also provides a list of Table and Geography changes that may affect data users’ ability to compare ACS data over time. 5. Worksheets in the Statistical Testing Tool The Excel-based Statistical Testing Tool is comprised of multiple sheets, or tabs: • Overview: Provides information on the importance of statistical testing and basic features of the Statistical Testing Tool. • Instructions: Provides brief step-by-step instructions on how to copy data into the tool and how to interpret results. • Statistical Testing for Two Estimates (Stat_Test_for_Two_Est): Use this tab to insert estimates and margins of error when comparing two estimates, such as comparing data across two time periods or two population subgroups. • Statistical Testing for Multiple Estimates (Stat_Test_for_Mult_Est): Use this tab to insert estimates and margins of error when comparing two or more estimates to each other, such as estimates for all counties in a state. • Worked Example: Provides step-by-step instructions with screenshots on how to use the tool for conducting statistical testing for multiple estimates. • Contact information (Contact_US): Provides contact information for help/questions at the Census Bureau as well as links to more information about the ACS. 6. Using the Statistical Testing Tool to Compare Two Estimates • Example: A city planner wants to understand how commuting patterns have changed over time. 4 • Data: 2019 ACS 1-year estimates, 2015 ACS 1-year estimates • Table: S0801: “Commuting Characteristics by Sex” • Variables: Number of workers, Means of transportation to work, mean travel time • Method of obtaining data: Copy-paste from data.census.gov A city planner in Henderson, NV wants to understand how commuting patterns have changed since 2015. The planner needs data on the number of workers, how they travel to work, and the average travel time to work in 2015 and 2019. They start their query by going to the Census Bureau’s data.census.gov Website and clicking on “Advanced Search” (see Figure 6.1). Figure 6.1 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They select “Geography,” then “Place” (see Figure 6.2). Figure 6.2 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. Within the Place menu, they select “Nevada” and then scroll down to click on “Henderson city, Nevada” (see Figure 6.3). Figure 6.3 5 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They select “Topics” from the filter menu. From here, they select “Employment” and click on “Commuting.” Then they click on “Search” in the lower right corner of the screen (see Figure 6.4). 6 Figure 6.4 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They Select Table S0801: “Commuting Characteristics by Sex” (see Figure 6.5). 7 Figure 6.5 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. Next, they select the desired year and data product for the comparison. By default, the table displays the most recent ACS data available, but for this example, the city planner first selects the 2015 ACS 1-Year Estimates Subject Tables from the drop-down menu (see Figure 6.6). Figure 6.6 8 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. Once the data are displayed in the table, they need to copy and paste the desired row labels, estimates, and margins of error into the Statistical Testing for Two Estimates tool. For this example, the city planner selects the labels and data for the total number of workers 16 years and over, means of transportation to work, and mean travel time to work. To copy cells, they hold a left-click on their mouse to highlight the desired cells. Then they right-click on the highlighted cells and have the option to “Copy” or “Copy with Headers” (see Figure 6.7). Either option will work with the Statistical Testing Tool. Figure 6.7 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. Then, they copy the desired labels, estimates, and margins of error into the Statistical Testing Tool (see Figure 6.8). 9 Figure 6.8 Source: U.S. Census Bureau, Statistical Testing Tool, <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. They repeat these steps to copy and paste the 2019 ACS estimates and margins of error into the next two columns in the spreadsheet (see Figure 6.9). Figure 6.9 Source: U.S. Census Bureau, Statistical Testing Tool, <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. The “Statistically Different?” Column displays the results from the statistical test, comparing the 2015 and 2019 commuting estimates for Henderson, NV. The results show that the number of workers ages 16 and over increased. The share of workers who traveled by car, truck, or van decreased, as well the share who drove alone. While the percent of workers who traveled by carpool appears to have increased from 7.7% in 2015 to 9.1% in 2019, this difference is not statistically significant. The share who traveled by public transportation increased, as did the share who worked from home. The mean travel time to work also increased. There were no statistically significant differences between the remaining estimates. 7. Using the Statistical Testing Tool to Compare Multiple Estimates • Example: A state policymaker in Pennsylvania is comparing child poverty rates across counties and wants to understand whether the rates are statistically different. • Data: 2016-2020 ACS 5-year estimates • Table: S1701: “Poverty Status in the Past 12 Months.” 10 • Variable: Percent of children under age 18 below poverty • Method of obtaining data: download zip files The Statistical Testing for Multiple Estimates tool can be used to compare many groups or geographies to each other. This tool is also useful for comparing ACS estimates that have been ranked from highest to lowest or lowest to highest. For example, a state policymaker in Pennsylvania is interested in understanding how child poverty rates differ across counties in Pennsylvania. These data can be obtained from data.census.gov and then copied into the Statistical Testing for Multiple Estimates tool to conduct statistical testing across all the counties in Pennsylvania. The policymaker begins by going to data.census.gov and typing “poverty” in the search bar. They select Table S1701: “Poverty Status in the Past 12 Months” (see Figure 7.1). This table provides the number and percent of people in poverty by selected social and demographic characteristics. Figure 7.1 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. Next, they click on the “Geos” icon (see Figure 7.2). 11 Figure 7.2 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They click on “County,” select the state of Pennsylvania, and then select “All counties within Pennsylvania.” Then, they click on the “X” in the upper right corner to return to the results page (see Figures 7.3 and 7.4). 12 Figure 7.3 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. 13 Figure 7.4 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. The estimates displayed in the table are from the 2016-2020 ACS 5-year data—the most recent data available at the time. To obtain these estimates in an easy-to-use format for the Statistical Testing for Multiple Estimates tab, the policymaker downloads the data using the “Zip” option (see Figure 7.5). 14 Figure 7.5 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They mark the box labeled “2020 ACS 5-Year Estimates Subject Tables” and then click on “Download .CSV” (see Figure 7.6). Figure 7.6 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They open the folder on their computer with the zipped data files. The data are stored in the file with “ACSST5Y2020.S1701_data_with_overlays” in the title (see Figure 7.7). They open this file. Figure 7.7 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. Next, they find the desired estimates from the downloaded data. They select the columns for the percent estimate and the associated margins of error for children living below the federal poverty line and hide or delete the other columns. 15 To make the results easier to read, they sort the data from lowest poverty rate to highest poverty rate (see Figures 7.8 and 7.9). Figure 7.8 Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. 16 Figure 7.9 Note: This figure displays only a portion of the columns and rows in the spreadsheet. Source: U.S. Census Bureau, data.census.gov, <https://data.census.gov/cedsci/>. They copy and paste the Geography Area Names into the column named “Label” in the Statistical Testing for Multiple Estimates tool (see Figure 7.10). They adjust the height and width of the rows and columns as needed to make the labels easier to read. 17 Figure 7.10 Note: This figure displays only a portion of the columns and rows in the spreadsheet. Source: U.S. Census Bureau, Statistical Testing Tool, <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. Next, they select the desired estimates from the downloaded data. They select the columns for the percent estimate and the associated margins of error for children living below the federal poverty line and paste the data into the Statistical Testing for Multiple Estimates tool in the “Estimates” and “Margin of Error (MOE)” columns (see Figure 7.11). Figure 7.11 Note: This figure displays only a portion of the columns and rows in the spreadsheet. Source: U.S. Census Bureau, Statistical Testing Tool, <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. The statistical testing results appear in the columns on the right side of the table. Results from the tests can be found by reading across a row or down a column for a particular county. For example, the first cell 18 for Bucks County is shaded gray and marked with an X; this is the marker for comparing the county to itself/not a comparison. The next three cells to the right are shaded red with “No.” This means that Butler County, Chester County, and Montgomery County all have child poverty rates that are not statistically different from Bucks County’s rate. Continuing across the row for Bucks County, the remaining cells are “Yes,” which means that the child poverty rate in Bucks County is statistically different than the child poverty rate in the remaining Pennsylvania counties at the 90% confidence level. 8. Additional Options You can change the confidence level by scrolling to the bottom of the spreadsheet and changing the value in the “Parameters” column to the desired level (e.g., 95 or 99) (see Figure 8.1). Figure 8.1 Source: U.S. Census Bureau, Statistical Testing Tool, <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. Changing the confidence level to 99 shows that at this higher level of confidence, the child poverty rate in Bucks County is no longer statistically different than the rates in Centre, Cumberland, Forest, Sullivan, or Union Counties. 19 Figure 8.2 Note: This figure displays only a portion of the columns and rows in the spreadsheet. Source: U.S. Census Bureau, Statistical Testing Tool <www.census.gov/programs- surveys/acs/guidance/statistical-testing-tool.html>. 9. For More Information Understanding and Using American Community Survey Data: What All Data Users Need to Know www.census.gov/programs-surveys/acs/library/handbooks/general.html This handbook provides an overview of the ACS to help data users understand the basics of the survey, how the data can be used, how to judge the accuracy of ACS estimates, and how to access ACS data. Comparing ACS Data www.census.gov/programs-surveys/acs/guidance/comparing-acs-data.html This Web page provides guidance on making valid comparisons between the latest ACS data and ACS data from the previous year, the 2000 Census, and the 2010 Census. Sample Size and Data Quality www.census.gov/acs/www/methodology/sample-size-and-data-quality This Web page describes the steps the Census Bureau takes to ensure that ACS data are accurate and reliable. It also includes several measures of ACS data quality for the nation and states. Code Lists, Definitions, and Accuracy www.census.gov/programs-surveys/acs/technical-documentation/code-lists.html View the detailed codes and definitions for variables, statistical testing, and an explanation of sample design, methodology, and accuracy for the ACS. 20

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/brockwebb/open-census-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server