This product serves as a central source of government statistics and reference data. It includes a collection of demographic, economic, government spending, and environmental time series data along with commonly used reference data about geographies and holidays. A single, unified schema joins together across the various publishing agencies. Where applicable, data is available at the national, state, county, and municipal levels.
Example topics covered:
-GDP
-Unemployment
-Household income
-US government contracts
-US Treasury fiscal data
-Population
-Crime and disease incidences
-Public holidays
The data is sourced from Data Commons, an aggregator of government data sources that powers contextual Google Search, the American Community Survey (ACS), the System for Award Management (SAM.gov), the US Census Bureau, Statistics Canada, the United Postal Service, and the Python-Holidays package on GitHub. Data Commons, itself, aggregates data from the US Bureau of Labor Statistics, the World Bank, the United Nations, the IMF, the CDC, and other sources.
A Streamlit of the data can be found here: https://app.cybersyn.com/datacommons/
All Cybersyn products follow the EAV (entity, attributes, value) model with a unified schema. Entities are tangible objects (e.g. geography, company). Entities may have characteristics (i.e. descriptors of the entity) in an index table and values (i.e. statistics, measure) in a timeseries table. Data is joinable across all Cybersyn products that have a GEO_ID. Visit docs.cybersyn.com for more details on Cybersyn’s unified schema.
The majority of the data, including that from Data Commons, centers around timeseries containing demographic, economic, government spending, and environmental statistics at national, state, county, and municipal levels. For example, population, GDP, temperature, and disease incidence data is all available. This data primarily revolves around geographic entities from the national, state, county, municipal, zip code, and census tract levels. Variable attributes can be joined to the time series data for additional metadata about the variables themselves (measurement category, units, frequency, etc.).
The geography reference data consists of an index of geographic entities at different levels (e.g., countries, cities, counties, census tracts, etc.); relationships between these geographies (e.g., which cities are contained within which counties); and the characteristics of those geographies (e.g., geospatial boundaries, coordinates, name abbreviations, etc.). The boundaries are sourced from the US Census and Statistics Canada.
The population data is sourced from Data Commons and the American Community Survey (ACS) published by the US Census Bureau. Data Commons aggregates population data from a range of government agencies and international organizations (e.g. World Bank, OECD). The American Community Survey is an ongoing survey that provides population information annually in the US. This is different from the US Census (also available in this dataset) which is published every 10 years. ACS data covers top-line US population figures and detailed population variables by sex, race, and age group for the overall country, states, counties, cities, zip codes, statistical areas, and census tracts. Both 1 and 5 year estimates and margins of error are provided beginning in 2021. 1 year estimates are based on 12 months of collected data (e.g. January 1, 2022 to December 31, 2022) and provided annually for geographies with a population of 20K+. This data has the smallest sample size but is most current. 5 year estimates are based on 60 months of collected data (e.g. January 1, 2018 to December 31, 2022) and provided annually for all geographies. This data is based on the largest sample size but is the least current.
The holiday reference data contains government-designated holidays for 119 countries, joinable to Cybersyn’s other geographic entities, as well as the financial market holidays for the European Central Bank (ECB) and NY Stock Exchange (NYSE).
US government contracts data, sourced from SAM.gov, revolves around two main entities: Contracts and Contract Awards. Contracts represent listings soliciting bids on goods and services that the Federal US government is seeking from contractors. Metadata about contracts includes the department of the Federal government that oversees the contract, the date the contract was originally posted, the deadline for response, and the location where the contract will be fulfilled. The contract_solicitation_id field can be used to find the original contract on sam.gov.
The US Treasury provides a daily overview of net federal revenue collections from income tax deposits, customs duties, fees for government services, fines, and loan repayments. These collections and the channel through which they are processed, such as mail, internet, banking, and over-the-counter transactions, are incorporated within this dataset.
Contract Awards represent accepted bids or solicitations from third-party contractors to fulfill a contract. Metadata about the award contract includes the name of the recipient of the award (business or individual), the value of the award, the date of the award, description of the award, and the primary contact from the government who awarded the contract. Note that the description of the contract award may differ from that of the original contract if the government reopened the contract or awarded multiple awards from a single original contract solicitation. The contract_award_id corresponds to the award number on SAM.gov and can be used to search for the award in the sam.gov portal.
Original data sources, table descriptions, and release frequency can be found at docs.cybersyn.com.
--------------------------------------------------------------------------------------
Disclaimer
The data in this dataset is sourced at docs.cybersyn.com. Links to provider license, terms and disclaimers are provided where appropriate:
-Data Commons: https://www.datacommons.org/disclaimers, https://creativecommons.org/licenses/by/4.0/
-Python-Holidays: https://github.com/vacanza/python-holidays/blob/master/LICENSE
-SAM.gov: https://sam.gov/content/about/disclaimers
-Statistics Canada: https://www.statcan.gc.ca/en/reference/terms-conditions
Cybersyn is not endorsed by or affiliated with any of these providers. Contact support@cybersyn.com for questions.