What's in data.gov
From Data-gov Wiki
| Infobox (Essay) edit with form |
|---|
|
|
In this article, we run a quick survey on the datasets published at http://data.gov.
Contents |
background data
data.gov published a special dataset, Dataset 92, which contains the catalog metadata about all published datasets.
- the original catalog homepage is at http://www.data.gov/details/92
- the original catalog csv file is at http://www.data.gov/data_gov_catalog.csv.
- the converted RDF version is at http://data-gov.tw.rpi.edu/raw/92/data-92.rdf.
- you may browse the RDF dataset using tabulater following this link
Statistics translated RDF datasets
- There are currently 130 converted RDF datasets
- contributing 459430392 table entries
- contributing 5075144265 triples.
- contributing 7655 properties.
Statistics about original datasets at data.gov
data.gov hosts 970 Datasets:
- including 600 Raw Data Catalog
- including 343 Tool Catalog
format of datasets' access points
data.gov mentioned 968 Data files as the access points of the datasets:
- 217 datasets publishing feeds (RSS,ATOM): SPARQL results
- 442 datasets publishing csv/txt: SPARQL results
- 57 datasets publishing xml: SPARQL results
- 76 datasets publishing xls (MS Excel): SPARQL results
- 14 datasets publishing kml or kmz : SPARQL results
- 22 datasets publishing ESRI shape format: SPARQL results
tag cloud of the keywords
sources of datasets
the datasets are contributed by 2 US government agencies. Following is a list sorted by the number of their contributed datasets.
- Data.gov (944)
- Environmental Protection Agency (319)
- Department of Defense (194)
- Department of Health and Human Services (87)
- Department of the Treasury (69)
- Department of Commerce (47)
- Department of Homeland Security (44)
- Department of the Interior (36)
- Department of Labor (35)
- Institute of Museum and Library Services (18)
- Executive Office of the President (12)
- National Archives and Records Administration (11)
- Department of Justice (10)
- Department of Education (8)
- Department of Transportation (7)
- National Transportation Safety Board (7)
- Department of Energy (7)
- US Consumer Product Safety Commission (6)
- Department of Housing and Urban Development (6)
- Rensselaer Polytechnic Institute (6)
- Department of Agriculture (5)
- National Aeronautics and Space Administration (3)
- Department of State (3)
- National Science Foundation (2)
- General Services Administration (2)
- Social Security Administration (2)
- Office of Management and Budget (2)
- Railroad Retirement Board (2)
- Stony Brook University (1)
- University of Pennsylvania (1)
- Michigan State University (1)
- Small Business Administration (1)
- Washington University in St Louis (1)
- Princeton University (1)
- Northwestern University (1)
- National Center for Education Statistics (0)
- US Fish and Wildlife Service (0)
- United States Department of Energy (0)
- Federal Bureau of Investigation (0)
- OpenLink Software (0)
- National Oceanic and Atmospheric Administration (0)
- University of Leipzig (0)
- Bureau of Justice Statistics (0)
- US Patent and Trademark Office (0)
- National Weather Service (0)
- National Park Service (0)
- Agency for Healthcare Research and Quality (0)
- Energy Information Administration (0)
- Bureau of Transportation Statistics (0)
- National Center for Health Statistics (0)
Facts about What's in data.govRDF feed
| Dc:creator | Li Ding +, and Dominic DiFranzo + |
| Dc:description | statistical and analytical survey of the data.gov datasets |
| Dcterms:created | June 26,2009 |
| Foaf:name | What's in data.gov |

