He also mentions how the proposed Digital Accountability and Transparency Act of 2011 (DATA Act) would establish consistent data elements and standards for federal financial information to assure comparability and reliability in reported information and how recipient reporting through federalreporting.gov is the most cutting-edge feature of the transparency process and should be an integral part of federal spending accountability.
The Recovery.gov White Paper claims "the result is the current incarnation of Recovery.gov-which, as anyone who has spent significant amounts of time scouring government websites for information will tell you-is perhaps the clearest, richest interactive database ever produced by the American bureaucracy."
I say good, but first show us all of the data so we can do some real data analysis.
Since the quality and completeness of this data has been questioned, I decided to see for myself by assessing the amount of missing data in each of the 98 columns and by agency.
The scatterplot created in a data analytic software called Spotfire shows in the illustration above, there are considerable number of columns that have far less data in each row than one would expect. (The actual chart extends the plot vertically to more than 500,000 reports, which shows that all reports have at least some data.)
The X axis represents the 98 data elements columns that agencies are suppose to report, such as funding agency name, fiscal year, recipient name, recipient zip code, etc. The Y axis represents the individual reports, assembled by row, of each of 517,589 filings.
Complete data reporting would show a solid line across the top of the graph while no reporting would show a solid line across the bottom of the graph. There is more of the later than the former so show us all of the missing data. The complete data dictionary is given here.
Fully complete data would be a straight line across the top of this graph at 517,589 rows of data. The actual counts are shown in the Data Elements and Number of Rows table in my Spotfire summary. The filters can be used to select an agency like EPA and see the "count of rows by column" in the summary table. One can then reset the filters and select another agency.
In trying this, one sees that only eight of the 98 columns of data elements have been completed for all 517,589 filings (or rows of data.) For example, my former agency, EPA, reports filed 956 reports (rows of data), but filled in only 47 of the 98 data elements (columns.)
This applications succeeds by bringing all of the data into memory on the Web, something the Recovery.gov Web applications does not do, so it can be visualized, sorted, and searched. The data file is about 300 MB and the Spotfire file is about 110 MB.
Brand Niemann is former senior enterprise architect, U.S. Environmental Protection Agency; director and senior data scientist, Semantic Community. He previously Built Recovery.gov in the Cloud where the user can enter their ZIP Code and get results specific for their location.
Recovery.gov: A Good Start But Show Us All the Missing Data
Published: September 8, 2011
Recovery.gov is the U.S. government's official website that provides easy access to data related to Recovery Act spending and allows for the reporting of potential fraud, waste, and abuse. My AOL colleague, Richard Walker wrote recently about how Recovery.gov "Shows The Power Of Transparency In Tracking Federal Spending" since the Recovery Accountability and Transparency Board [RAT Board] has provided "a commendable model of transparency... the tremendous success of the RAT Board is worthy of replication throughout the federal bureaucracy."
In this article
Topics
People
Recent Activity
-
Amazon's Cloud Cover Expanding2012-05-16T14:00:00Z -
Make Government And Health Data Easier To Use2012-05-16T13:05:00Z -
Boehner: Spending Cuts Must Top Debt Ceiling Increase2012-05-16T12:00:00Z -
Hackers Pose Costly Future For Military Jets Warns Cartwright2012-05-16T11:28:00Z -
Innovation At NIH: Donald Lindberg, Senior Statesman For Medicine And Computers2012-05-16T08:54:00Z -
'See' Your Snail Mail: USPS Pursues Mobile, Bar Code Technologies2012-05-15T16:15:00Z -
Government's Virtual Worlds Go On Display2012-05-15T14:35:00Z -
FedRAMP Cloud Security Program Names First Accredited Assessors2012-05-15T09:25:00Z -
Technology's Footprint On Education - 'Federal Spending' Episode 122012-05-15T08:39:00Z -
Verizon Chief Calls On DoD To Explore Sharing Wireless Spectrum2012-05-14T14:49:00Z -
Mobile Milestones: USDA Apps Allow Efficiency In The Field2012-05-14T13:30:00Z -
Smokey Bear Is Back2012-05-14T13:29:00Z -
Cyber Intelligence: Sharing Intelligence With Infrastructure Providers2012-05-14T11:33:00Z -
Federal Consortium of Virtual Worlds (FCVW) Conference2012-05-14T11:20:00Z -
New Group Eyes Wireless Power Ecosystem2012-05-14T08:44:00Z -
This is true.One policy for advice doesn't answer the...2012-05-16T11:32:46Z -
"Arthur said he still has to prove that virtual conferences...2012-05-11T12:08:37Z -
Freemasons, boy.
2012-05-11T00:56:17Z -
There is no such thing as a secure computer on ANY...2012-05-10T16:29:47Z -
Not sure how "ignorant" draison is. However, you've just...2012-05-10T16:15:43Z -
hacked by nearly every country probably everyday ,...2012-05-10T16:06:11Z -
...and yet the politicians act like this is no big deal by...2012-05-10T15:50:55Z -
Hey, bolthead, have you followed the litany of failures of...2012-05-10T15:43:12Z -
If you don't believe that is true, it is indeed you that is...2012-05-10T15:35:42Z -
I work for a Anti-virus company any Draison is right. we...2012-05-10T14:59:25Z -
Ironic reply from you Axis. Obviously you are ignorant to...2012-05-10T14:56:03Z -
"Under Obama, we're getting fleeced by the Russian Business...2012-05-10T14:49:46Z -
Well said.
2012-05-10T14:42:20Z -
What does that mean? We're under constant cyber attack from...2012-05-10T14:41:50Z -
I saw this guy speak at Virtual Edge Summit...2012-05-10T09:18:04Z -
I agree that virtual environments are the next evolutionary...2012-05-10T08:57:55Z -
Overall, good article. The lead-in could be perceived as...2012-05-09T21:53:57Z -
"With the spectacle of the out-of-control GSA...2012-05-09T20:50:32Z -
The "web" is not the internet. It is an application that...2012-05-08T21:35:23Z -
If you only knew how ignorant you are... :(
2012-05-08T21:34:24Z
Featured Videos
Industry Discussions
Industry Headlines
Our Partners
Close
Your Settings



The Texas Department of Motor Vehicles (TxDMV) recently completed a massive systemwide technology upgrade, deploying 2,600 new PCs across the state's 254 counties.
Unsure if your local government website is resonating with citizens?