Visualizar'09: submitted databases

CrocTail: information about U.S. corporations and their subsidiaries

CrocTail is a corporate subsidiary browser recently released by CorpWatch. It uses parsers to extract the information from U.S. Securities and Exchange Commission filings. The raw data are made available via API calls or as raw database dumps.

Wikipedia research dumps by Libresoft

These dumps are based on research by Felipe Ortega, who has been studying the change records of Wikipedia for some years and has performed some of the most comprehensive quantitative analyses of how its main parameters (number of authors, number of changes, number of new articles, etc.) evolve over time.

Wikipedia research dumps have been created using WikiXRay, our tool to automate the analysis of any language version of Wikipedia. They are compressed mysqldump files, which can easily be loaded into any local MySQL database for research purposes.
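As a rough illustration of loading such a dump, here is a minimal Python sketch. It rests on several assumptions that are not stated above: that the dump is gzip-compressed, that the standard mysql command-line client is installed, that the target database already exists, and that credentials are supplied through the client's own option file. The function names, the file name, the database name and the user name are all hypothetical.

```python
import gzip
import shutil
import subprocess
from typing import BinaryIO, List


def mysql_command(database: str, user: str) -> List[str]:
    """Build the argument list for the standard mysql command-line client."""
    return ["mysql", f"--user={user}", database]


def stream_dump(dump_path: str, sink: BinaryIO) -> None:
    """Decompress a dump (assumed to be gzip) and copy the SQL into sink."""
    with gzip.open(dump_path, "rb") as dump:
        shutil.copyfileobj(dump, sink)


def load_dump(dump_path: str, database: str, user: str) -> None:
    """Pipe the decompressed SQL into the mysql client's standard input."""
    proc = subprocess.Popen(mysql_command(database, user), stdin=subprocess.PIPE)
    try:
        stream_dump(dump_path, proc.stdin)
    finally:
        proc.stdin.close()
    if proc.wait() != 0:
        raise RuntimeError("mysql client exited with an error")
```

A call such as load_dump("dump.sql.gz", "wikipedia_research", "researcher") would then stream the file into the database without first writing an uncompressed copy to disk. If the dumps use a different compression format, only stream_dump would need to change.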

More info and downloads:

FLOSSMetrics project by Libresoft

The main objective of FLOSSMetrics is to construct, publish and analyse a large-scale database of information and metrics about libre software development, drawn from several thousand software projects and built with existing methodologies and tools. The project will also provide a public platform for validation and industrial exploitation of results.

CKAN

CKAN is the Comprehensive Knowledge Archive Network, a registry of open knowledge packages and projects (and a few closed ones). CKAN is the place to search for open knowledge resources as well as register your own.

CKAN's aim is to make it easy to find, share and reuse open content and data, especially in ways that are machine-automatable.

