Colectica Version 4 Improves Documentation of Big Data in Research

Minneapolis, MN - October 31, 2012

Documentation is a key aspect of understanding both public and private datasets. The new Colectica software is built to describe these statistical data using open standards metadata.

Colectica® is a suite of modern metadata management software that is used to document statistical datasets, along with the research methodologies used during their collection and aggregation. When datasets are disseminated there is often inadequate transparency of research methods. Colectica allows organizations to increase their data’s openness and credibility through standardized documentation of their data collection, research process and resulting data. The software is compatible with leading open standards including the Data Documentation Initiative (DDI) Lifecycle version 3 and ISO 11179. Using this software allows data producing organizations to both better describe their resultant data and increases the organization’s reputation for performing credible scientific research by also documenting methodology.

“Using Open Standards such as the Data Documentation Initiative Lifecycle version 3 allows an organization’s data to be described for later reuse.” said Dan Smith, a Partner at Colectica. He added “These XML based standards allow comparison of research done by different institutions, leading to greater comparability of the resulting datasets. Using Open Standards to describe the processes used to gather data also gives more credibility to the organization performing the research.”

Colectica version 4 adds a new Colectica RDF Services that allows publishing survey information as linked data on the semantic web. Publishing structured data, such as that created by Colectica, allows the descriptions of datasets to be interlinked. It enables data from different sources to be connected and queried, and become more useful. Colectica 4 also adds a new Workflow Service which allows an organization to manage the publication of its data documentation.

Colectica consists of several software tools. Colectica Designer enables documenting of survey methodologies, survey instruments, questions, variables, and datasets. Colectica Repository manages changes to the recorded information and allows multiple people and groups to work together. Colectica Portal allows for publication of this recorded information on the web. All of these products use Open Standards to allow for interoperability with other tools. Colectica products, training on Open Standards, and customized software solutions are available through .

About Colectica

Launched in 2010, Colectica® is the fastest way to design, document, and publish statistical research using Open Data standards. The Colectica Platform is an ideal solution for statistical agencies, survey research groups, public opinion research, data archivists, and other data centric collection operations that are looking to increase the expressiveness and longevity of the data collected through standards based metadata documentation. The company offers a range of highly specific products and services designed to give power to people through easy integration and access to data.


Colectica is a registered trademark of Colectica and/or its affiliates. Other names may be trademarks of their respective owners. ###