Context Navigation

Changes between Version 1 and Version 2 of BluePrint/DataRepository

Timestamp:: 12/26/14 04:49:14 (11 years ago)
Author:: devin
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

BluePrint/DataRepository

-              v1
+              v2
 Data Depository tools such as CKAN are becoming popular within the humanitarian aid space.
+Data repository tools such as [http://ckan.org CKAN] are becoming popular within the humanitarian aid space as evidence by projects like [https://data.hdx.rwlabs.org/ HDX] and [http://www.data.gov/disasters/ Data.Gov's disaster portal].
 They allow people share data sets within a searchable environment, making it easier for users to find raw data sets.
+These tools allow users to publish data sets and associate them with metadata that enables others to easily find them.  This is particularly useful for organizations that receive and produce lots of raw and refined data sets.  Many of these organizations are also collecting data sets that they will then integrate into their own information management systems. Sometimes the data they organize in their information management systems is also data they want to make available in a raw format via a data repository.
 This is particularly useful for organizations that receive lots of raw data, want to make that data available as quickly as possible to their stakeholders, while also integrating that data into their own information management system.
+Since Sahana produces the type of information management systems into which people want to integrate data they collect, it makes sense for Sahana to provide data repository functionality that would enable users to publish datasets and metadata that follows the [http://www.w3.org/TR/vocab-dcat/ DKAT standard] and is accessible via API.
+Since Sahana produces the type of information management systems into which people want to integrate raw data, it makes sense for it to support the process of making that raw data, and other data as well, available to Sahana users.
+It's likely this data would fall into a few categories:
+* raw datasets collect (ex. information about medical clinics collected by workers in the field)
+* polished datasets (ex. medical clinics from WHO)
+* datasets produced by the Sahana system (ex. all medical facilities being managed in the Sahana system)
+* documents and reports (ex. PDF of reports and supplemental spreadsheet information)
 The basic idea is to create a "data repository module" that would perform some of the key functions that CKAN does.  Namely:
+The basic idea is to create a "data repository module" that would perform some key functions:
 * Publish Data
 …
   * they can access metadata information via API
+Schema Ideas:
+Title
+Formats
+Author
+Date/Time Submitted
+Submitted through (channel)
+Date/Time Updated
+Updated By
+Purpose
+Permissions: Public, View Metadata, Private
+Status: New, Processing (+ manager), Integrate (+reference_link, +note)
+Manager
+Accessibility Note
+General Note
+Potential Schema:
+* Title
+* Data Formats
+* Original Author (individual, organization or group)
+* Date/Time Submitted
+* Submitted through (channel)
+* Date/Time Updated
+* Updated By (individual, organization or group)
+* Purpose
+* Permissions: Public, View Metadata, Private
+* Status: New, Processing (+ manager), Integrate (+reference_link, +note)
+* Manager (Sahana user managing this data set)
+* Accessibility Notes
+* General Notes
+* Change Log
+* Comments