Changes between Version 3 and Version 4 of BluePrint/DataRepository


Ignore:
Timestamp:
02/09/18 10:24:41 (7 years ago)
Author:
Fran Boon
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • BluePrint/DataRepository

    v3 v4  
    11= Data Repository =
    2 Data repository tools such as [http://ckan.org CKAN] are becoming popular within the humanitarian aid space as evidence by projects like [https://data.hdx.rwlabs.org/ HDX] and [http://www.data.gov/disasters/ Data.Gov's disaster portal].
     2
     3== Support for WebSetup ==
     4We would like to be able to have new Sahana installations be able to easily install relevant data, such as Locations (Hierarchy & Polygons, where available).
     5
     6Currently suitable data is maintained at [wiki:GIS/Data] but this requires CLI commands to make use of.
     7
     8[https://data.humdata.org HDX] has a lot of data however it isn't in a format suitable for import (indeed it isn't in any standard format which would be possible to create an import routine for) and also has no concept of canonical datasets, so there may be several to select between.
     9
     10The plan is to develop functionality to allow Sahana installations using [BluePrint/WebSetup WebSetup] to be able to select the countries for which they wish to install data and have this come in via API.
     11 * Sync could be used for this, but this means that we need to create a new repo on the central server for each remote client (which does have the benefit of giving us a log of where installs are requesting data for)
     12
     13== Data Catalog Module ==
     14Data repository tools such as [http://ckan.org CKAN] are becoming popular within the humanitarian aid space as evidence by projects like [https://data.humdata.org HDX] and [http://www.data.gov/disasters/ Data.Gov's disaster portal].
    315
    416These tools allow users to publish data sets and associate them with metadata that enables others to easily find them.  This is particularly useful for organizations that receive and produce lots of raw and refined data sets.  Many of these organizations are also collecting data sets that they will then integrate into their own information management systems. Sometimes the data they organize in their information management systems is also data they want to make available in a raw format via a data repository.