Version 127 (modified by Fran Boon, 14 years ago)

User Guidelines for GIS Data

Assumes installation of the relevant tools: InstallationGuidelinesGISData

Import Data

e.g. PakistanUnionCouncils

UUIDs

See: UUID

CSV

There is a function available in modules/s3/s3gis.py to import from CSV.

The CSV needs to have specific columns:

  • WKT column if we have polygon info (or points for L4)
  • For L1, we need these columns: ADM0_NAME, ADM1_NAME (& WKT)
  • For L2, we need these columns: ADM1_NAME, ADM2_NAME (& WKT) [ADM0_NAME can also be used to help separate duplicates]
  • For L3, we need these columns: ADM2_NAME, ADM3_NAME (& WKT) [ADM1_NAME can also be used to help separate duplicates]
  • CODE column is read, if present
  • Ensure that names are consistent between Levels
  • The PROPER() spreadsheet function is useful to get the names in the correct format (then Paste as Text).
  • The VLOOKUP() spreadsheet function is useful if the different levels of the hierarchy are in different sheets & linked via a code instead of the name (as we need)
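
Before running an import it can help to check that a CSV has the columns listed above. The following is a minimal sketch for Python 3, not part of s3gis.py: the `missing_columns` helper and the `REQUIRED_COLUMNS` table are hypothetical names, and it assumes the header names must match exactly (including case).

```python
import csv
import io

# Required columns per level, as listed above. WKT is left out because the
# list only asks for it when polygon (or point) data is available.
REQUIRED_COLUMNS = {
    "L1": ["ADM0_NAME", "ADM1_NAME"],
    "L2": ["ADM1_NAME", "ADM2_NAME"],
    "L3": ["ADM2_NAME", "ADM3_NAME"],
}


def missing_columns(csv_file, level):
    """Return the required column names that are absent from the CSV header."""
    reader = csv.reader(csv_file)
    header = next(reader, [])  # an empty file yields an empty header
    present = set(name.strip() for name in header)
    return [name for name in REQUIRED_COLUMNS[level] if name not in present]


# Usage with an in-memory file; a file opened with open(path) works the same way.
sample = io.StringIO("ADM1_NAME,ADM2_NAME,WKT\nPunjab,Jhang,POINT(72.3 31.3)\n")
print(missing_columns(sample, "L2"))
```

An empty list means the header has everything the chosen level needs; anything returned should be added or renamed in the spreadsheet first.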

Basic Hierarchy can often be found from Wikipedia (although currently there's no easy way to download this - a student project to enhance Wikipedia for this would be much appreciated!).

For the Polygon data, it is normal to get this from Shapefiles (see below).

Example for Pakistan:

tablename = "gis_location"
table = db[tablename]
db.executesql("DROP INDEX name__idx on %s;" % tablename)
# L0
db(table.name != "Pakistan").delete()
db.commit()
import csv
csv.field_size_limit(2**20 * 10)  # 10 megs
db.import_from_csv_file(open("L0.csv", "rb"))
db.commit()
# L1
gis.import_csv("pak_adm1.csv", check_duplicates=False)
db.commit()
# L2
db(table.name == "Baluchistan").update(name="Balochistan")
db(table.name == "Northern Areas").update(name="Gilgit Baltistan")
db(table.name == "N.W.F.P.").update(name="Khyber Pakhtunkhwa")
db(table.name == "F.A.T.A.").update(name="FATA")
db(table.name == "F.C.T.").update(name="Islamabad")
db(table.name == "Azad Kashmir").update(name="AJK")
gis.import_csv("pak_adm2.csv", check_duplicates=False)
db(table.name == "Sind").update(name="Sindh")
db(table.name == "AJK").update(name="Pakistan Administered Kashmir")
db(table.name == "FATA").update(name="Federally Administered Tribal Areas")
db((table.name == "Islamabad") & (table.level == "L1")).update(name="Federal Capital Territory")
db.commit()
# L3
db(table.name == "Jaccobabad").update(name="Jacobabad")
db(table.name == "Tando Allahyar").update(name="Tando Allah Yar")
db(table.name == "Qambar Shahdad kot").update(name="Qambar Shahdadkot")
gis.import_csv("pak_adm3.csv", check_duplicates=False)
db(table.name == "Islamabad").update(name="Islamabad Capital Territory")
db(table.name == "Tando Allah Yar").update(name="Tando Allahyar")
db(table.name == "Qambar Shahdadkot").update(name="Qambar Shahdad Kot")
db(table.name == "Leiah").update(name="Layyah")
db(table.name == "Leiah Tehsil").update(name="Layyah Tehsil")
db(table.name == "Kalur Kot Tehsil").update(name="Kallur Kot Tehsil")
db(table.name == "De-excluded Area").update(name="Tribal Area")
db(table.name == "De-excluded Area D.g Khan").update(name="Tribal Area")
db.commit()
# L4
db(table.name == "Noorpur Tehsil").update(name="Noorpur Thal Tehsil")
jhang = db((table.name == "Jhang") & (table.level == "L2")).select(table.id, limitby=(0, 1)).first().id
table.insert(name="Ahmadpur Sial", parent=jhang, level="L3", url="http://en.wikipedia.org/wiki/Ahmedpur_Sial_Tehsil")
gis.import_csv("punjab_l4.csv", check_duplicates=False)
db.commit()
db(table.name == "Mirwah Taluka").update(name="Thari Mirwah Taluka")
db(table.name == "Shah Bunder Taluka").update(name="Shah Bandar Taluka")
badin = db((table.name == "Badin") & (table.level == "L2")).select(table.id, limitby=(0, 1)).first().id
table.insert(name="Talhar", parent=badin, level="L3", url="http://en.wikipedia.org/wiki/Talhar")
jamshoro = db((table.name == "Jamshoro") & (table.level == "L2")).select(table.id, limitby=(0, 1)).first().id
table.insert(name="Manjhand Taluka", parent=jamshoro, level="L3", url="http://en.wikipedia.org/wiki/Jamshoro_District")
gis.import_csv("sindh_l4.csv", check_duplicates=False)
db.commit()
db(table.name == "F.r Kala Dhaka").update(name="F.R. Kala Dhaka")
db(table.name == "Martoong Tehsil").update(name="Martung Tehsil")
db(table.name == "Takhat Nasrati Tehsil").update(name="Takht-e-Nasrati Tehsil")
dikhan = db((table.name == "D. I. Khan") & (table.level == "L2")).select(table.id, limitby=(0, 1)).first().id
table.insert(name="Daraban Tehsil", parent=dikhan, level="L3")
table.insert(name="Paroa Tehsil", parent=dikhan, level="L3")
lowerdir = db((table.name == "Lower Dir") & (table.level == "L2")).select(table.id, limitby=(0, 1)).first().id
table.insert(name="Adenzai", parent=lowerdir, level="L3")
table.insert(name="Balambat", parent=lowerdir, level="L3")
table.insert(name="Khal", parent=lowerdir, level="L3")
table.insert(name="Lal Qila", parent=lowerdir, level="L3")
table.insert(name="Munda", parent=lowerdir, level="L3")
table.insert(name="Samar Bagh", parent=lowerdir, level="L3")
table.insert(name="Tazagram", parent=lowerdir, level="L3")
table.insert(name="Timargara", parent=lowerdir, level="L3")
upperdir = db((table.name == "Upper Dir") & (table.level == "L2")).select(table.id, limitby=(0, 1)).first().id
table.insert(name="Barawal Tehsil", parent=upperdir, level="L3")
table.insert(name="Chapar Tehsil", parent=upperdir, level="L3")
table.insert(name="Dir Tehsil", parent=upperdir, level="L3")
table.insert(name="Khal Tehsil", parent=upperdir, level="L3")
table.insert(name="Kalkot Tehsil", parent=upperdir, level="L3")
table.insert(name="Wari Tehsil", parent=upperdir, level="L3")
gis.import_csv("khyber_l4.csv", check_duplicates=False)
db.commit()
# L5
gis.import_csv("punjab_l5.csv", check_duplicates=False)
gis.import_csv("sindh_l5.csv", check_duplicates=False)
gis.import_csv("khyber_l5.csv", check_duplicates=False)
db.commit()
field = "name"
db.executesql("CREATE INDEX %s__idx on %s(%s);" % (field, tablename, field))

Shapefiles

Inspect the data using qGIS.

Use ogr2ogr to convert the data to CSV (first command) or GeoJSON (second command):

ogr2ogr -f CSV CSV TM_WORLD_BORDERS-0.3.shp -lco GEOMETRY=AS_WKT
ogr2ogr -f geojson TM_WORLD_BORDERS-0.3.json TM_WORLD_BORDERS-0.3.shp

If needing to reproject (e.g. for the Haiti Departements):

ogr2ogr -f CSV haiti_departments Haiti_departementes_edited_01132010.shp -s_srs EPSG:32618 -t_srs EPSG:4326 -lco GEOMETRY=AS_WKT

NB AS_WKT requires OGR v1.6+
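
If several Shapefiles need converting, the ogr2ogr calls above can be scripted. This is a rough Python 3 sketch that only uses the flags already shown; the function names `build_ogr2ogr_csv_command` and `convert` are hypothetical, and ogr2ogr is assumed to be on the PATH.

```python
import subprocess


def build_ogr2ogr_csv_command(source, destination, source_srs=None, target_srs=None):
    """Build the ogr2ogr argument list for a Shapefile-to-CSV conversion with WKT geometry."""
    command = ["ogr2ogr", "-f", "CSV", destination, source]
    if source_srs and target_srs:
        # Reproject only when both coordinate systems are given.
        command += ["-s_srs", source_srs, "-t_srs", target_srs]
    command += ["-lco", "GEOMETRY=AS_WKT"]
    return command


def convert(source, destination, **srs):
    """Run ogr2ogr; raises subprocess.CalledProcessError if the conversion fails."""
    subprocess.check_call(build_ogr2ogr_csv_command(source, destination, **srs))
```

For the Haiti case this would be called as convert("Haiti_departementes_edited_01132010.shp", "haiti_departments", source_srs="EPSG:32618", target_srs="EPSG:4326").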

Sources

L0:

L1:

L2:

KML

Can convert a KML to CSV using the attached script: python KML2WKT.py <filename>.kml

This can then be imported into Sahana by editing the column headers & using gis.import_csv(<filename>.csv)
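
For readers without the attached script, the idea behind a KML-to-WKT conversion can be sketched as follows. This is not KML2WKT.py itself but an illustrative Python 3 stand-in: it assumes the KML 2.2 namespace, handles only Points and the outer ring of Polygons that sit directly inside a Placemark, and all function names are hypothetical.

```python
import csv
import xml.etree.ElementTree as ET

# Assumption: the file uses the KML 2.2 namespace; older files may use another one.
KML_NS = "{http://www.opengis.net/kml/2.2}"


def coordinates_to_wkt_pairs(text):
    """Turn KML 'lon,lat[,alt] lon,lat[,alt] ...' into WKT 'lon lat, lon lat, ...'."""
    pairs = []
    for token in text.split():
        parts = token.split(",")
        pairs.append("%s %s" % (parts[0], parts[1]))  # altitude is dropped
    return ", ".join(pairs)


def placemark_to_wkt(placemark):
    """Return WKT for a Placemark's Point or Polygon outer ring, else None."""
    point = placemark.find("%sPoint/%scoordinates" % (KML_NS, KML_NS))
    if point is not None and point.text:
        return "POINT(%s)" % coordinates_to_wkt_pairs(point.text)
    ring = placemark.find(
        "%sPolygon/%souterBoundaryIs/%sLinearRing/%scoordinates" % ((KML_NS,) * 4)
    )
    if ring is not None and ring.text:
        return "POLYGON((%s))" % coordinates_to_wkt_pairs(ring.text)
    return None


def kml_to_csv(kml_text, out_file):
    """Write one 'name,WKT' row per Placemark that has a supported geometry."""
    root = ET.fromstring(kml_text)
    writer = csv.writer(out_file)
    writer.writerow(["name", "WKT"])
    for placemark in root.iter(KML_NS + "Placemark"):
        wkt = placemark_to_wkt(placemark)
        if wkt is not None:
            name = placemark.findtext(KML_NS + "name", default="")
            writer.writerow([name, wkt])
```

The "name" header written here would still need editing to the ADMx_NAME column that matches the level being imported.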

qGIS can be used to convert this into a Shapefile (it uses ogr2ogr, so this can also be done from the CLI if you prefer): give the WKT column the header 'WKT'.

  • This is the easiest way to load into PostGIS (using pgAdmin III's Shapefile Importer plugin) to allow GeoServer to serve it as WMS

Geonames

There is an import_geonames() function in S3GIS which downloads/unzips the country file (a TAB-separated list) from http://download.geonames.org/export/dump/

It should be run for each level of the hierarchy that you wish to import. This is generally just the lowest level: Geonames only has Point data, so it's best to import the Polygons from other sources first, so that the Geonames importer can locate these Points within the correct Polygons of the hierarchy.

NB It takes some time to do this import! Pakistan imports 95000 locations!

Update: Geonames schema 2.2 supports parentADM(1-4): http://geonames.wordpress.com/2010/09/29/geonames-ontology-2-2/

  • will be good for when we only have hierarchy, not polygons
  • need to check whether much data has this populated though.

Python 2.5 doesn't support ZipFile.extract() & ZipFile.read() isn't unicode-safe. Until this is fixed, download the file manually first:

cd ~web2py/applications/eden/cache
wget http://download.geonames.org/export/dump/PK.zip
unzip PK.zip
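
On a newer Python, where ZipFile.extract() is available, the unzip step could be done from Python instead. A minimal sketch for Python 3, assuming the archive contains a member named after the country code (e.g. PK.txt); `extract_country_file` is a hypothetical helper, not part of S3GIS.

```python
import os
import zipfile


def extract_country_file(zip_path, country_code, dest_dir):
    """Extract <country_code>.txt from a Geonames country zip and return its path."""
    member = "%s.txt" % country_code
    with zipfile.ZipFile(zip_path) as archive:
        # Raises KeyError if the archive has no such member.
        archive.extract(member, dest_dir)
    return os.path.join(dest_dir, member)
```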

In Web2py CLI:

gis.import_geonames('PK', 'L5')
db.commit()

Alternate approach:

  1. Transform each line in this file into XML by regular expression:
    ^(\d*)\t([^\t]*)\t([^\t]*)\t([^\t]*)\t([0-9\.]*)\t([0-9\.]*)\t[^\t]*\t([A-Z]*).*
    
    into:
    
    <location>
           <id>$1</id>
           <name>$2</name>
           <asciiName>$3</asciiName>
           <localNames>$4</localNames>
           <lat>$5</lat>
           <lon>$6</lon>
           <featureClass>$7</featureClass>
    </location>
    
    

This can be done using an RE-capable editor (e.g. Kate), Perl or even Python. Note: & needs to be replaced with &amp; and any invalid characters need to be removed.

  2. Transform into S3XRC-XML using XSLT, stylesheet is available at
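
Step 1 of this alternate approach can be sketched in Python 3 as below. The pattern and the output layout are the ones given in step 1, with one change that is an assumption of this sketch: an optional minus sign is allowed in the latitude/longitude groups so that negative coordinates also match. The name `line_to_location_xml` is hypothetical. Escaping of & is handled, but removal of other invalid characters is not.

```python
import re
from xml.sax.saxutils import escape

# Pattern from step 1, plus an optional leading minus on lat/lon (assumption).
LINE_RE = re.compile(
    r"^(\d*)\t([^\t]*)\t([^\t]*)\t([^\t]*)\t(-?[0-9.]*)\t(-?[0-9.]*)\t[^\t]*\t([A-Z]*).*"
)

TEMPLATE = (
    "<location>\n"
    "    <id>%s</id>\n"
    "    <name>%s</name>\n"
    "    <asciiName>%s</asciiName>\n"
    "    <localNames>%s</localNames>\n"
    "    <lat>%s</lat>\n"
    "    <lon>%s</lon>\n"
    "    <featureClass>%s</featureClass>\n"
    "</location>"
)


def line_to_location_xml(line):
    """Convert one TAB-separated line into a <location> element, or None if it does not match."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    # escape() replaces &, < and > so the values are safe inside XML text.
    return TEMPLATE % tuple(escape(value) for value in match.groups())
```

Applying this to every line of the country file and wrapping the results in a single root element gives the XML input for the XSLT step.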

OpenStreetMap

See below: UserGuidelinesGISData

WFS

It is possible to use the WFS Plugin to get data into qGIS & thence export into other formats.

May need to use a Custom CRS (in Settings menu - remember to Save!) such as:

Can then go to the Layer Properties & Specify CRS to this User Defined Coordinate System.

Can then Save As and change the CRS to something like the standard WGS84.

Yahoo

Display Data

GeoServer

GeoServer can provide geospatial data in Raster (WMS) or Vector (WFS/KML) formats.

Once you have installed GeoServer (Linux, Windows), log in:

  • login: admin
  • password: geoserver

Configure:

Import Shapefiles

e.g. Country Outlines:

These can be loaded directly into GeoServer; however, performance will be better if they are imported into PostGIS first (pgAdmin III's Shapefile loader, on the Plugins menu, can also be used):

su postgres
shp2pgsql -s 4326 -I TM_WORLD_BORDERS-0.3.shp public.countries | psql -d gis

To reproject the data into 900913 for a slight performance advantage:

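-- Assumes the countries table loaded above with shp2pgsql, its default the_geom column, and the SRID constraint name that PostGIS 1.x creates by default
ALTER TABLE countries DROP CONSTRAINT enforce_srid_the_geom;
UPDATE countries SET the_geom = ST_Transform(the_geom, 900913);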

Configure GeoServer

Add WMS Layer to Sahana Eden

  • tbc

WMS Reprojection

  • Have a remote WMS source that you want to access?
  • Have a desire to keep OpenStreetMap/Google/Bing layers?
  • WMS source server doesn't support the 900913 projection?

e.g. TRMM Rainfall Monitoring

Solution: MapProxy

Grid

We have a 'Coordinate Grid' Layer available by default.

Other options:

Topographic Maps

Old Printed Maps

Old Printed Maps can be 'Rectified' to be overlaid on the base maps:

OpenStreetMap

Base Map

Out of the box, we can use OpenStreetMap Tiles as a base layer.

This can include local OSM sites (OSM Taiwan is included as an example)

Vector Overlays

Can have OSM Vectors displayed over the top of other Base Layers (e.g. Satellite Images)

Import

We have an XSLT stylesheet to import .osm files

e.g. for hospitals and clinics:

osmosis --read-xml country.osm --tf accept-nodes amenity=hospital,clinic --tf reject-ways --tf reject-relations --write-xml nodes.osm
osmosis --read-xml country.osm --tf reject-relations --tf accept-ways amenity=hospital,clinic --used-node --write-xml ways.osm
osmosis --rx nodes.osm --rx ways.osm --merge --wx country_hospitals.osm
http://myhost.com/eden/hms/hospital/create.osm?filename=country_hospitals.osm

This needs more work to understand the admin hierarchy properly to be able to import Places.

Geofabrik publish daily-updated extracts for Pakistan:

Otherwise pull a BBOX directly using Osmosis:

Osmosis requires Java. There are Python options for filtering based on tag, which would be more suitable for integration within Sahana; however, we would need to add Polygon filtering using Shapely:
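
To illustrate what tag-based filtering involves, here is a minimal sketch using only the Python 3 standard library. It is not one of the existing tools and does not do any Polygon filtering; `filter_nodes_by_tag` is a hypothetical name. It covers only the node step of the Osmosis commands above (ways are ignored) and reads the whole document into memory, so it is unlikely to suit a full country extract.

```python
import xml.etree.ElementTree as ET


def filter_nodes_by_tag(osm_xml, key, values):
    """Return a new <osm> element holding only the nodes tagged key=<one of values>."""
    source = ET.fromstring(osm_xml)
    result = ET.Element("osm", source.attrib)  # keep version/generator attributes
    for node in source.iter("node"):
        for tag in node.iter("tag"):
            if tag.get("k") == key and tag.get("v") in values:
                result.append(node)
                break  # one matching tag is enough
    return result


# Usage: keep hospitals and clinics, then serialise the result back to XML text.
# filtered = filter_nodes_by_tag(open("country.osm").read(), "amenity", {"hospital", "clinic"})
# ET.tostring(filtered, encoding="unicode")
```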

Ruby script to generate KML of recently-added locations by a group of users:

Basemap for Garmin GPS

PostgreSQL management

PostGIS functions

  • Centroids
    SELECT name, iso2, ST_AsText(ST_Transform(ST_Centroid(the_geom), 4326)) AS centroid FROM countries;
    

Data Sources

OGC (WMS/WFS)


GIS
