Changes between Version 20 and Version 21 of BluePrint/Importer
- Timestamp:
- 05/19/10 21:20:04 (15 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
BluePrint/Importer
v20 v21 36 36 If the templates can be saved out as XSLT then the [wiki:BluePrintSynchronisation Sync scheduler] can be used to do regular imports. 37 37 38 Some links that might be useful: 38 == Useful Links == 39 39 * Karma: a system for doing the Import/Clean/Integrate/Publish workflow through a UI paradigm of 'Programming by Demonstration' (instead of via Widgets): 40 40 * [ftp://ftp.umiacs.umd.edu/pub/louiqa/PUB2010/GeoNets_Shubham.pdf Presentation from ISCRAM 2010] … … 45 45 * http://developer.yahoo.com/yql/guide/ 46 46 * [http://www.unixuser.org/~euske/python/pdfminer/ PDFMiner] is an !OpenSource tool to convert PDF docs into text. 47 * [http://pyparsing.wikispaces.com Pyparsing] is an !OpenSource tool to parse textual content 48 * included in Sahana Eden's {{{modules}}} folder 47 49 48 * Code snippet to extract hyperlinks from HTML docs. 50 == Code snippets == 51 Extract hyperlinks from HTML docs: 49 52 {{{ 50 53 import sgmllib