
Version 6 (modified by Shiv Deepak, 14 years ago)


Sahana Eden Optical Character Recognition

The Sahana OCR module generates an OCR-able PDF form for every resource form available in HTML. Two interfaces are available, depending on the use case.

  1. Single Form Upload: Consider this interface when staff are available to upload and verify the OCR-ed data individually. The user uploads a scanned form, lets Eden OCR it, then verifies the data and updates the database with the new record.
  1. Bulk Form Upload: Consider this interface when there are not enough staff to upload and verify the OCR-ed data individually. Scanned forms are uploaded in bulk and verified later.

Phases

The workflow of the OCR module consists of two phases:

  1. Download PDF Form
  2. Upload Scanned Form

Download PDF Form

A PDF form for a resource can be downloaded using the buttons in the create/update UI of any resource. Alternatively, it can be downloaded directly from links of the following form.

http://127.0.0.1:8000/eden/modulePrefix/moduleSuffix/create.pdf

and

http://127.0.0.1:8000/eden/modulePrefix/moduleSuffix/recordId/componentName/create.pdf
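The two link patterns above can be assembled programmatically. The following is a minimal sketch, not part of Eden itself: the function name `pdf_form_url` is hypothetical, and the example values `pr`, `person` and `address` are only illustrative stand-ins for modulePrefix, moduleSuffix and componentName.

```python
from typing import Optional


def pdf_form_url(base: str,
                 module_prefix: str,
                 module_suffix: str,
                 record_id: Optional[int] = None,
                 component_name: Optional[str] = None) -> str:
    """Build a create.pdf link following the two patterns shown above.

    Hypothetical helper: it only joins path segments and does not
    contact the server or check that the resource exists.
    """
    # The component form of the link needs both a record id and a component name.
    if (record_id is None) != (component_name is None):
        raise ValueError("record_id and component_name must be given together")

    parts = [base.rstrip("/"), module_prefix, module_suffix]
    if record_id is not None:
        parts += [str(record_id), component_name]
    parts.append("create.pdf")
    return "/".join(parts)


if __name__ == "__main__":
    base = "http://127.0.0.1:8000/eden"
    # Illustrative values only; substitute the resource you are working with.
    print(pdf_form_url(base, "pr", "person"))
    print(pdf_form_url(base, "pr", "person", 5, "address"))
```

Keeping the record id and component name as an all-or-nothing pair mirrors the two patterns in the text, which show either neither or both.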


http://eden.sahanafoundation.org/raw-attachment/wiki/UserGuidelines/OCR/Screenshot-4.png

Upload Scanned Form

A scanned form can be uploaded in two different ways.

1. Individual Upload


http://eden.sahanafoundation.org/raw-attachment/wiki/UserGuidelines/OCR/Screenshot-3.png

For this use case, the user uploads the scanned OCR form through the web UI. The scanned form can be either a single PDF file or multiple image files, with one file per page. The OCR upload button is available in the create/update UI of any resource. Alternatively, the form can be uploaded directly from links of the following form.

http://127.0.0.1:8000/eden/modulePrefix/moduleSuffix/upload.pdf

and

http://127.0.0.1:8000/eden/modulePrefix/moduleSuffix/recordId/componentName/upload.pdf


http://eden.sahanafoundation.org/raw-attachment/wiki/UserGuidelines/OCR/Screenshot-5.png

There are two options for uploading an OCR form:

  1. Upload one image per page.


http://eden.sahanafoundation.org/raw-attachment/wiki/UserGuidelines/OCR/Screenshot-6.png

  1. Upload the scanned OCR form as a single PDF.


http://eden.sahanafoundation.org/raw-attachment/wiki/UserGuidelines/OCR/Screenshot-7.png
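Before uploading, it can help to check that a set of scanned files matches one of the two options above: a single PDF, or one image per page. The sketch below is only an illustration and is not Eden code. The function name `classify_upload` is hypothetical, and the list of image extensions is an assumption, since this page does not state which image formats the upload form accepts.

```python
import os

# Assumed set of image extensions; the formats Eden actually accepts
# are not stated on this page.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def classify_upload(filenames):
    """Return "pdf" for a single PDF, or "images" for one image per page.

    Raises ValueError for anything else (no files, several PDFs,
    or a mix of PDFs and images).
    """
    if not filenames:
        raise ValueError("no files given")

    extensions = [os.path.splitext(name)[1].lower() for name in filenames]

    if extensions == [".pdf"]:
        return "pdf"
    if all(ext in IMAGE_EXTENSIONS for ext in extensions):
        return "images"
    raise ValueError("expected a single PDF or one image file per page")


if __name__ == "__main__":
    print(classify_upload(["form.pdf"]))
    print(classify_upload(["page1.png", "page2.png"]))
```

The check looks only at file names, so it will not catch a file whose content does not match its extension.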

2. Bulk Upload

TODO
