Changes between Version 25 and Version 26 of BluePrint/TextSearch


Ignore:
Timestamp:
04/20/13 17:44:03 (12 years ago)
Author:
Vishrut Mehta
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • BluePrint/TextSearch

    v25 v26  
    3838*    Proper understanding and the work model of S3Search(depricated) and S3Filter is required.
    3939
    40 *    Literature study of Apache Lucene and PyLucene. Getting familiar with '''PyLucene''' and deploy it into my local machine.
     40*    Literature study of Apache Lucene and Pylucene. Getting familiar with '''Pylucene''' and deploy it into my local machine.
    4141
    4242*    Studying the linkage of the Lucene daemon and web2py server.
     
    5050=== System Constraints ===
    5151
    52 *    The user should have PyLucene installed in there machine.
     52*    The user should have Pylucene installed in there machine.
    5353
    5454*    Also, while starting the web2py server, the Lucence deamon should also start.
     
    9090
    9191=== Wireframes ===
     92todo
    9293
    9394=== Technologies ===
     
    109110[[BR]]
    110111Solr is a platform that uses the Lucene library, the only time it may be preferable to use Lucene is if you want to embed search functionality into your own application. So I choose Lucene for indexing the documents and search string in those documents.
     112[[BR]]
     113Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
     114[[BR]]
     115Refer this for more information about its functtionalities:
     116[[BR]]
     117http://lucene.apache.org/core/
    111118
    112119== Implementation ==
     120
     121*    It consists of extending the usage of S3Filter to document search by creating new TextFilter field in the document search form as well as all other resources.
     122
     123*    When a user upload its document, it is indexed using the Lucene deamon, which will be running at background.
     124
     125*    As and when new document is uploaded or edited, it will be indexed, so as to search in it efficiently. Lucene provides a library which does its indexing and stuff efficiently.
     126
     127*    When a user enters a query, a request will be sent to the deamon and the deamon will search through the indexed documents and give the output search results.
     128
     129*    There is also Full-text search over different resources, which would need the resources in which the user wants to search for.
     130
     131*    This would be accomplished by using Pylucene, which is a wrapper on Apache Lucene in Python to carry out these tasks.
     132
     133*    After the response, the part which remains will be displaying the search results in a proper user friendly format.
     134[[BR]]
     135'''Future Implementation:'''
     136
     137*    UI is a secondary concern for how to display the search result. We could take inspiration from the Google and Bing! Search results for an attractive UI format.
    113138
    114139== References ==