Introduction to Text Indexing with Apache Jakarta Lucene
Subject:   performance
Date:   2003-01-16 13:34:59
From:   drunk_injun
Response to: performance

Lucene performs extremely well, given the correct configuration. We run a few Lucene indexes, the largest of which contains well over 3 million records, and our site averages close to 2 million daily pageviews.

Our average query response time (95th percentile) is ~80ms, and we serve thousands of requests per day using a RAM-based servlet configuration in Caucho Resin.

The search algorithm is a standard TF-IDF algorithm (with some additional gravy), which can easily be extended or replaced because the source code is well designed. I would highly recommend this package, having used it over the past 3 years in a variety of applications.
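
For anyone unfamiliar with the term, here is a minimal sketch of textbook TF-IDF scoring. It is not Lucene's actual scoring code, which adds normalization and boost factors on top; the class name TfIdfSketch, its methods, and the toy corpus are all made up for illustration. Each query term contributes (count in the document) x ln(total documents / documents containing the term).

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical illustration of plain TF-IDF; not part of Lucene.
public class TfIdfSketch {

    // Number of times the term occurs in a tokenized document.
    static int termFrequency(String term, List<String> doc) {
        return Collections.frequency(doc, term);
    }

    // ln(N / df): terms found in fewer documents get a higher weight.
    // Returns 0 for terms that appear in no document at all.
    static double inverseDocumentFrequency(String term, List<List<String>> corpus) {
        int docFreq = 0;
        for (List<String> doc : corpus) {
            if (doc.contains(term)) {
                docFreq++;
            }
        }
        if (docFreq == 0) {
            return 0.0;
        }
        return Math.log((double) corpus.size() / docFreq);
    }

    // Sum of tf * idf over all query terms for one document.
    static double score(List<String> query, List<String> doc, List<List<String>> corpus) {
        double total = 0.0;
        for (String term : query) {
            total += termFrequency(term, doc) * inverseDocumentFrequency(term, corpus);
        }
        return total;
    }

    public static void main(String[] args) {
        // Toy corpus: each document is already split into lowercase tokens.
        List<List<String>> corpus = Arrays.asList(
            Arrays.asList("lucene", "index", "search"),
            Arrays.asList("servlet", "index", "cache"),
            Arrays.asList("lucene", "lucene", "query"));
        List<String> query = Arrays.asList("lucene", "query");

        for (int i = 0; i < corpus.size(); i++) {
            System.out.println("doc " + i + " score = " + score(query, corpus.get(i), corpus));
        }
    }
}
```

With this toy corpus the third document ranks first, since it contains "lucene" twice plus the rarer term "query", and the second document scores zero because it matches neither term.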

  • performance
    2003-04-25 02:02:19  anonymous2

    Hi drunk_injun,

    I am using Lucene for text searches, and I am trying to make the searches faster.
    I would appreciate some input on the configuration changes you have made to Lucene. What hardware configuration are you using?
    I would be very glad to receive your replies at

  • performance
    2003-01-16 13:36:25  drunk_injun

    Caveat to my last message: the response time of 80ms includes our own modifications to Lucene to enable additional sorting and filtering parameters. Raw query times generally average < 25ms.