Article:
  Introduction to Text Indexing with Apache Jakarta Lucene
Subject:   performance
Date:   2003-01-15 23:58:01
From:   anonymous2
I have been looking at Lucene for a couple of years, but I haven't tried it in a large-scale setting, so I wonder: how does it perform?


Any clues on how long it will take to search, say, 1000 documents?


How effective is the search algorithm compared to, for instance, AltaVista?


thanks


  • performance
    2004-04-30 22:36:27  ravitiru

    I am testing Lucene with 40 million articles. It is taking 4 to 5 days to index the articles, and the search time is around 2 to 5 seconds. I am still working on optimizing the index.
  • performance
    2003-01-16 13:34:59  drunk_injun

    Lucene performs extremely well, given the correct configuration. We run a few Lucene indexes, the largest of which contains well over 3 million records, and our site averages close to 2 million daily pageviews.

    Our average query response time (95th percentile) is ~80ms, and we serve thousands of requests per day using a RAM-based servlet configuration in Caucho Resin.

    The search algorithm is a standard TF-IDF algorithm (with some additional gravy), which can easily be extended or replaced because the source code is well designed. I would highly recommend this package, having used it over the past 3 years in a variety of applications.
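
    For anyone unfamiliar with TF-IDF, here is a rough sketch of the basic idea in plain Java. This is not Lucene's scoring code: the class name TfIdfSketch, the helper methods, and the smoothed log(N / (1 + df)) + 1 weighting are assumptions picked for illustration, and Lucene's real formula layers extra factors on top (the "gravy" mentioned above).

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of textbook TF-IDF; not taken from Lucene.
public class TfIdfSketch {

    // Term frequency: fraction of the document's tokens that equal the term.
    static double tf(List<String> doc, String term) {
        if (doc.isEmpty()) {
            return 0.0;
        }
        long count = doc.stream().filter(term::equals).count();
        return (double) count / doc.size();
    }

    // Smoothed inverse document frequency: terms found in fewer
    // documents receive a larger weight. The smoothing is an assumption.
    static double idf(List<List<String>> docs, String term) {
        long df = docs.stream().filter(d -> d.contains(term)).count();
        return Math.log((double) docs.size() / (1 + df)) + 1.0;
    }

    // Score of a single term against a single document.
    static double score(List<List<String>> docs, List<String> doc, String term) {
        return tf(doc, term) * idf(docs, term);
    }

    public static void main(String[] args) {
        // Tiny pre-tokenized corpus, made up for this example.
        List<List<String>> docs = Arrays.asList(
            Arrays.asList("lucene", "index", "search"),
            Arrays.asList("search", "engine"),
            Arrays.asList("apache", "lucene", "lucene"));

        for (int i = 0; i < docs.size(); i++) {
            System.out.println("doc " + i + " score for 'lucene': "
                + score(docs, docs.get(i), "lucene"));
        }
    }
}
```

    The document that mentions the term most often relative to its length ranks highest, and terms that appear in few documents count for more than common ones. A real index precomputes these statistics rather than scanning every document per query.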