Changes between Version 1 and Version 2 of FerretVsLucene
- Timestamp:
- 10/11/06 00:53:31 (4 years ago)
FerretVsLucene
*Disclaimer*: I wrote these benchmarks myself and I am the developer of Ferret, so they may be slightly biased. I have submitted them to the Lucene mailing list so that Lucene developers can check their fairness. The numbers below are in no way an indication of the quality of either library. Lucene is currently a lot more stable than Ferret. I have run these benchmarks against Lucene because Ferret was originally ported from Lucene and is still very strongly influenced by that library. I also believe Lucene is the gold standard for information retrieval libraries.

All suggestions/comments/critiques are most welcome and should be directed to me at dbalmain.ml at gmail.com. Alternatively, just add your comments to this page.

[http://lucene.apache.org/java/docs/index.html Apache Lucene] can be downloaded [http://www.apache.org/dyn/closer.cgi/lucene/java/ here].

…

== Indexing performance ==

For the indexing benchmark we need to look at a few different situations. Most importantly, the following benchmarks look at performance both when storing the field with term-vectors and when storing neither the field nor term vectors. They also have options for reopening the !IndexWriter at regular intervals. I use a !WhiteSpaceAnalyzer so that analysis time will have little effect on the results. Here are the indexing benchmarking programs:

 * LuceneIndexingBenchmarker
 * FerretIndexingBenchmarker

I run each test 6 times and throw away the top and bottom results, so the !HotSpot warmup should have no effect on the Lucene results.

=== Unstored Without Term-Vectors ===

{{{
dbalmain@ubuntu:~/sandpit/benchmarks $ java -classpath lucene-core-2.0.0.jar:. -server -Xmx500M -XX:CompileThreshold=100 LuceneIndexer -reps 6
---------------------------------------------------
1 Secs: 37.96 Docs: 19043
2 Secs: 24.17 Docs: 19043
3 Secs: 23.19 Docs: 19043
4 Secs: 22.43 Docs: 19043
5 Secs: 21.23 Docs: 19043
6 Secs: 21.86 Docs: 19043
---------------------------------------------------
Lucene 2.0.0
JVM 1.5.0_06 (Sun Microsystems Inc.)
Linux 2.6.15-27-386 i386
Mean: 25.14 secs
Truncated mean (4 kept, 2 discarded): 22.91 secs
---------------------------------------------------
dbalmain@ubuntu:~/sandpit/benchmarks $ ruby ferret_indexer.rb --reps 6
------------------------------------------------------------
0 Secs: 6.18 Docs: 19043
1 Secs: 6.37 Docs: 19043
2 Secs: 7.25 Docs: 19043
3 Secs: 6.15 Docs: 19043
4 Secs: 6.15 Docs: 19043
5 Secs: 6.23 Docs: 19043
------------------------------------------------------------
Mean 6.39 secs
Truncated Mean (4 kept, 2 discarded): 6.23 secs
------------------------------------------------------------
}}}

=== Stored Without Term-Vectors ===
{{{
dbalmain@ubuntu:~/sandpit/benchmarks $ java -classpath lucene-core-2.0.0.jar:. -server -Xmx500M -XX:CompileThreshold=100 LuceneIndexer -reps 6 -store 1
---------------------------------------------------
1 Secs: 53.70 Docs: 19043
2 Secs: 37.56 Docs: 19043
3 Secs: 36.50 Docs: 19043
4 Secs: 34.90 Docs: 19043
5 Secs: 41.11 Docs: 19043
6 Secs: 34.32 Docs: 19043
---------------------------------------------------
Lucene 2.0.0
JVM 1.5.0_06 (Sun Microsystems Inc.)
Linux 2.6.15-27-386 i386
Mean: 39.68 secs
Truncated mean (4 kept, 2 discarded): 37.52 secs
---------------------------------------------------
dbalmain@ubuntu:~/sandpit/benchmarks $ ruby ferret_indexer.rb --reps 6 --store
------------------------------------------------------------
0 Secs: 12.47 Docs: 19043
1 Secs: 13.59 Docs: 19043
2 Secs: 12.50 Docs: 19043
3 Secs: 12.44 Docs: 19043
4 Secs: 12.60 Docs: 19043
5 Secs: 12.81 Docs: 19043
------------------------------------------------------------
Mean 12.74 secs
Truncated Mean (4 kept, 2 discarded): 12.60 secs
------------------------------------------------------------
}}}
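
For anyone who wants to check the summary figures, here is a minimal sketch of how the mean and truncated mean can be recomputed from the per-run timings listed above. It assumes "top and bottom results are thrown away" means dropping the single slowest and single fastest run out of six. The `truncated_mean` helper and the `runs` table are hypothetical names used only for illustration; they are not taken from either benchmarking program.

```ruby
# Hypothetical helper, not part of the benchmark programs:
# sort the timings, drop `discard_each_end` results from each end,
# and average what is left.
def truncated_mean(times, discard_each_end = 1)
  if times.size <= 2 * discard_each_end
    raise ArgumentError, "need more than #{2 * discard_each_end} samples"
  end
  kept = times.sort[discard_each_end...(times.size - discard_each_end)]
  kept.sum / kept.size
end

# Per-run timings in seconds, copied from the console output above.
runs = {
  "Lucene unstored" => [37.96, 24.17, 23.19, 22.43, 21.23, 21.86],
  "Ferret unstored" => [6.18, 6.37, 7.25, 6.15, 6.15, 6.23],
  "Lucene stored"   => [53.70, 37.56, 36.50, 34.90, 41.11, 34.32],
  "Ferret stored"   => [12.47, 13.59, 12.50, 12.44, 12.60, 12.81]
}

runs.each do |label, times|
  mean = times.sum / times.size
  puts format("%-16s mean %.2f secs, truncated mean %.2f secs",
              label, mean, truncated_mean(times))
end
```

Dropping the slowest run removes the first Lucene repetition, which is where most of the JIT warmup cost lands, so the truncated mean is the fairer figure to compare between the two libraries.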
