If you still remember the last round of our PDF iFilter battle, Foxit won it. In this round we bring in another challenger: the TET PDF iFilter. It is also available for x86 and x64, and it is free for non-commercial desktop use but needs a license for server installation.
Here are the new results for file set II:
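| iFilter | File Number | Total File Size (MB) | Avg File Size (MB) | Crawl Time (m:s) | Crawl Time (s) | Files Per Second | Success | Error |
|---------|-------------|----------------------|--------------------|------------------|----------------|------------------|---------|-------|
| Foxit   | 2676        | 2406                 | 0.90               | 7:46             | 466            | 5.74             | 2759    | 0     |
| Adobe   | 2676        | 2406                 | 0.90               | 40:58            | 2458           | 1.09             | 2757    | 2     |
| TET     | 2676        | 2406                 | 0.90               | 13:48            | 828            | 3.23             | 2752    |       |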
I also obtained an archive copy of People's Daily covering 2001 to 2006: about 20,000 PDF files, 13.4 GB in total. This set was tested on an 8-core Xeon box.
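| iFilter | File Number | Total File Size (MB) | Avg File Size (MB) | Crawl Time (h:m:s) | Crawl Time (s) | Files Per Second | Success | Error |
|---------|-------------|----------------------|--------------------|--------------------|----------------|------------------|---------|-------|
| Foxit   | 19890       | 13793                | 0.69               | 00:30:53           | 1853           | 10.73            | 19884   | 7     |
| Adobe   | 19890       | 13793                | 0.69               | 05:19:04           | 19144          | 1.03             | 19887   | 4     |
| TET     | 19890       | 13793                | 0.69               | 01:40:09           | 6009           | 3.31             | 19879   | 12    |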
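The "Crawl Time(s)" and "File Per Second" columns are derived from the file count and the clock-style crawl time. Here is a minimal Python sketch of that arithmetic, in case you want to rerun it on your own crawl logs. The function names `to_seconds` and `files_per_second` are made up for illustration, and rounding to two decimals is an assumption about how the figures were produced.

```python
def to_seconds(clock: str) -> int:
    """Convert a crawl time written as 'm:s' or 'h:m:s' into total seconds."""
    seconds = 0
    for part in clock.split(":"):
        # Each field to the right is worth 1/60 of the field before it.
        seconds = seconds * 60 + int(part)
    return seconds


def files_per_second(file_count: int, clock: str) -> float:
    """Throughput: files crawled divided by crawl time in seconds (2 decimals)."""
    return round(file_count / to_seconds(clock), 2)


if __name__ == "__main__":
    # Foxit figures from the two test sets above.
    print(to_seconds("7:46"), files_per_second(2676, "7:46"))
    print(to_seconds("00:30:53"), files_per_second(19890, "00:30:53"))
```

With the Foxit numbers this gives 466 s and 5.74 files/s for file set II, and 1853 s and 10.73 files/s for the People's Daily archive.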
And the licensing comparison for production (USD):
Summary
It is good to see another vendor join this market. TET showed good performance, although it is still behind Foxit. But it is licensed per server rather than per core, so the cost would be lower than Foxit's if you have a typical two-way quad-core box.