Hear us Roar
Article:
 |
|
Data Mining Email
|
| Subject: |
|
8K Limit, OpenFTS |
| Date: |
|
2004-04-10 07:50:33 |
| From: |
|
agliodbs
|
|
|
Robert,
Two things for your readers:
The 8K limit on text fields has been fixed for 3 years, so with any reasonably current version of PostgreSQL it's no longer necessary to use large objects for the message body or translated Word doc.
Rather than using Regexes, the current waw to so this would be to use OpenFTS (openfts.sourceforge.net) to do ranked word searches on the message subject, body, and attachment.
All in all, thanks for the article and I look forward to tinkering with the tools you mention for my own personal store of 30,000 messages!
-Josh Berkus
|
|
| |