To use search_attachments.module, you need the appropriate helper apps installed on the same computer that Drupal runs on. Each helper must print the extracted text to standard output; search_attachments currently cannot read extracted text that is saved to a file.
search_attachments version 5.x-4 and later comes with a built-in PHP helper for .txt and .csv files, so no external helper is necessary for those two types.
| File type | Helper | Platform(s) | Sample path |
|---|---|---|---|
| .doc | catdoc | *nix, Win | /usr/bin/catdoc %file% |
| .doc | wv | Win | c:\wvWare.exe %file% |
| .pdf | pdftotext | *nix, Win | c:\pdftotext\pdftotext.exe %file% - |
| .ppt | catppt | *nix, Win | /usr/local/bin/catppt %file% |
| .xls | xls2csv | *nix, Win | /usr/local/bin/xls2csv %file% |
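
To illustrate what the sample paths in the table mean in practice, here is a minimal Python sketch of expanding the `%file%` placeholder and capturing whatever the helper prints to standard output. It is not how search_attachments itself works internally (the module is written in PHP); `run_helper` and its parameters are hypothetical names, and the splitting rules assume *nix-style command lines rather than Windows paths with backslashes.

```python
import shlex
import subprocess


def run_helper(template, path, timeout=30):
    """Run a helper command template and return the text it prints to stdout.

    `template` is a command line such as "/usr/bin/catdoc %file%".
    Hypothetical sketch; assumes POSIX-style quoting in the template.
    """
    # Split into arguments first, then substitute the placeholder, so a
    # path containing spaces is still passed as a single argument.
    args = [arg.replace("%file%", path) for arg in shlex.split(template)]
    result = subprocess.run(args, capture_output=True, timeout=timeout)
    if result.returncode != 0:
        raise RuntimeError(
            "helper failed: " + result.stderr.decode("utf-8", errors="replace")
        )
    # Decode leniently: helpers do not all emit valid UTF-8.
    return result.stdout.decode("utf-8", errors="replace")
```

For example, `run_helper("/usr/bin/catdoc %file%", "report.doc")` would return the document text, assuming catdoc is installed at that path. The trailing `-` in the pdftotext sample path serves the same purpose: it tells pdftotext to write to standard output instead of a .txt file.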
If you have gotten any other helpers to work, I'd love to hear about them.
Comments
.ppt powerpoint *nix
.ppt powerpoint *nix /usr/local/bin/catppt %file%
.xls excel *nix /usr/local/bin/xls2csv %file%
xls2csv seems to rely on receiving a valid "header" from the file and will fail if it doesn't get one.
From memory, this has something to do with ISO coding <== don't ask me, I just read lots of stuff and can't remember half of it :|
tim
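
The comment above does not say which "header" xls2csv expects. One guess, and it is only a guess, is the OLE2 compound-file signature that legacy binary Office files (.xls, .doc, .ppt) start with. Under that assumption, a pre-check like the sketch below could skip files that merely carry an .xls extension, such as a CSV or HTML export. `looks_like_ole2` is a hypothetical name.

```python
# The 8-byte signature at the start of every OLE2 compound file.
OLE2_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")


def looks_like_ole2(path):
    """Return True if the file starts with the OLE2 compound-file signature."""
    with open(path, "rb") as handle:
        return handle.read(len(OLE2_SIGNATURE)) == OLE2_SIGNATURE
```

This only checks the container format. It cannot tell whether the encoding issue mentioned above is the real cause of a given failure.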
Awesome, thanks
Thanks Tim, I'll add these to the list.
Helpers - Antiword
MS Word can do a "fast save":
rather than writing a complete .doc, it just appends the changes to the end of the file.
Catdoc does not like this. From the error log:
httpd_error.log : [This was fast-saved 1 times. Some information is lost]
Antiword can read most of these files, which increases the number of files that get parsed.
It will extract text from Word 2, 6, 7, 97, 2000, 2002 and 2003 documents.
The only hassle seems to be that the last update was in 2005, though the website ( http://www.winfield.demon.nl/ ) has been updated more recently.
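
One way to combine the two .doc helpers is to try them in order and keep the first result that contains text. The sketch below is a hedged illustration, not part of search_attachments: `extract_with_fallback` is a hypothetical name, and it assumes that a helper which cannot parse a file either exits with a non-zero status or prints nothing, which may not hold for every catdoc failure.

```python
import subprocess


def extract_with_fallback(commands, path, timeout=30):
    """Try each helper in order; return (helper, text) from the first that works.

    `commands` is a list of argument lists, e.g. [["catdoc"], ["antiword"]].
    The file path is appended as the last argument of each command.
    """
    for command in commands:
        try:
            result = subprocess.run(
                command + [path], capture_output=True, timeout=timeout
            )
        except (OSError, subprocess.TimeoutExpired):
            continue  # helper missing or hung; try the next one
        text = result.stdout.decode("utf-8", errors="replace")
        # Assumption: a usable extraction exits cleanly and prints some text.
        if result.returncode == 0 and text.strip():
            return command[0], text
    return None, ""
```

Calling `extract_with_fallback([["catdoc"], ["antiword"]], "report.doc")` would then only reach Antiword for the files catdoc could not handle.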
Worth pointing out that
Worth pointing out that while the catdoc and catppt helpers technically function under Windows, they are DOS apps and do not support long filenames etc.
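
A possible workaround for the long-filename limitation is to copy each attachment to an 8.3-style name before handing it to the DOS helper. The sketch below is untested against the Windows builds of catdoc and catppt; `stage_with_short_name` is a hypothetical name, and `short_dir` is assumed to be a directory whose own path is short enough for a DOS app (for example something like `c:\tmp`).

```python
import os
import shutil


def stage_with_short_name(path, short_dir):
    """Copy `path` into `short_dir` under an 8.3-style name; return the copy's path."""
    # Keep the dot plus at most three extension characters, e.g. ".DOC".
    extension = os.path.splitext(path)[1][:4].upper()
    staged = os.path.join(short_dir, "INPUT" + extension)
    shutil.copyfile(path, staged)
    return staged
```

The caller would pass the returned path to the helper and delete the copy afterwards.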
Thanks, that's a very good point
Thanks, that's a very good point, and one that I'll add to the helpers page.
Helper apps for Macs
Hi,
Thanks for the info. Will the helper apps for *nix work on a Mac server?
Thanks!
Yes, generally
Depends on the specific helper, but there are ports of the most popular helper apps for Mac OS X. Any particular one in mind?
Helper apps for Macs
I need helper apps for all the file types listed. Thanks!
If they work on *nix, they'll likely work on Mac
If Unix/Linux versions don't run on OS X, there might be ports available at http://www.macports.org/.
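
To see which of the helpers mentioned on this page are already installed on a given server (Mac or otherwise), a small check like the following may help. It is a sketch only: `find_helpers` is a hypothetical name, and it looks on the PATH of the user running the script, which may differ from the PATH your web server uses.

```python
import shutil

# Helper names taken from the table and comments on this page.
HELPERS = ["catdoc", "catppt", "xls2csv", "pdftotext", "antiword"]


def find_helpers(names=HELPERS):
    """Map each helper name to its full path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in names}
```

Any helper that comes back as `None` still needs to be installed, or it lives outside the PATH and must be configured with its full path.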