Recommended helper applications

In order to use search_attachments.module, you will need the appropriate helper apps on the same computer that Drupal is running on. These apps need to print out extracted text to standard output; currently, search_attachments cannot read extracted text that is saved to a file.

search_attachments version 5.x-4 and later comes with a built-in PHP helper for .txt and .csv files. No external helper is necessary.

File type Helper Platform(s) Sample path
.doc catdoc *nix, Win

/usr/bin/catdoc %file%
.doc wv Win c:\wvWare.exe %file%
.pdf pdftotext *nix, Win c:\pdftotext\pdftotext.exe %file% -
.ppt catppt *nix, Win /usr/local/bin/catppt %file%
.xls xls2csv *nix, Win /usr/local/bin/xls2csv %file%

If you have gotten any other helpers to work, I'd love to hear about them.

Comments

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

.ppt powerpoint *nix

.ppt powerpoint *nix /usr/local/bin/catppt %file%
.xls excel *nix /usr/local/bin/xls2csv %file%

xls2csv seems to rely on receiving a valid "header" from the file adn will fail if it doesn't get it

from memory, this has something to do with ISO coding <== don't ask me I just read lotsa stuff and can't remember half of it :|

tim

Awesome, thanks

Thanks Tim, I'll add these to the list.

Helpers - Antiword

M$ will do a "fast-save"

Rather than write a complete .doc, it just appends the changes to the end of the file

Catdoc does not like this

httpd_error.log : [This was fast-saved 1 times. Some information is lost]

Antiword can read most of these and increase parsed file numbers

Will extract from Word 2, 6, 7, 97, 2000, 2002 and 2003

Only hassle seems to be that last update was in 2005 - tho the website ( http://www.winfield.demon.nl/ ) has been updated more recently

Worth pointing out that

Worth pointing out that while the catdoc and catppt helpers technically funtion under Windows, they are DOS apps and do not support long filenames etc.

Thanks, that a very good point

Thanks, that a very good point, and one that I'll add to the helpers page.

Helper apps for Macs

Hi,
Thanks for the info. Will the helper apps for *nix work on a Mac server?
Thanks!

Yes, generally

Depends on the specific helper, but there are ports of the most popular helper apps for Mac OS X. Any particular one in mind?

Helper apps for Macs

I need helper apps for all the file types listed. Thanks!

If they work on *nix, they'll likely work on Mac

If Unix/Linux versions don't run on OS X, there might be ports available at http://www.macports.org/.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.
  • You may post code using <code>...</code> (generic) or <?php ... ?> (highlighted PHP) tags.

More information about formatting options

CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.