Information World Review (IWR) Blog Information World Review (IWR) Blog A blog from www.iwr.co.uk

« Journal editors get guidelines | Main | ProQuest I&L acquired for $222m »

IBM OmniFind Yahoo! Edition: is it any good?

IBM's OmniFind Yahoo Edition, mentioned in IWR a couple of days ago, has to be worth a try although don't expect instant results. There's the small matter of indexing to take care of first. I decided to let its crawlers loose on the IWR blog and website and my own hard drive.

It took a few hours to index around 35,000 documents and build an index of almost 2.5 gigabytes. It works on specific file types, which means that some proprietary format files don't get indexed. And there's a size limit on the files it will index. I don't know what it is, but the first file it baulked at was about 1 gigabyte. [Update: maximum file size is 51.2MB.]

Web crawling is continuous after a delay following the initial session but the hard drive crawling has to be sanctioned by the search administrator.

The search splash screen comes up with all the usual IBM/Yahoo branding, copyright and search destination stuff. You can replace all the elements with your own. I elected for spartan simplicity:

Omni

And here's a glimpse of the start of the results:

Omni1

Set up is dead easy, although it baulked at indexing my machine from the C:\ drive. I settled for making my username folder the root of the search.

Searching is snappy. At the moment (after only half a dozen searches), the average response time is 881 milliseconds. But then you'd expect this if the index is inside your own machine.

If you want to fine-tune the search experience you can import (or key in) synonyms. You can also add featured links which appear at the top of the search results in response to certain keywords being entered. For example you could associate a support URL with the words 'help', 'problem', 'helpdesk', 'faq' and 'support'.

And there's an API for the technically-minded which delivers results as Atom feeds or HTML snippets to embed into your web pages as you wish.

Best of all, it's free. As long as you don't exceed half a million documents and you stick to the local drives in your own system [Update: and network drives mapped/mounted on the search server.]

Warning (added 16/12): You have probably realised that this version does not take account of user access policies. It's for the administrator to ensure that restricted material is not indexed, or to restrict access to the search and its cache.

Comments

Great review! You should note that you can also index network drives...they just have to be mapped/mounted on the server where OmniFind Yahoo! Edition is running.

Also, it does support a bunch of different file formats, including PDFs and all the MS Office formats. You can see a list in one of the forums, here: http://omnifind.ibm.yahoo.net/forums/index.php/topic,15.msg31.html#msg31

The present file size limit is 52,428,800 bytes (51.2MB).

Thanks for your help guys.

I should have put the formats link in. I was so busy feeling ratty that, unlike Copernic, I couldn't get inside certain proprietary files, I forgot my manners. I'll see if I can hotlink that part of the review.

I found that my processor was being more or less completely hogged when indexing was taking place.

A shame, but here's the warning I should have read before downloading:

Important: IBM OmniFind Yahoo! Edition is an enterprise search product, not a desktop search product. As a result, it is recommended that you do not run the search engine for a production environment on a typical notebook or desktop personal computer other than for limited evaluation, testing or development scenarios. Crawling and indexing Web sites and file directories can use up to 100% of the servers processor, network bandwidth or disk space.

It, and plenty of other useful information is on the download page:

http://omnifind.ibm.yahoo.net/download/download.php

David, to your point about your processor being hogged, there is a more detailed explanation of OmniFind Yahoo! Edition's behavior here: http://blog.360.yahoo.com/blog-oPfvMkQ.dqhYv3g7luZuJw--?cq=1&p=82

Post a comment

Bloggers-in-chief

Daniel Griffin, IWR Deputy Editor Daniel Griffin, IWR Deputy Editor
Daniel joined IWR in 2006 after a career as a publisher of guides, supplements and websites for magazine and event companies. His special interest is the evolving publishing and information industry online.

Peter Williams, IWR Editor Peter Williams, IWR Editor
Peter is in his second spell on IWR. Over the last few years he has developed interest in the fields of knowledge management and e-learning, writing and editing extensively on both topics.

Friends of IWR

LI Isues
James Mullan

Lorcan Dempsey’s weblog
Lorcan Dempsey

SocialTech
Josie Fraser

Jennie Law’s blog
Jennie Law

UK Web Focus
Brian Kelly

tfpl blog
James Lappin

e4innovation
Grainne Conole


Powered by Movable Type
Useful links: About | Privacy policy | Terms & conditions | Top of the page
© Incisive Media Ltd. 2008
Incisive Media Limited, Haymarket House, 28-29 Haymarket, London SW1Y 4RX, is a company registered in the United Kingdom with company registration number 04038503