Microsoft Search Server, Microsoft Office SharePoint Server, Microsoft Windows Search — allow indexing electronic documents to search for information.
These systems index files different digital born documents like DOC, XLS,
PDF or TXT.s, but the content of files in
graphic formats, such as scanned paper documents in
TIFF,
JPEG,
PDF, DjVu cannot be indexed and searched directly.
-
An
IFilter is a plugin that allows the Windows Indexing Service and the newer Windows Desktop Search to index different file formats so that they become searchable.
Without an appropriate IFilter,
contents of a file cannot be parsed and
indexed by the search engine.(Form Wikipedia:
IFilter
ABBYY Recognition Server IFilter enables Microsoft Search to “open” scanned documents and make them searchable.
ABBYY offers iFilter which
enable SharePoint Server and
Windows Search to index contents of scanned image and
PDF documents.
ABBYY Recognition Server IFilter works as follows:
receives image files from the
MS search system crawler and sends them to ABBYY Recognition Server for OCR;
picks up the OCR results and sends the full text content of the document back to the
MS search system engine for indexing.
The text of the document is included in the index of the SharePoint Server (or Windows Desktop Search), and the image then becomes discoverable through full-text search.