Google Search Appliance (GSA) is an Enterprise Search System developed by Google.
It is a hardware-software solution with a sole purpose of indexing documents across company’s file servers, web-servers and other resources and making them available for search via a Google-like interface.
GSA supports full-text indexing of searchable and editable file formats, such as
HTML, DOC, XLS, TXT, and text-based
PDF, on file servers and web servers as an “out of the box” feature.
ABBYY Recognition Server offers a ready, easy to use connector to GSA which enables the system to index contents of image documents as well.
The a short overview how the Connector, Recognition Server and the GSA work:
The GSA connector crawls specified network directories
It picks a copy of the image files (original is still there)
Recognition Server 3.0 is processing the files and generates a special
XML feed for the GSA
The connector submits the extracted information GSA
The Google Search Appliance adds the document to the search index.
Users then will find the image documents that contain the relevant data.
They can open the image, since the GSA “knows” the link to the original file.
The result:
To install the GSA Connector, you have to select the option during installation.#
The connector installs on the same computer with the Server Manager. No installation of any components on top of the Google Search Appliance server is required.
Once the connector is installed, you will find an special Windows Service for the Feed Generator:
Connection to the Google Search Appliance server is established automatically after its IP address has been specified in the Connector’s settings.
Back to: Home - New Features in RS 3.0 - IFilter for Microsoft Search