5. SIETS Clustering

This section describes the SIETS clustering technology and provides general steps for working with it. With SIETS clustering, several SIETS servers can be joined into one cluster, so that a search can be performed in a text collection of unlimited size.

5.1. Principles

A SIETS cluster consists of nodes. Each node is a computer on which the SIETS server is installed.

The SIETS cluster technology has a transparent architecture, which implies that each node is fully functional on its own.

5.2. Creating a SIETS Cluster

To create a SIETS cluster, proceed as follows:

  1. Suppose that the text collection is too large to be stored and processed on a single computer. Estimate the number of equal-sized parts into which the collection must be divided so that each part can be stored and processed on a single computer.

  2. Install the SIETS server on as many computers as the number of parts estimated in the previous step.

     For more information on installing SIETS servers, see the SIETS Installation Guide.

  3. On each SIETS server installed in the previous step, create a SIETS storage, using the same storage name on all nodes.

     For more information on creating storages on SIETS servers, see the SIETS Administration and Configuration Guide.

  4. Create an application that imports each part of the text collection to its own node.

     Note: If you want to use the domain element for grouping search results, all documents of the same domain must be imported to the same node. Otherwise, if documents of the same domain are stored on several nodes, the results of a search in the whole text collection will not be grouped by domain properly.
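The planning part of the steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not part of the SIETS software: the function names, the node names, and the idea of taking the domain from the host part of a document URL are all hypothetical, and the sketch makes no SIETS API calls. It shows how an import application might estimate the number of nodes and then pick a node for each document so that documents of the same domain always go to the same node.

```python
import hashlib
from typing import Sequence
from urllib.parse import urlsplit


def estimate_node_count(collection_bytes: int, per_node_capacity_bytes: int) -> int:
    """Return the smallest number of parts such that each part fits on one node.

    Both sizes are assumptions supplied by the administrator.
    """
    if per_node_capacity_bytes <= 0:
        raise ValueError("per_node_capacity_bytes must be positive")
    # Integer ceiling division; at least one node is always needed.
    parts = (collection_bytes + per_node_capacity_bytes - 1) // per_node_capacity_bytes
    return max(1, parts)


def node_for_document(url: str, nodes: Sequence[str]) -> str:
    """Choose a node from the document's domain.

    Assumption: the domain is the host part of the URL. Use whatever value
    your application stores in the domain element instead.
    """
    if not nodes:
        raise ValueError("at least one node is required")
    domain = (urlsplit(url).hostname or "").lower()
    # A stable hash (unlike the built-in hash()) gives the same node on every run.
    digest = hashlib.sha1(domain.encode("utf-8")).digest()
    return nodes[int.from_bytes(digest[:8], "big") % len(nodes)]


if __name__ == "__main__":
    # Hypothetical node names, for illustration only.
    nodes = ["node1.example.net", "node2.example.net", "node3.example.net"]
    print(estimate_node_count(10 * 2**40, 4 * 2**40))
    print(node_for_document("http://www.example.com/a.html", nodes))
    print(node_for_document("http://www.example.com/b.html", nodes))
```

Hashing by domain keeps each domain on one node but yields only approximately equal parts. If a few domains dominate the collection, an explicit domain-to-node table may balance the nodes better.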

    When you have created a SIETS cluster, you can either: