Skip To Content

Add machines to a data store

ArcGIS Data Store allows you to create highly available or scalable data stores by including more than one machine in each type of data store that you deploy.

Options for adding machines vary by data store type and are explained below:

Relational data store

The relational data store is always deployed in primary-standby mode. The first machine on which you install ArcGIS Data Store and create a relational data store is the primary relational data store machine. You can install and configure one other relational data store with the same GIS Server site, making this second relational data store machine the standby machine. Hosted feature layer data is replicated from the primary to the standby machine.

Data is replicated from the primary to the standby machine.

If the primary machine becomes unavailable (for example, if it crashes, loses power, or loses network connectivity), the standby machine is promoted to primary. You will only be without your hosted feature layer data for the period of time it takes for the standby machine to become the primary—this can take up to a few minutes. Once the standby becomes the primary relational data store, it sends its backup files to the same location as the former primary machine.

Standby machine becomes primary when primary is inaccessible.

If you can correct the problem with the former primary machine—for example, if the machine was unplugged and you plugged it back in, or the machine temporarily lost network connectivity but network connectivity was later restored—the former primary becomes the standby machine for the relational data store.

For information about when a relational data store fails over, see ArcGIS Data Store deployment modes.

Tile cache data store

The number of machines you can add to a tile cache data store depends on the deployment mode you use.

  • Primary-standby mode—In primary-standby mode, a tile cache data store can contain a single machine or, to provide fail-over functionality, it can contain two machines. The default deployment mode for a tile cache data store is primary-standby mode.

    The first machine on which you install ArcGIS Data Store and create a tile cache data store is the primary tile cache data store machine. You can install and configure one other tile cache data store with the same GIS Server site, making this second tile cache data store machine your standby machine. Hosted scene layer cache data is replicated from the primary to the standby machine. Similar to relational data stores, the standby becomes primary if the primary machine fails.

    When both primary and standby machines are running, ArcGIS can use the primary or the standby tile cache data store machine when processing read-only queries. This improves performance when you're viewing a hosted scene layer.

    For information about when a tile cache data store running in primary-standby mode fails over, see ArcGIS Data Store deployment modes.

  • Cluster mode—In cluster mode, a tile cache data store can contain multiple data store machines to provide for scalability.

    When hosted layers are published, the hosted scene layer cache data is duplicated on two of the tile cache data store machines.

    Create a tile cache data store with three machines and data distributed across machines when people publish scene layers.

    You can also add tile cache data store machines to meet demand. If you're running out of space on the current machines, install ArcGIS Data Store on additional machines and register them with the same hosting GIS Server site. When you do, caches for new scene layers will be distributed on the new machines. Existing scene caches remain on the original machines.

    New scene caches are placed on the machines added to the tile cache data store.

    To rebalance the existing caches to use the newly added machines, you must create a backup of the tile cache data store and restore it. For instructions, see Rebalance scene layer caches.

Recovery after loss of a tile cache machine in cluster mode

If you have multiple machines in your tile cache data store for the purpose of high availability and one of the machines is lost or goes offline, you may need to redistribute scene cache data.

Attempt to bring the failed machine back online. For example, if a network or power cable is unplugged or the machine was shut off, you can rectify the issue by plugging in the network or power cable or turning on the computer. Once back online, the machine rejoins the tile cache data store and the data store continues to function.

If a machine crashes and cannot be resurrected, you need to take steps to ensure your scene layer caches are still highly available. See Recover a data store for instructions.

Spatiotemporal big data store

The spatiotemporal big data store is always deployed in cluster mode. You can install ArcGIS Data Store on several machines, create spatiotemporal big data stores on each, and register each one with the same hosting GIS Server site.

Note:

Always use an odd number of machines in your spatiotemporal big data store.

Data in the spatiotemporal big data store is duplicated on at least two spatiotemporal big data store machines registered to the same hosting server.

Spatiotemporal big data store containing five machines

If any one machine fails, data still exists on at least one other machine. The data store replicates the data to another machine that is still functional.

One machine fails; data moved to remaining machines.

You can install ArcGIS Data Store on more machines and create spatiotemporal big data stores that are registered with the same GIS Server sites. In this example, three machines are added: one to replace the one that failed, and two more to accommodate increased numbers of hosted spatiotemporal feature layers. Data is rebalanced across all seven machines. Data for newly published spatiotemporal feature layers is also distributed.

Add machines to spatiotemporal big data store and data redistributes

Note:

If you script the creation of multiple spatiotemporal big data store machines, one spatiotemporal big data store machine must be manually configured with the GIS Server site before you can script the creation of additional spatiotemporal big data store machines. Include wait times in your script to be sure the additional spatiotemporal big data store machines are not added at the same time.

Machine roles

In a spatiotemporal big data store cluster, there are two roles: coordinator and data.

  • Coordinator—The coordinator keeps track of what machines are in or have been removed from the cluster and the health of each machine. The coordinator also controls the distribution of data across the cluster and rebalancing data when a machine is lost or removed.

    There must be at least one coordinator machine in the cluster. A spatiotemporal big data store can contain a maximum of five machines that have the coordinator role assigned to them. The first coordinator machine added to the cluster will be the current coordinator of the cluster; one of the other machines assigned the coordinator role can take over if the current coordinator machine is lost or removed from the cluster.

    Machines that fill the coordinator role experience higher network traffic as a result of all the communication required to manage all the machines in the cluster.

  • Data—Data machines store and search the spatial and nonspatial tables that populate hosted spatiotemporal layers. Machines that fill the data role must have a large amount of memory and processing power.

A machine can fill either role or both roles.

When you create a spatiotemporal big data store machine using the Data Store configuration wizard, the first three machines are created in both roles. In other words, all three machines will store data, the first machine will also serve as the current coordinator, and either of the other two machines can be promoted to coordinator if the current coordinator machine fails. Starting with the fourth machine, all machines added to the cluster using the Data Store configuration wizard are deployed in the data role only.

To control the role filled by a particular machine in the cluster when you create it, use the configuredatastore utility to create the spatiotemporal big data store. For example, you can run the configuredatastore utility to create the first three machines in the coordinator role only and create two more machines in the data role only, for a total of a five-machine cluster. This configuration isolates the network communications that could slow performance when the coordinator and data roles are filled by the same machine. It also allows you to have extra resources on the data only machines where they are most needed.

You can also use the configuredatastore utility to change the role of an existing, healthy spatiotemporal big data machine.

Supported role changes are as follows:

  • Specify --roles data to change a coordinator to a data machine. This is possible if there are other coordinator-eligible machines in the cluster. You can change a coordinator machine to a data role if there are at least three other coordinator-eligible machines in the cluster.
  • Specify --roles data to remove the coordinator role from a machine that was deployed with both roles. This leaves the machine in the data role. You can remove the coordinator role from the machine if there are at least three other coordinator-eligible machines in the cluster.
  • Specify --roles coord,data to add the coordinator role to a machine that was initially deployed to fill only the data role. A maximum of five machines can include the coordinator role. If there are already five coordinator machines in the cluster, you cannot add the coordinator role to a data machine in the cluster.

Run the describedatastore utility on any machine in the cluster to identify which machines are assigned the coordinator role, which are assigned the data role, and which machine is the current coordinator for the cluster.

You can only remove the coordinator role from a machine if there are three other coordinators in the cluster. In other words, if there are three machines that fill both the coordinator and data roles in the cluster, you cannot remove the coordinator role from any of those machines.

The number of machines in the cluster affects the status of the cluster when a machine fails, and affects what role you can assign to the replacement machine. Keep in mind that a spatiotemporal big data store cluster should contain an odd number of machines.

In a cluster with only one coordinator machine, failure of that machine renders the spatiotemporal big data store unusable. You must add the coordinator role to at least two of the remaining machines or add two machines in the coordinator role to the cluster to bring the spatiotemporal big data store back online. Therefore, it is recommended that you have three or five machines in the coordinator role in the cluster. These machines can be in both the coordinator and data role or, if you have other machines in the cluster that fill the data role, they can be in just the coordinator role.

You must have at least two data machines in the cluster to prevent data loss if a data machine fails. In a cluster with only two data machines, when one data machine fails, add at least one new data machine to the cluster. If you do not add at least one data machine, you risk losing data if the remaining data machine fails. When there is only one data machine in the cluster, the only way to recover the data if the data machine fails is to restore the spatiotemporal big data store from a backup.

Graph store

The graph store created with ArcGIS Data Store 11.2 and later can contain one or two machines. If you require a highly available graph store, configure a graph store that contains two machines.

The first machine on which you install ArcGIS Data Store and create a graph store is the primary graph store machine. You can install and configure one other graph store with the same GIS Server site, making this second machine your standby graph store machine. Hosted knowledge graphs are replicated from the primary to the standby.

The same data exists on both machines in a highly available graph store.

If the primary becomes unavailable (for example, if the primary machine crashes, loses power, or loses network connectivity), the standby machine is promoted to primary. You will only be without your knowledge graph data for the period of time it takes for the standby machine to become the primary—this can take up to a few minutes. Once the standby becomes the primary graph store, it sends its backup files to the same location as the former primary machine.

If you can correct the problem with the former primary machine—for example, if the machine was unplugged and you plugged it back in, or the machine temporarily lost network connectivity but network connectivity was later restored—the former primary becomes the standby machine for the graph store.

For information about when a graph store fails over, see ArcGIS Data Store deployment modes.

Legacy:

Graph stores created with ArcGIS Data Store 11.1 or earlier can contain only one machine.

Object store

You can create an object store that contains one machine or you can deploy an object store in cluster mode that contains three or more machines.

A single-machine object store with a large amount of disk space can provide you adequate drawing performance when viewing feature layers that are configured to use cached query responses. However, if you require scalability for cached feature query responses, deploy the object store in cluster mode when you create it. You cannot change modes after deploying.

If you deploy the object store in cluster mode, you can add machines to the object store cluster if the existing machines get overloaded. Similarly, if one of the machines in the object store cluster fails and you can't bring it back online, you can install ArcGIS Data Store on a new machine, and configure it as object store with the same hosting server to add the new machine to the cluster.

When you enable query caching on new feature layers, the cached query responses for the new layers will use the new machine in the cluster. If a federated ArcGIS Video Server site is configured to store videos in the object store, newly published video services will also use the new machine.

Add a machine to a data store

Use the Data Store Configuration wizard or the configuredatastore utility to add a machine to any data store.

You can have one standby relational data store, graph store, or tile cache data store that is running in primary-standby mode. You can have multiple machines in a spatiotemporal big data store and in a tile cache data store or object store that is running in cluster mode. All machines that participate in the same data store must use the same operating system and be the same ArcGIS Data Store release.

Use the Data Store Configuration wizard to add a machine to a data store

Follow these steps to use the Data Store Configuration wizard to add a machine to data store:

  1. Install ArcGIS Data Store on another server.

    Each standby machine, primary machine, or node in a cluster must be installed on different physical servers. Use the same network account for the ArcGIS Data Store account on all machines in the data store.

  2. Open the ArcGIS Data Store Configuration wizard.
  3. Specify the fully qualified name of one of the machines in the GIS Server site.

    The name is in the format gisserver.domain.com. Use the same GIS Server site as you did when configuring the other machine or machines in the same data store for this ArcGIS Enterprise deployment.

  4. Type the username and password of the ArcGIS Server primary site administrator and click Next.
  5. Choose the type of data store to create.
  6. Specify a location for the new data store's directory and click Next.
  7. Review the information in the Configuration Summary. If it is correct, click Finish. If changes are needed, click Back to make corrections.

    The High Availability Role indicates that this is a standby machine for relational data stores.

When the setup completes successfully, you have an additional data store machine to which ArcGIS Data Store can replicate data or caches.

If there were errors with the setup, click Next on the Configuration Summary dialog box to reopen the Data Store Configuration wizard so you can provide different information. See Troubleshoot ArcGIS Data Store for causes and resolutions to setup errors.

Use the configuredatastore utility to add a machine to a data store

You can install ArcGIS Data Store on another server or servers and use the configuredatastore utility to add it to an existing data store. The utility is installed in <ArcGIS Data Store installation directory>/tools.

Each standby machine, primary machine, or node in a cluster must be installed on different physical servers.

Note:

You must use the same network account for the ArcGIS Data Store account on all machines in the data store.

Follow these steps to use the configuredatastore utility to add a machine to a data store:

  1. Open a Command Prompt using the Run As Administrator option on the additional server where you installed ArcGIS Data Store.
  2. Run the configuredatastore utility, and provide the fully qualified domain name of a machine in the GIS Server site and the ArcGIS Server primary site administrator username and password.

    Use the --stores operation to specify the type of data store to be created.

    You must specify the same GIS Server machine name and data store type that you used for the data store to which you want to add this machine.

    For example, the following adds a standby relational data store to a machine in the GIS Server site with fully qualified name agsserver.ntwk.com. The data store directory on this machine is c:\data\.

    configuredatastore agsserver.ntwk.com siteadmin P$@pass c:\data\ --stores relational

Another data store machine is added to the data store.

See the configuredatastore section of the ArcGIS Data Store utility reference for syntax and more examples.