Using FoundationDB Clients
Versioning
FoundationDB supports a robust versioning system for both its API and binaries. This system allows clusters to be upgraded with minimal changes to both application code and FoundationDB binaries. The API and the FoundationDB binaries are each released in numbered versions. Each version of the binaries has a corresponding API version.
API versions
In general, a given client will support both its corresponding API version and earlier API versions. A client selects the API version it will use by explicitly specifying the version number upon initialization. The purpose of this design is to allow the server, client libraries, or bindings to be upgraded without having to modify application code. You can therefore upgrade to more recent packages (and so receive their various improvements) without having to change application code.
Binary versions
By default, client and server binaries installed for a given cluster must have the same version. However, there is also a multi-version feature that allows a client to connect to a cluster whose server processes have a different version. This feature works by loading a separate client library that is version-compatible with the cluster, which then proxies all API calls.
Note
The interaction between API versions and binary versions follows a simple rule: the API version cannot be set higher than that supported by the client library in use, including any client library loaded using the multi-version feature.
For more details, see API versioning and Multi-version client.
Cluster file
FoundationDB servers and clients use a cluster file (typically named fdb.cluster
) to connect to a cluster. The contents of the cluster file are the same for all processes that connect to the cluster. When connecting to a cluster, the client APIs allow a cluster file to either be explicitly provided or automatically determined by default. For example, using the Python API:
db = fdb.open('/path/to/specific/file')
uses only the specified file and errors if it is invalid. In contrast:
db = fdb.open()
checks the FDB_CLUSTER_FILE
environment variable, then the current working directory, then the default file. FoundationDB’s procedure for determining a cluster file is described in Specifying a cluster file.
Installing FoundationDB client binaries
To access a FoundationDB cluster from a machine that won’t need to run the server, you can install just the FoundationDB client binaries (available for download at Downloads).
Warning
Unless using the Multi-version client, the installed client and server packages must have the same version. When upgrading a FoundationDB cluster, be sure also to upgrade all clients to the same version.
If you don’t already have a FoundationDB cluster to connect to, you should instead follow the instructions in Getting Started on macOS or Getting Started on Linux.
To install on Ubuntu use the dpkg command:
user@host$ sudo dpkg -i foundationdb-clients_ON-1_amd64.deb
To install on RHEL/CentOS use the rpm command:
user@host$ sudo rpm -Uvh foundationdb-clients-ON-1.el7.x86_64.rpm
To install on macOS, run the installer as in Getting Started on macOS, but deselect the “FoundationDB Server” feature.
The client binaries include the fdbcli
tool and language bindings for C. Other language bindings must be installed separately.
Clients will also need a cluster file to connect to a FoundationDB cluster. You should copy the fdb.cluster
file from the default location on one of your FoundationDB servers to the default location on the client machine.
Multi-version client
The FoundationDB client library supports connecting to clusters with server processes running at a different version than the client. To do so, it must have access to a version of the client compatible with the cluster, which it loads and then proxies API calls through.
To make use of the multi-version client capabilities, you must set at least one of two network options (EXTERNAL_CLIENT_LIBRARY
and/or EXTERNAL_CLIENT_DIRECTORY
) that specify the location of these additional client libraries. The client library will start a new network thread for each external client and attempt to make connections to the cluster over all of them (as well as the local library). When making API calls, the library will use the version that is able to communicate with the cluster (choosing an arbitrary one if there are multiple compatible clients), and it will also automatically switch to the appropriate client should the cluster’s protocol version change. Each loaded library will generate its own trace files, if enabled.
The multi-version client API adds a new cluster_version_changed
error that is used to cancel operations performed by the client when it switches to using a different version of the client library. This error is a retryable one, meaning that all standard retry loops will automatically rerun a transaction that fails with cluster_version_changed
.
Warning
Setting an API version that is not supported by a particular client library will prevent that client from being used to connect to the cluster. In particular, you should not advance the API version of your application after upgrading your client until the cluster has also been upgraded.
Warning
You should avoid including multiple protocol-compatible clients in the external client libraries list. While the client will still work, it will consume more resources than necessary. Additionally, different patch releases of the same version (e.g. x.y.z
and x.y.w
) are protocol compatible, and including multiple may result in not using the most recent compatible client.
Note
It is recommended that you not include more external clients than necessary. For example, a client that has been upgraded to a newer version than its cluster may need to include a single external client that matches the version of the cluster, but it generally won’t require a copy of every prior version.
Note
If cluster_version_changed
is thrown during commit, it should be interpreted similarly to commit_unknown_result
. The commit may or may not have been completed.
Setting network options with environment variables
Client network options can be set automatically upon client startup using specially crafted environment variables. To set a particular network option, add a variable of the following form to your environment:
FDB_NETWORK_OPTION_<UPPERCASE_OPTION_NAME> = value
For example, you can enable trace logging for a client by setting the following environment variable:
FDB_NETWORK_OPTION_TRACE_ENABLE = ""
If you want to set the same option multiple times (e.g. to add multiple client libraries, for instance), you can separate the values with the default path separator on your system (:
on Linux/macOS, ;
on Windows). For example:
FDB_NETWORK_OPTION_EXTERNAL_CLIENT_DIRECTORY = /path/to/dir1:/path/to/dir2:...
Network options specified using environment variables are set at the end of the call to set the API version. They will be applied in the order of the corresponding option codes assigned to the options. If there is an error reading the appropriate environment variables or if the option cannot be set with the specified value, then the call to set the API version may return an error. In that case, you may attempt to set the API version again, but you must do so with the same version as the attempt that failed.
Client network thread
FoundationDB clients start a thread that is used to serialize operations and communicate with the cluster. This thread is commonly referred to as the network thread, and most operations performed by a client (such as reads, writes, and commits) will take place on this thread. It is important that a client application does not block on this thread, such as by issuing a blocking call in a callback from a FoundationDB operation. Some client language bindings (e.g. Java) will protect you from this risk by notifying the application that an operation has completed from a different thread automatically.
A client process can have only one network thread for the entire lifetime of that process. While it’s possible to stop the network thread, it is not possible to restart it.
Performance
Because it can have only a single network thread, a client process may become limited in its throughput by the amount of work that can be performed on that thread. A client with a saturated network thread may begin to experience increased latencies while it struggles to keep up. If a client application needs to support higher throughput than a single network thread can provide, then more network threads can be started by running additional client processes.
If you suspect that a client process’s workload may be saturating the network thread, this can be confirmed by checking whether the network thread is running with high CPU usage. In the client trace logs, the ProcessMetrics
trace event has a field for MainThreadCPUSeconds
that indicates the number of seconds out of Elapsed
that the network thread was busy. You can also attempt to identify a busy thread from any tool that reports the CPU activity of threads in your process.
Note
FoundationDB 6.3 introduced multi-threaded client, and can alternatively be used to scale clients.
Multi-threaded Client
FoundationDB client library can start multiple worker threads for each version of client that is loaded.
Currently, each database object is associated with exactly one of the threads, so a user would need at least N
database objects to make use of N
threads. Additionally, some language bindings (e.g. the python bindings) cache database objects by cluster file, so users may need multiple cluster files to make use of multiple threads. This may be improved in the future.
Clients can be configured to use worker-threads by setting the FDBNetworkOptions::CLIENT_THREADS_PER_VERSION
option.
Warning
In order to use the multi-threaded client feature, you must configure at least one external client. See multi-version client API for how to configure an external client.
Client trace logging
By default, clients do not generate trace logs. In order to enable them, the TRACE_ENABLE
network option must be set on your client. See the documentation for your language binding for instructions on how to do this.
Multi-version client API
When trace logs are enabled while using the multi-version client API, separate files will be generated for the primary client library and each of the loaded external clients. The ClientStart
event can be used to identify the version of the client that generated a particular log file. Most of the relevant logs will be found in the version that matches the cluster’s version, but for some types of issues with startup or upgrades (particularly those related to the multi-version client itself), relevant logs can be found in the logs for the primary client as well.