Installation & configuration guides#
The scalability of TheHive allows it to be set up as a standalone server or as nodes inside a cluster. Any number of nodes can rely on a database and a file system also setup as standalone servers or a cluster.
Before starting installing and configuring, you need to identify and define the targetted architecture.
Choose a setup#
The modular architecture makes it support several types of database, file storage system and indexing system. The initial choices made with the target architecture and the setup are crucial, especially for the database.
If high availability and fault tolerance are necessary, implementing a cluster might be the choice, and this choice determines the database, the file storage and indexing system to install.
Hardware requirements depends on the number of concurrent users and how they use the system. The following table give some information to choose the hardware.
|Number of users||CPU||RAM|
Choose a database#
Once the target setup is identified, the first choice to make is the database. Even of local Berkeley DB and Cassandra database are supported, we recommend using Apache Cassandra, which is a scalable and high available Database, even for standalone servers.
Berkeley DB can be enough for testing purposes.
This choice is decisive as migration from Berkeley DB to Cassandra is not possible.
Choose a file storage system#
Like for databases, several options exist regarding file system.
Basically, for standalone setups, using the local filesystem is the easiest solution. If installing a cluster, there are several options:
- Using a share NFS folder
- Using Apache Hadoop, a distributed file system
- Using a S3-compatible storage service ; for example with Min.IO
Starting with a standalone server and a local file storage and upgrading to a cluster with S3 of Hadoop is possible. Existing files can be moved to the targetted solutions.
Choose an index system#
Introduced with TheHive 4.1 to increase performances, TheHive relies on a dedicated indexing process. With a standalone setup, using a local index with Lucene is sufficient.
In the case of a cluster, all nodes have to connect to the same index: an instance of Elasticsearch is then required.
Starting with a standalone server and Lucene and upgrading to a cluster with Elasticsearch is possible. Indices can be rebuilt. However, it can takes some time.
The following Guide let you prepare, install and configure TheHive and its prerequisites for Debian and RPM packages based Operating Systems, as well as for other systems and using our binary packages.
If you want to build TheHive from sources, you can follow this guide.
The configuration of TheHive is in files stored in the
application.confcontains all parameters and options
logback.xmlis dedicated to log management
/etc/thehive ├── application.conf ├── logback.xml └── secret.conf
A separate secret.conf file is automatically created by Debian or RPM packages. This file should contain a secret that should be used by one instance.
Various aspects can configured in the
- database and indexing
- File storage
- Other service parameters
Basic stand alone server#
Follow the installation guides for you prefered operating system.
Cluster with 3 TheHive nodes#
The folling guide details all the installation and configuration steps to get a cluster with 3 nodes working. The cluster is composed of:
- 3 TheHive servers
- 3 Cassandra servers
- 3 Min.IO servers