consul/website/source/docs/internals/architecture.html.md

---
layout: "docs"
page_title: "Consul Architecture"
sidebar_current: "docs-internals-architecture"
description: |-
  Consul is a complex system that has many different moving parts. To help users and developers of Consul form a mental model of how it works, this page documents the system architecture.
---

# Consul Architecture

Consul is a complex system that has many different moving parts. To help
users and developers of Consul form a mental model of how it works, this
page documents the system architecture.

-> Before describing the architecture, we recommend reading the 
[glossary](/docs/glossary.html) of terms to help
clarify what is being discussed.


## 10,000 foot view

From a 10,000 foot altitude the architecture of Consul looks like this:

<div class="center">
[![Consul Architecture](/assets/images/consul-arch.png)](/assets/images/consul-arch.png)
</div>

Let's break down this image and describe each piece. First of all, we can see
that there are two datacenters, labeled "one" and "two". Consul has first
class support for [multiple datacenters](https://learn.hashicorp.com/consul/security-networking/datacenters) and
expects this to be the common case.

Within each datacenter, we have a mixture of clients and servers. It is expected
that there be between three to five servers. This strikes a balance between
availability in the case of failure and performance, as consensus gets progressively
slower as more machines are added. However, there is no limit to the number of clients,
and they can easily scale into the thousands or tens of thousands.

All the agents that are in a datacenter participate in a [gossip protocol](/docs/internals/gossip.html).
This means there is a gossip pool that contains all the agents for a given datacenter. This serves
a few purposes: first, there is no need to configure clients with the addresses of servers;
discovery is done automatically. Second, the work of detecting agent failures
is not placed on the servers but is distributed. This makes failure detection much more
scalable than naive heartbeating schemes. It also provides failure detection for the nodes; if the agent is not reachable, than the node may have experienced a failure. Thirdly, it is used as a messaging layer to notify
when important events such as leader election take place.

The servers in each datacenter are all part of a single Raft peer set. This means that
they work together to elect a single leader, a selected server which has extra duties. The leader
is responsible for processing all queries and transactions. Transactions must also be replicated to
all peers as part of the [consensus protocol](/docs/internals/consensus.html). Because of this
requirement, when a non-leader server receives an RPC request, it forwards it to the cluster leader.

The server agents also operate as part of a WAN gossip pool. This pool is different from the LAN pool
as it is optimized for the higher latency of the internet and is expected to contain only
other Consul server agents. The purpose of this pool is to allow datacenters to discover each
other in a low-touch manner. Bringing a new datacenter online is as easy as joining the existing
WAN gossip pool. Because the servers are all operating in this pool, it also enables cross-datacenter
requests. When a server receives a request for a different datacenter, it forwards it to a random
server in the correct datacenter. That server may then forward to the local leader.

This results in a very low coupling between datacenters, but because of failure detection,
connection caching and multiplexing, cross-datacenter requests are relatively fast and reliable.

In general, data is not replicated between different Consul datacenters. When a
request is made for a resource in another datacenter, the local Consul servers forward
an RPC request to the remote Consul servers for that resource and return the results.
If the remote datacenter is not available, then those resources will also not be
available, but that won't otherwise affect the local datacenter. There are some special
situations where a limited subset of data can be replicated, such as with Consul's built-in
[ACL replication](https://learn.hashicorp.com/consul/day-2-operations/acl-replication) capability, or
external tools like [consul-replicate](https://github.com/hashicorp/consul-replicate).

In some places, client agents may cache data from the servers to make it
available locally for performance and reliability. Examples include Connect
certificates and intentions which allow the client agent to make local decisions
about inbound connection requests without a round trip to the servers. Some API
endpoints also support optional result caching. This helps reliability because
the local agent can continue to respond to some queries like service-discovery
or Connect authorization from cache even if the connection to the servers is
disrupted or the servers are temporarily unavailable.

## Getting in depth

At this point we've covered the high level architecture of Consul, but there are many
more details for each of the subsystems. The [consensus protocol](/docs/internals/consensus.html) is
documented in detail as is the [gossip protocol](/docs/internals/gossip.html). The [documentation](/docs/internals/security.html)
for the security model and protocols used are also available.

For other details, either consult the code, ask in IRC, or reach out to the mailing list.
website: document the high level architecture 2014-02-20 00:58:15 +00:00			`---`
			`layout: "docs"`
			`page_title: "Consul Architecture"`
			`sidebar_current: "docs-internals-architecture"`
Use new Markdown syntaxes and add SEO descriptions 2014-10-19 23:40:10 +00:00			`description: \|-`
			`Consul is a complex system that has many different moving parts. To help users and developers of Consul form a mental model of how it works, this page documents the system architecture.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00			`---`

			`# Consul Architecture`

			`Consul is a complex system that has many different moving parts. To help`
			`users and developers of Consul form a mental model of how it works, this`
			`page documents the system architecture.`

[docs] New Glossary Page (#5999) * Moved the glossary to a new page and removed the advanced warnings from all internals docs. * Update website/source/layouts/docs.erb Co-Authored-By: Judith Malnick <judith@hashicorp.com> * Updates based on PR feedback * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md 2019-06-24 21:19:12 +00:00			`-> Before describing the architecture, we recommend reading the`
fix glossary link (#6043) 2019-06-28 16:04:09 +00:00			`[glossary](/docs/glossary.html) of terms to help`
[docs] New Glossary Page (#5999) * Moved the glossary to a new page and removed the advanced warnings from all internals docs. * Update website/source/layouts/docs.erb Co-Authored-By: Judith Malnick <judith@hashicorp.com> * Updates based on PR feedback * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md * Update website/source/docs/internals/index.html.md 2019-06-24 21:19:12 +00:00			`clarify what is being discussed.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00

			`## 10,000 foot view`

			`From a 10,000 foot altitude the architecture of Consul looks like this:`

Pre-process architecture docs as ERB for image tags 2014-10-06 23:12:17 +00:00			`<div class="center">`
Update arch diagram 2016-08-02 07:43:06 +00:00			`[![Consul Architecture](/assets/images/consul-arch.png)](/assets/images/consul-arch.png)`
Pre-process architecture docs as ERB for image tags 2014-10-06 23:12:17 +00:00			`</div>`
website: document the high level architecture 2014-02-20 00:58:15 +00:00
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`Let's break down this image and describe each piece. First of all, we can see`
			`that there are two datacenters, labeled "one" and "two". Consul has first`
[docs] Updating links to guides (#5795) * fixing links in the docs post guide migartion. * fixed one more * Update website/source/docs/acl/acl-legacy.html.md Co-Authored-By: kaitlincarter-hc <43049322+kaitlincarter-hc@users.noreply.github.com> * Update website/source/docs/enterprise/connect-multi-datacenter/index.html.md * Updating based on comments and fixing word wrap * Update website/source/api/acl-legacy.html.md * Update website/source/api/acl/acl.html.md * Update website/source/docs/agent/options.html.md * Update website/source/docs/faq.html.md * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/agent/encryption.html.md 2019-05-15 15:49:41 +00:00			`class support for [multiple datacenters](https://learn.hashicorp.com/consul/security-networking/datacenters) and`
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`expects this to be the common case.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`Within each datacenter, we have a mixture of clients and servers. It is expected`
website: fix a couple of typos. 2014-05-03 22:23:16 +00:00			`that there be between three to five servers. This strikes a balance between`
website: document the high level architecture 2014-02-20 00:58:15 +00:00			`availability in the case of failure and performance, as consensus gets progressively`
			`slower as more machines are added. However, there is no limit to the number of clients,`
			`and they can easily scale into the thousands or tens of thousands.`

[docs] Architecture Node vs Agent (#6010) * Upating the term node to be more clear * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/internals/architecture.html.md Co-Authored-By: Paul Banks <banks@banksco.de> * Addressing the failure detection comment 2019-06-24 17:25:47 +00:00			`All the agents that are in a datacenter participate in a [gossip protocol](/docs/internals/gossip.html).`
			`This means there is a gossip pool that contains all the agents for a given datacenter. This serves`
docs: internals/architecture: minor fixes 2014-11-26 12:31:38 +00:00			`a few purposes: first, there is no need to configure clients with the addresses of servers;`
[docs] Architecture Node vs Agent (#6010) * Upating the term node to be more clear * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/internals/architecture.html.md Co-Authored-By: Paul Banks <banks@banksco.de> * Addressing the failure detection comment 2019-06-24 17:25:47 +00:00			`discovery is done automatically. Second, the work of detecting agent failures`
docs: internals/architecture: minor fixes 2014-11-26 12:31:38 +00:00			`is not placed on the servers but is distributed. This makes failure detection much more`
[docs] Architecture Node vs Agent (#6010) * Upating the term node to be more clear * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/internals/architecture.html.md Co-Authored-By: Paul Banks <banks@banksco.de> * Addressing the failure detection comment 2019-06-24 17:25:47 +00:00			`scalable than naive heartbeating schemes. It also provides failure detection for the nodes; if the agent is not reachable, than the node may have experienced a failure. Thirdly, it is used as a messaging layer to notify`
website: document the high level architecture 2014-02-20 00:58:15 +00:00			`when important events such as leader election take place.`

			`The servers in each datacenter are all part of a single Raft peer set. This means that`
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`they work together to elect a single leader, a selected server which has extra duties. The leader`
			`is responsible for processing all queries and transactions. Transactions must also be replicated to`
			`all peers as part of the [consensus protocol](/docs/internals/consensus.html). Because of this`
			`requirement, when a non-leader server receives an RPC request, it forwards it to the cluster leader.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00
[docs] Architecture Node vs Agent (#6010) * Upating the term node to be more clear * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/internals/architecture.html.md Co-Authored-By: Paul Banks <banks@banksco.de> * Addressing the failure detection comment 2019-06-24 17:25:47 +00:00			`The server agents also operate as part of a WAN gossip pool. This pool is different from the LAN pool`
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`as it is optimized for the higher latency of the internet and is expected to contain only`
[docs] Architecture Node vs Agent (#6010) * Upating the term node to be more clear * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/internals/architecture.html.md Co-Authored-By: Paul Banks <banks@banksco.de> * Addressing the failure detection comment 2019-06-24 17:25:47 +00:00			`other Consul server agents. The purpose of this pool is to allow datacenters to discover each`
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`other in a low-touch manner. Bringing a new datacenter online is as easy as joining the existing`
Fix some small doc errors 2018-01-04 21:44:07 +00:00			`WAN gossip pool. Because the servers are all operating in this pool, it also enables cross-datacenter`
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`requests. When a server receives a request for a different datacenter, it forwards it to a random`
			`server in the correct datacenter. That server may then forward to the local leader.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00
website: fixing typo 2014-02-20 01:05:57 +00:00			`This results in a very low coupling between datacenters, but because of failure detection,`
website: Documentation cleanup 2014-04-09 18:06:27 +00:00			`connection caching and multiplexing, cross-datacenter requests are relatively fast and reliable.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00
Adds a note about not replicating data to FAQ and federation-related spots. 2017-08-04 23:14:39 +00:00			`In general, data is not replicated between different Consul datacenters. When a`
			`request is made for a resource in another datacenter, the local Consul servers forward`
			`an RPC request to the remote Consul servers for that resource and return the results.`
			`If the remote datacenter is not available, then those resources will also not be`
			`available, but that won't otherwise affect the local datacenter. There are some special`
			`situations where a limited subset of data can be replicated, such as with Consul's built-in`
[docs] Updating links to guides (#5795) * fixing links in the docs post guide migartion. * fixed one more * Update website/source/docs/acl/acl-legacy.html.md Co-Authored-By: kaitlincarter-hc <43049322+kaitlincarter-hc@users.noreply.github.com> * Update website/source/docs/enterprise/connect-multi-datacenter/index.html.md * Updating based on comments and fixing word wrap * Update website/source/api/acl-legacy.html.md * Update website/source/api/acl/acl.html.md * Update website/source/docs/agent/options.html.md * Update website/source/docs/faq.html.md * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/agent/encryption.html.md 2019-05-15 15:49:41 +00:00			`[ACL replication](https://learn.hashicorp.com/consul/day-2-operations/acl-replication) capability, or`
Adds a note about not replicating data to FAQ and federation-related spots. 2017-08-04 23:14:39 +00:00			`external tools like [consul-replicate](https://github.com/hashicorp/consul-replicate).`

Support Agent Caching for Service Discovery Results (#4541) * Add cache types for catalog/services and health/services and basic test that caching works * Support non-blocking cache types with Cache-Control semantics. * Update API docs to include caching info for every endpoint. * Comment updates per PR feedback. * Add note on caching to the 10,000 foot view on the architecture page to make the new data path more clear. * Document prepared query staleness quirk and force all background requests to AllowStale so we can spread service discovery load across servers. 2018-09-06 10:34:28 +00:00			`In some places, client agents may cache data from the servers to make it`
			`available locally for performance and reliability. Examples include Connect`
			`certificates and intentions which allow the client agent to make local decisions`
			`about inbound connection requests without a round trip to the servers. Some API`
			`endpoints also support optional result caching. This helps reliability because`
			`the local agent can continue to respond to some queries like service-discovery`
			`or Connect authorization from cache even if the connection to the servers is`
			`disrupted or the servers are temporarily unavailable.`

website: document the high level architecture 2014-02-20 00:58:15 +00:00			`## Getting in depth`

docs: internals/architecture: minor fixes 2014-11-26 12:31:38 +00:00			`At this point we've covered the high level architecture of Consul, but there are many`
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`more details for each of the subsystems. The [consensus protocol](/docs/internals/consensus.html) is`
			`documented in detail as is the [gossip protocol](/docs/internals/gossip.html). The [documentation](/docs/internals/security.html)`
Round 2: Fix typos, grammar errors, and misspellings 2014-04-16 04:01:12 +00:00			`for the security model and protocols used are also available.`
website: document the high level architecture 2014-02-20 00:58:15 +00:00
Website: tweaks to docs/internals/architecture.html. 2015-03-30 22:07:58 +00:00			`For other details, either consult the code, ask in IRC, or reach out to the mailing list.`