mirror of https://github.com/k3s-io/k3s
76 lines
3.0 KiB
Go
76 lines
3.0 KiB
Go
|
// Copyright 2015 The etcd Authors
|
||
|
//
|
||
|
// Licensed under the Apache License, Version 2.0 (the "License");
|
||
|
// you may not use this file except in compliance with the License.
|
||
|
// You may obtain a copy of the License at
|
||
|
//
|
||
|
// http://www.apache.org/licenses/LICENSE-2.0
|
||
|
//
|
||
|
// Unless required by applicable law or agreed to in writing, software
|
||
|
// distributed under the License is distributed on an "AS IS" BASIS,
|
||
|
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
||
|
// See the License for the specific language governing permissions and
|
||
|
// limitations under the License.
|
||
|
|
||
|
/*
|
||
|
Package wal provides an implementation of a write ahead log that is used by
|
||
|
etcd.
|
||
|
|
||
|
A WAL is created at a particular directory and is made up of a number of
|
||
|
segmented WAL files. Inside of each file the raft state and entries are appended
|
||
|
to it with the Save method:
|
||
|
|
||
|
metadata := []byte{}
|
||
|
w, err := wal.Create(zap.NewExample(), "/var/lib/etcd", metadata)
|
||
|
...
|
||
|
err := w.Save(s, ents)
|
||
|
|
||
|
After saving a raft snapshot to disk, SaveSnapshot method should be called to
|
||
|
record it. So WAL can match with the saved snapshot when restarting.
|
||
|
|
||
|
err := w.SaveSnapshot(walpb.Snapshot{Index: 10, Term: 2})
|
||
|
|
||
|
When a user has finished using a WAL it must be closed:
|
||
|
|
||
|
w.Close()
|
||
|
|
||
|
Each WAL file is a stream of WAL records. A WAL record is a length field and a wal record
|
||
|
protobuf. The record protobuf contains a CRC, a type, and a data payload. The length field is a
|
||
|
64-bit packed structure holding the length of the remaining logical record data in its lower
|
||
|
56 bits and its physical padding in the first three bits of the most significant byte. Each
|
||
|
record is 8-byte aligned so that the length field is never torn. The CRC contains the CRC32
|
||
|
value of all record protobufs preceding the current record.
|
||
|
|
||
|
WAL files are placed inside of the directory in the following format:
|
||
|
$seq-$index.wal
|
||
|
|
||
|
The first WAL file to be created will be 0000000000000000-0000000000000000.wal
|
||
|
indicating an initial sequence of 0 and an initial raft index of 0. The first
|
||
|
entry written to WAL MUST have raft index 0.
|
||
|
|
||
|
WAL will cut its current tail wal file if its size exceeds 64MB. This will increment an internal
|
||
|
sequence number and cause a new file to be created. If the last raft index saved
|
||
|
was 0x20 and this is the first time cut has been called on this WAL then the sequence will
|
||
|
increment from 0x0 to 0x1. The new file will be: 0000000000000001-0000000000000021.wal.
|
||
|
If a second cut issues 0x10 entries with incremental index later then the file will be called:
|
||
|
0000000000000002-0000000000000031.wal.
|
||
|
|
||
|
At a later time a WAL can be opened at a particular snapshot. If there is no
|
||
|
snapshot, an empty snapshot should be passed in.
|
||
|
|
||
|
w, err := wal.Open("/var/lib/etcd", walpb.Snapshot{Index: 10, Term: 2})
|
||
|
...
|
||
|
|
||
|
The snapshot must have been written to the WAL.
|
||
|
|
||
|
Additional items cannot be Saved to this WAL until all of the items from the given
|
||
|
snapshot to the end of the WAL are read first:
|
||
|
|
||
|
metadata, state, ents, err := w.ReadAll()
|
||
|
|
||
|
This will give you the metadata, the last raft.State and the slice of
|
||
|
raft.Entry items in the log.
|
||
|
|
||
|
*/
|
||
|
package wal
|