spellcheck/style leakybucket readme (#2294)

This commit is contained in:
mmetc 2023-06-21 11:47:07 +02:00 committed by GitHub
parent 4c4b545e9b
commit da6106bd23
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23

View file

@ -2,47 +2,49 @@
## Bucket concepts ## Bucket concepts
Leakybucket is used for decision making. Under certain conditions The Leakybucket is used for decision making. Under certain conditions,
enriched events are poured in these buckets. When these buckets are enriched events are poured into these buckets. When these buckets are
full, we raise a new event. After this event is raised the bucket is full, we raise a new event. After this event is raised the bucket is
destroyed. There are many types of buckets, and we welcome any new destroyed. There are many types of buckets, and we welcome any new
useful design of buckets. useful design of buckets.
Usually the bucket configuration generates the creation of many Usually, the bucket configuration generates the creation of many
buckets. They are differenciated by a field called stackkey. When two buckets. They are differentiated by a field called stackkey. When two
events arrives with the same stackkey they go in the same matching events arrive with the same stackkey they go in the same matching
bucket. bucket.
The very purpose of these buckets is to detect clients that exceed a The very purpose of these buckets is to detect clients that exceed a
certain rate of attemps to do something (ssh connection, http certain rate of attempts to do something (ssh connection, http
authentication failure, etc...). Thus, the most use stackkey field is authentication failure, etc...). Thus, the most used stackkey field is
often the source_ip. often the source_ip.
## Standard leaky buckets ## Standard leaky buckets
Default buckets have two main configuration options: Default buckets have two main configuration options:
* capacity: number of events the bucket can hold. When the capacity * capacity: number of events the bucket can hold. When the capacity
is reached and a new event is poured, a new event is raised. We is reached and a new event is poured, a new event is raised. We
call this type of event overflow. This is an int. call this type of event overflow. This is an int.
* leakspeed: duration needed for an event to leak. When an event * leakspeed: duration needed for an event to leak. When an event
leaks, it disappear from the bucket. leaks, it disappears from the bucket.
## Trigger ## Trigger
It's a special type of bucket with a zero capacity. Thus, when an A Trigger is a special type of bucket with a capacity of zero. Thus, when an
event is poured in a trigger, it always raises an overflow. event is poured into a trigger, it always raises an overflow.
## Uniq ## Uniq
It's a bucket working as the standard leaky bucket except for one A Uniq is a bucket working like the standard leaky bucket except for one
thing: a filter returns a property for each event and only one thing: a filter returns a property for each event and only one
occurrence of this property is allowed in the bucket, thus the bucket occurrence of this property is allowed in the bucket, thus the bucket
is called uniq. is called uniq.
## Counter ## Counter
It's a special type of bucket with an infinite capacity and an A Counter is a special type of bucket with an infinite capacity and an
infinite leakspeed (it never overflows, neither leaks). Nevertheless, infinite leakspeed (it never overflows, nor leaks). Nevertheless,
the event is raised after a fixed duration. The option is called the event is raised after a fixed duration. The option is called
duration. duration.
@ -52,36 +54,41 @@ duration.
* type: mandatory field. Must be one of "leaky", "trigger", "uniq" or * type: mandatory field. Must be one of "leaky", "trigger", "uniq" or
"counter" "counter"
* name: mandatory field, but the value is totally open. Nevertheless
* name: mandatory field, but the value is totally open. Nevertheless,
this value will tag the events raised by the bucket. this value will tag the events raised by the bucket.
* filter: mandatory field. It's a filter that is run when the decision
to make an event match the bucket or not. The filter have to return * filter: mandatory field. It's a filter that is run to decide whether
an event matches the bucket or not. The filter has to return
a boolean. As a filter implementation we use a boolean. As a filter implementation we use
https://github.com/antonmedv/expr https://github.com/antonmedv/expr
* capacity: [mandatory for now, shouldn't be mandatory in the final * capacity: [mandatory for now, shouldn't be mandatory in the final
version] it's the size of the bucket. When pouring in a bucket version] it's the size of the bucket. When pouring in a bucket
already with size events, it overflows. already with size events, it overflows.
* leakspeed: leakspeed is a time duration (has to be parseable by
https://golang.org/pkg/time/#ParseDuration). After each interval an * leakspeed: leakspeed is a time duration (it has to be parsed by
https://golang.org/pkg/time/#ParseDuration). After each interval, an
event is leaked from the bucket. event is leaked from the bucket.
* stackkey: mandatory field. This field is used to differentiate on * stackkey: mandatory field. This field is used to differentiate on
which bucket ongoing events will be poured. When an unknown stackkey which instance of the bucket the matching events will be poured.
is seen in an event a new bucket is created. When an unknown stackkey is seen in an event, a new bucket is created.
* on_overflow: optional field, that tells the what to do when the
bucket is returning the overflow event. As of today, the possibility * on_overflow: optional field, that tells what to do when the
are these: "ban,1h", "Reprocess", "Delete". bucket is returning the overflow event. As of today, the possibilities
Reprocess is used to send the raised event back in the event pool to are "ban,1h", "Reprocess" or "Delete".
be matched agains buckets Reprocess is used to send the raised event back to the event pool to
be matched against buckets
### Fields for special buckets ### Fields for special buckets
#### Uniq #### Uniq
Uniq has an extra field uniq_filter which is too use the filter * uniq_filter: an expression that must comply with the syntax defined
implementation from https://github.com/antonmedv/expr. The filter must in https://github.com/antonmedv/expr and must return a string.
return a string. All strins returned by this filter in the same All strings returned by this filter in the same buckets have to be different.
buckets have to be different. Thus, if a string is seen twice it is Thus if a string is seen twice, the event is dismissed.
dismissed.
#### Trigger #### Trigger
@ -89,11 +96,11 @@ Capacity and leakspeed are not relevant for this kind of bucket.
#### Counter #### Counter
It's a special kind of bucket that raise an event and is destroyed * duration: the Counter will be destroyed after this interval
after a fixed duration. The configuration field used is duration and has elapsed since its creation. The duration must be parsed
must be parseable by https://golang.org/pkg/time/#ParseDuration. by https://golang.org/pkg/time/#ParseDuration.
Nevertheless, this kind of bucket is often used with an infinite Nevertheless, this kind of bucket is often used with an infinite
leakspeed and an infinite capacity [capacity set to -1 for now]. leakspeed and an infinite capacity [capacity set to -1 for now].
## Add examples here ## Add examples here
@ -126,17 +133,17 @@ leakspeed and an infinite capacity [capacity set to -1 for now].
[This is not dry enough to have many details here, but:] [This is not dry enough to have many details here, but:]
The bucket code is triggered by `InfiniBucketify` in main.go. The bucket code is triggered by `InfiniBucketify` in `main.go`.
There's one struct called buckets which is for now a There is one struct called buckets which is for now a
`map[string]interface{}` that holds all buckets. The key of this map `map[string]interface{}` that holds all buckets. The key of this map
is derivated from the filter configured for the bucket and its is derived from the filter configured for the bucket and its
stackkey. This looks like complicated, but in fact it allows us to use stackkey. This looks complicated, but it allows us to use
only one structs. This is done in buckets.go. only one struct. This is done in buckets.go.
On top of that the implementation define only the standard leaky On top of that the implementation defines only the standard leaky
bucket. A goroutine is launched for every buckets (bucket.go). This bucket. A goroutine is launched for every bucket (`bucket.go`). This
goroutine manages the life of the bucket. goroutine manages the life of the bucket.
For special buckets, hooks are defined at initialization time in For special buckets, hooks are defined at initialization time in
manager.go. Hooks are called when relevant by the bucket gorourine manager.go. Hooks are called when relevant by the bucket goroutine
when events are poured and/or when bucket overflows. when events are poured and/or when a bucket overflows.