[fluentd-elasticsearch] Multi Process Workers #56

Open
kfirfer opened this issue Jan 18, 2021 · 1 comment
Labels
enhancement New feature or request

Comments

kfirfer commented Jan 18, 2021

What do you think about Multi Process Workers in fluentd?
https://docs.fluentd.org/deployment/multi-process-workers
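
For reference, the feature boils down to a workers setting in the <system> section plus <worker N> directives for plugins that cannot run in every worker. A minimal sketch based on that page (the worker count, paths and the tail source are illustrative, not taken from this chart):

    <system>
      # number of worker processes the supervisor forks (illustrative value)
      workers 4
    </system>

    # plugins that are not multi-worker ready (e.g. in_tail) must be pinned
    # to a single worker with a <worker N> directive
    <worker 0>
      <source>
        @type tail
        path /var/log/containers/*.log
        pos_file /var/log/fluentd-containers.log.pos
        tag kubernetes.*
        <parse>
          @type none
        </parse>
      </source>
    </worker>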

@kfirfer kfirfer added the enhancement New feature or request label Jan 18, 2021
nvtkaszpir (Contributor) commented Feb 12, 2021

This is super problematic in general:

  • each worker spawns separate jobs and thus a separate workdir (buffers)
  • changing workers from 1 to more tends to generate subdirectories per worker, so the workers=1 layout does not follow the same path pattern as workers=2 or more (see the sketch after this list)
  • because of the above, changing workers from 1 to more may cause data loss from the old worker
  • scaling down workers from X to X-1 (where X>2) may cause data loss, because the buffers left by the removed worker may never be processed
  • scaling down workers from X (where X>=2) to 1 may cause data loss because the directory structure changes again (as in the second point from the top)
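
To make the directory-pattern point concrete, here is a sketch with a file buffer on an elasticsearch output (the match and path are illustrative, and the per-worker subdirectory is how I understand the file buffer to behave once workers > 1):

    <match **>
      @type elasticsearch
      <buffer>
        @type file
        path /var/log/fluentd-buffers/kubernetes.system.buffer
      </buffer>
    </match>

    # with workers = 1 (the default) chunks are written directly under the path:
    #   /var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b*.log
    # with workers >= 2 each worker writes into its own subdirectory, e.g.:
    #   /var/log/fluentd-buffers/kubernetes.system.buffer/worker0/buffer.b*.log
    #   /var/log/fluentd-buffers/kubernetes.system.buffer/worker1/buffer.b*.log
    # chunks left under the old layout are not picked up after the switch,
    # which is where the data loss described above comes from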

What it means:

  • if you are on workers=1 now (the default) and want to go multi-worker, it is safer to create a new deployment/statefulset with multiple workers
  • scaling up workers is pretty safe (say from 2 to 4)
  • scaling down workers may lead to data loss - if you have workers=4 and want to switch to workers=3, you may end up with orphaned files in the buffer left by the last worker, and you need to handle them on your own
  • in short: don't change the worker count in place - spawn a new deployment with the new worker count, deregister the old deployment from the loadbalancers/services so that it gradually drains its buffers (avoiding data loss), and after that you can remove it

So if you want to use multiple workers, you can do it, but commit to that worker count from the start and keep the limitations in mind. It's way easier in general to spawn new pods than to spawn more workers in a pod. Yet you may need to tune the number of workers per node size, so it's worth counting nproc or something like that when starting the daemonset on the host.
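
One way to tie the worker count to the node size is to let the container entrypoint export the CPU count and read it in the config via fluentd's embedded-Ruby strings. A sketch only - FLUENTD_WORKERS is a hypothetical variable name, not something this chart sets:

    # entrypoint (hypothetical): export FLUENTD_WORKERS="$(nproc)"
    <system>
      # double-quoted config values are evaluated as embedded Ruby, so the
      # env var (falling back to 1) decides the worker count at startup
      workers "#{ENV.fetch('FLUENTD_WORKERS', 1)}"
    </system>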
