engine:enhanced byte order handling for timestamps #9196

mirko-lazarevic · 2024-08-13T08:09:00Z

This change ensures correct byte order conversions for timestamp fields within log event decoder and encoder.

Added FLB_TO_NATIVE_UINT32 in flb_endian.h. This function checks the host machine's byte order and applies the necessary conversion for 32-bit unsigned integers.

Enter [N/A] in the box, if an item is not applicable to your change.

Testing
Before we can approve your change; please submit the following in a comment:

Testing was performed on a big-endian machine (IBM LinuxONE).

Example configuration file for the change

[SERVICE]
    Flush                   5
    Daemon                  Off
    Log_Level               debug
    Parsers_File            parsers.conf
    Plugins_File            plugins.conf
    HTTP_Server             On
    HTTP_Listen             0.0.0.0
    HTTP_Port               8084
 
[INPUT]
    Name   dummy
    Dummy {"message": "custom dummy"}

[OUTPUT]
    Name forward
    Match *
    Host 127.0.0.1
    Port 24284

Debug log output from testing the change

We captured the forward output with the nc command:

nc -4 -l -p 24284 > /tmp/capture.forward.dummy.fix.bin &

We then opened the captured file in Vim and ran %!xxd to convert it to a hex representation.

Capture before fix:

We observe that the timestamp converts to a date in the past:

Capture after the fix:

We observe that the timestamp converts to the current date (the date when the test was performed):
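When reading such a hex dump by hand, it may help to recall how the timestamp is laid out. The sketch below assumes the Fluentd forward protocol's EventTime extension, whose 8-byte payload holds seconds and then nanoseconds as big-endian 32-bit integers. It decodes the payload with shifts, so the result does not depend on the host byte order. The names `event_time`, `read_be32`, and `decode_event_time` are hypothetical and are not taken from the Fluent Bit sources.

```c
#include <stdint.h>

/* Hypothetical container for a decoded EventTime payload. */
struct event_time {
    uint32_t seconds;
    uint32_t nanoseconds;
};

/* Assemble a 32-bit value from four bytes stored most significant first.
 * Shifting bytes instead of casting the pointer keeps this independent of
 * the host byte order. */
static uint32_t read_be32(const uint8_t *p)
{
    return ((uint32_t) p[0] << 24) |
           ((uint32_t) p[1] << 16) |
           ((uint32_t) p[2] << 8)  |
            (uint32_t) p[3];
}

/* Assumption: payload points at the 8 data bytes that follow the
 * extension header (seconds first, then nanoseconds). */
static struct event_time decode_event_time(const uint8_t payload[8])
{
    struct event_time t;

    t.seconds     = read_be32(payload);
    t.nanoseconds = read_be32(payload + 4);

    return t;
}
```

For a made-up payload starting with the bytes 66 bb 14 9c, this yields seconds = 0x66BB149C; reading the same four bytes in the opposite order would give 0x9C14BB66, which is the kind of mismatch a missing or extra byte swap produces.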

[N/A] Attached Valgrind output that shows no leaks or memory corruption was found

If this is a change to packaging of containers or native binaries then please confirm it works for all targets.

[N/A] Run local packaging test showing all targets (including any new ones) build.
[N/A] Set ok-package-test label to test for all targets (requires maintainer to do).

Documentation

[N/A] Documentation required for this feature

Backporting

[N/A] Backport to latest stable release.

Fluent Bit is licensed under Apache 2.0, by submitting this pull request I understand that this code will be released under the terms of that license.

mirko-lazarevic · 2024-08-13T09:56:23Z

Hey @leonardo-albertovich @edsiper, this PR introduces a new function, FLB_TO_NATIVE_UINT32, that dynamically determines the byte order of the host machine and applies the necessary byte swap operation for 32-bit unsigned integers. This approach accommodates both little-endian and big-endian architectures and should ensure consistent handling of data (timestamps) across platforms with different endianness.

My knowledge of the fluent-bit codebase is limited, so I am uncertain about the function name and about where I added it (the flb_byteswap.h file). I'm open to suggestions. Thank you.
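A minimal sketch of the idea described above, not the code from this PR: it probes the host byte order at run time and swaps a 32-bit value only on a little-endian host, assuming the input was read verbatim from a big-endian (network order) field. The names `host_is_little_endian`, `bswap32`, and `be32_to_host` are hypothetical.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: returns 1 when the lowest-addressed byte of the
 * integer 1 is non-zero, which is the case on a little-endian host. */
static int host_is_little_endian(void)
{
    const uint32_t probe = 1;
    uint8_t first_byte;

    memcpy(&first_byte, &probe, 1);

    return first_byte == 1;
}

/* Hypothetical helper: unconditional 32-bit byte swap. */
static uint32_t bswap32(uint32_t value)
{
    return ((value & 0x000000FFu) << 24) |
           ((value & 0x0000FF00u) << 8)  |
           ((value & 0x00FF0000u) >> 8)  |
           ((value & 0xFF000000u) >> 24);
}

/* Assumption: value holds four bytes copied as-is from a big-endian
 * field, so it only needs swapping on a little-endian host. */
static uint32_t be32_to_host(uint32_t value)
{
    return host_is_little_endian() ? bswap32(value) : value;
}
```

A run-time probe like this is simple, but it costs a branch on every call; a compile-time check avoids that when the build can determine the byte order reliably.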

leonardo-albertovich · 2024-08-13T12:16:31Z

include/fluent-bit/flb_byteswap.h

@@ -102,4 +103,13 @@ static inline uint64_t FLB_BSWAP_64(uint64_t value)

 #endif

+static inline uint32_t FLB_TO_NATIVE_UINT32(uint32_t value)


Please use FLB_BYTE_ORDER and FLB_BIG_ENDIAN instead of these.

Please rename this to FLB_UINT32_TO_HOST_BYTE_ORDER so its intention is clear.
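One way the two requests above could be combined is sketched below. It is an illustration only: the macro definitions are stand-ins for those in flb_endian.h (the value 0 for FLB_LITTLE_ENDIAN is an assumption), the detection relies on the compiler-provided `__BYTE_ORDER__` macro, and the function body is hypothetical rather than the code that was merged.

```c
#include <stdint.h>

/* Stand-ins for the definitions in flb_endian.h. FLB_BIG_ENDIAN is 1 in
 * that header; the value 0 for FLB_LITTLE_ENDIAN is assumed here. */
#define FLB_LITTLE_ENDIAN 0
#define FLB_BIG_ENDIAN    1

#ifndef FLB_BYTE_ORDER
#if defined(__BYTE_ORDER__) && defined(__ORDER_BIG_ENDIAN__) && \
    __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__
#define FLB_BYTE_ORDER FLB_BIG_ENDIAN
#else
#define FLB_BYTE_ORDER FLB_LITTLE_ENDIAN
#endif
#endif

/* Hypothetical body: converts a value stored in big-endian order to the
 * host byte order, with the decision made at compile time. */
static inline uint32_t FLB_UINT32_TO_HOST_BYTE_ORDER(uint32_t value)
{
#if FLB_BYTE_ORDER == FLB_BIG_ENDIAN
    /* Host order already matches the stored order. */
    return value;
#else
    return ((value & 0x000000FFu) << 24) |
           ((value & 0x0000FF00u) << 8)  |
           ((value & 0x00FF0000u) >> 8)  |
           ((value & 0xFF000000u) >> 24);
#endif
}
```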

src/flb_log_event_encoder_primitives.c

This change ensures correct byte order conversions for timestamp fields within log event decoder and encoder.

Added FLB_TO_NATIVE_UINT32 in flb_endian.h. This function checks the host machine's byte order and applies the necessary conversion for 32-bit unsigned integers.

Co-authored-by: Bernhard Schmid <[email protected]>
Signed-off-by: Mirko Lazarevic <[email protected]>

rightblank · 2024-08-16T07:42:27Z

Hi, @edsiper @leonardo-albertovich @cosmo0920 @pwhelan,
Would you please help to review this PR again? This is a crucial part for fluent bit to work on s390x.

rightblank · 2024-08-21T11:50:15Z

> @rightblank I see the unit tests in the CI are failing, would you please check them?

Hi, @edsiper @leonardo-albertovich @cosmo0920 @pwhelan @fujimotos, these cases failed because the timestamps are handled as if the host were big-endian, due to an issue in the flb_endian.h file.

On the Linux x86 platform, __BIG_ENDIAN is defined in /usr/include/x86_64-linux-gnu/bits/endian.h:
```
#define __LITTLE_ENDIAN 1234
#define __BIG_ENDIAN    4321
#define __PDP_ENDIAN    3412
```

Because __BIG_ENDIAN is defined as a constant on every host, the condition on line 60 of flb_endian.h is true even on little-endian machines, and FLB_BYTE_ORDER is defined as FLB_BIG_ENDIAN on line 61.

60    #elif defined(__BIG_ENDIAN__) || defined(__BIG_ENDIAN) || defined(_BIG_ENDIAN)
61        #define FLB_BYTE_ORDER FLB_BIG_ENDIAN
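The pitfall described above can be reproduced in isolation. The sketch below contrasts a check that only tests whether `__BIG_ENDIAN` exists with one that compares the host order against the big-endian constant, and uses a run-time probe as ground truth. It assumes a GCC- or Clang-compatible compiler (for `__BYTE_ORDER__`) and includes `<endian.h>` only on Linux; all function names are hypothetical.

```c
#include <stdint.h>
#include <string.h>

#if defined(__linux__)
#include <endian.h> /* defines __BIG_ENDIAN as a constant on every host */
#endif

/* Flawed check: true whenever the constant exists, regardless of the
 * actual byte order of the host. */
static int big_endian_by_defined(void)
{
#if defined(__BIG_ENDIAN)
    return 1;
#else
    return 0;
#endif
}

/* Value comparison: true only when the host order equals the constant. */
static int big_endian_by_compare(void)
{
#if defined(__BYTE_ORDER__) && defined(__ORDER_BIG_ENDIAN__) && \
    __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__
    return 1;
#else
    return 0;
#endif
}

/* Run-time ground truth: on a big-endian host the lowest-addressed byte
 * of the integer 1 is zero. */
static int big_endian_at_runtime(void)
{
    const uint32_t probe = 1;
    uint8_t first_byte;

    memcpy(&first_byte, &probe, 1);

    return first_byte == 0;
}
```

On a little-endian glibc host, `big_endian_by_defined()` would be expected to return 1 while the other two functions return 0, which is the mismatch that made the unit tests fail.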

leonardo-albertovich · 2024-08-21T12:42:29Z

Good catch, do you want to open a PR to fix it or would you rather have me do it?

rightblank · 2024-08-21T13:03:02Z

@leonardo-albertovich, I have a local patch, but I'm not sure whether it works on *BSD, Windows, and macOS:

diff --git a/include/fluent-bit/flb_endian.h b/include/fluent-bit/flb_endian.h
index b376ea842..5a6f73a09 100644
--- a/include/fluent-bit/flb_endian.h
+++ b/include/fluent-bit/flb_endian.h
@@ -55,9 +55,9 @@
 #define FLB_BIG_ENDIAN    1
 
 #ifndef FLB_BYTE_ORDER
-    #if defined(__BYTE_ORDER__) &&  __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__
-        #define FLB_BYTE_ORDER FLB_BIG_ENDIAN
-    #elif defined(__BIG_ENDIAN__) || defined(__BIG_ENDIAN) || defined(_BIG_ENDIAN)
+    #if (defined(__BYTE_ORDER__) && __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__) || \
+          (defined(_BYTE_ORDER) && _BYTE_ORDER == _BIG_ENDIAN) || \
+          (defined(__BIG_ENDIAN__))
         #define FLB_BYTE_ORDER FLB_BIG_ENDIAN
     #else
         #define FLB_BYTE_ORDER FLB_LITTLE_ENDIAN

leonardo-albertovich · 2024-08-21T13:12:10Z

Yeah, that's the thing: since my initial approach was clearly flawed, I think we should do a more thorough check for the second one.

Another option would be to use a CMake test like this:

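include(CheckCSourceRuns)

# The test program exits with 0 only when the lowest-addressed byte of the
# value 1 is zero, i.e. on a big-endian host, so the result variable is
# true on big-endian systems.
check_c_source_runs("
  #include <stdint.h>

  int main() {
    volatile uint64_t source_value;
    volatile uint8_t *test_value;

    source_value = 1;
    test_value = (volatile uint8_t *) &source_value;

    return (int) test_value[0];
  }" FLB_ENDIANNESS_TEST_RESULT)

if(FLB_ENDIANNESS_TEST_RESULT)
  FLB_DEFINITION(FLB_HAVE_BIG_ENDIAN_SYSTEM)
endif()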

The only upside of this would be simplicity and the presumption that it should work across systems (unless the compiler is seriously dodgy about optimizations).

Which approach do you think would be better?

Side note: the volatile qualifier is more or less abused here to prevent overly eager compilers from erroneously optimizing the code.

rightblank · 2024-08-21T13:21:34Z

@leonardo-albertovich I'm good with your solution; it's more generic across all the platforms. Please go ahead and open the PR. Thanks in advance!

Also, I remember that the CMake file of msgpack-c implements something very similar to your solution:
https://github.com/fluent/fluent-bit/blob/master/lib/msgpack-c/CMakeLists.txt#L12-L30

leonardo-albertovich · 2024-08-21T13:59:34Z

Well, given that it's already implemented I don't see a need to add a different version so I'll just copy and adapt that snippet in a PR.

I'll send an update as soon as it's up.

leonardo-albertovich · 2024-08-21T14:18:35Z

PR #9256 up, would you mind taking a look at it @rightblank?

rightblank · 2024-08-23T06:37:01Z

Hi, @cosmo0920, would you please help to trigger the CI for this PR again?

rightblank · 2024-08-25T14:53:10Z

> @rightblank I see the unit tests in the CI are failing, would you please check them?

Hi, @edsiper, the CI looks good now, would you please help to merge this PR?

…r timestamps (fluent#9196) This change ensures correct byte order conversions for timestamp fields within log event decoder and encoder. Added FLB_TO_NATIVE_UINT32 in flb_endian.h. This function checks the host machine's byte order and applies the necessary conversion for 32-bit unsigned integers. Signed-off-by: Mirko Lazarevic <[email protected]> Co-authored-by: Bernhard Schmid <[email protected]>

mirko-lazarevic requested review from edsiper, leonardo-albertovich, fujimotos and koleini as code owners August 13, 2024 08:09

github-actions bot added the docs-required label Aug 13, 2024

mirko-lazarevic force-pushed the timestamp-encode-decode-big-endian branch from 7c2ffce to 38264c0 Compare August 13, 2024 08:56

leonardo-albertovich requested changes Aug 13, 2024

mirko-lazarevic force-pushed the timestamp-encode-decode-big-endian branch from 38264c0 to 698f607 Compare August 13, 2024 15:30

mirko-lazarevic requested a review from leonardo-albertovich August 13, 2024 15:49

mirko-lazarevic force-pushed the timestamp-encode-decode-big-endian branch from 698f607 to 87be8b9 Compare August 13, 2024 16:39

leonardo-albertovich requested changes Aug 13, 2024

rightblank mentioned this pull request Aug 14, 2024

core: fix configuration type cast issue on big endian systems #8904

Merged

mirko-lazarevic force-pushed the timestamp-encode-decode-big-endian branch from 87be8b9 to efdf65b Compare August 14, 2024 09:09

mirko-lazarevic requested a review from leonardo-albertovich August 15, 2024 12:07

leonardo-albertovich approved these changes Aug 19, 2024

Merge branch 'fluent:master' into timestamp-encode-decode-big-endian

442a4ab

rightblank mentioned this pull request Aug 22, 2024

core: endianness detection fix #9256

Merged

edsiper modified the milestones: Fluent Bit v3.2.0, Fluent Bit v3.1.7 Aug 22, 2024

Merge branch 'fluent:master' into timestamp-encode-decode-big-endian

2e7c59e

edsiper merged commit 8e5c213 into fluent:master Aug 27, 2024
43 checks passed

mirko-lazarevic deleted the timestamp-encode-decode-big-endian branch August 29, 2024 18:16

BrewTestBot mentioned this pull request Sep 2, 2024

fluent-bit 3.1.7 Homebrew/homebrew-core#183215

Merged
