-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing data #57
Comments
Hello, Someone has an idea ? |
I check it, i think is another problem. I continue to loose some messages. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello,
i use this rabbitmq receivers but i have some problems. I used a java project ( spring-amqp ) for publish somes Avro messages in RabbitMq and consume it with scala-spark project.
Let me take an example with simple 'String' message but in my real case i publish avro messages and decode it and save it on HDFS in Parquet Format with PairRddFunction and i lose ~1% of my data .
and i use distributed receivers to read it transform it and write it on Parquet format in HDFS.
If you execute this code, sometime you will see the log "Missing Data" without any reason or warn or something like that (cf misssing-data.png 67156 instead of 67157).
For me totalEvents it's ok but totalEventsPair miss some data
I'm looking for this problems during 3 days but i didn't find any solutions.
For your information, i try to activate Write Ahead Logs, Checkpointing, Back-pressure ...
I use
Tx for your work 👍
The text was updated successfully, but these errors were encountered: