Add sanity check in GC to prevent active ledger deletion #15

rdhabalia · 2018-01-10T19:36:20Z

Motivation

Sometimes, due to unexpected bugs in GC, the bookie deletes active ledgers (ledgers whose metadata still exists in ZK), which results in data loss.

We have seen this scenario multiple times:
(1) Lexicographical sorting at ledgerManager.
(2) Recently we have seen that LongHierarchicalLedgerManager does not handle ledger directory traversal correctly if we manually clean up an empty directory that has no children.
e.g.:
We implemented a cron job to delete a ledger directory from ZooKeeper when it does not contain any children, e.g. /ledgers/000/0000/0042/0003.
In the above sequence, after processing nodes 0001 and 0002, when the LongHierarchicalLedgerManager iterator sees that the next node (0003) does not exist, it assumes that no nodes from 0003 through 9999 exist under /ledgers/000/0000/0042. This triggers the bookie to delete any ledgers from 0004 onwards that are present on that bookie.
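
The failure mode in (2) can be illustrated with a small, self-contained model. This is only a sketch under assumptions: the class and method names below are hypothetical and this is not the LongHierarchicalLedgerManager code. It models a scan that stops at the first missing child, and a GC that deletes every ledger the scan did not report as active.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical model of the bug, not the real ledger manager iterator.
class LedgerRangeScanSketch {

    /** Walks ids in order and stops at the first id that has no node in the metadata store. */
    static SortedSet<Integer> scanStoppingAtFirstGap(SortedSet<Integer> zkChildren, int from, int to) {
        SortedSet<Integer> active = new TreeSet<>();
        for (int id = from; id <= to; id++) {
            if (!zkChildren.contains(id)) {
                break; // wrongly assumes nothing exists past the gap
            }
            active.add(id);
        }
        return active;
    }

    /** GC decision: every ledger on the bookie that is not in the active set gets deleted. */
    static List<Integer> ledgersToDelete(SortedSet<Integer> onBookie, SortedSet<Integer> active) {
        List<Integer> doomed = new ArrayList<>();
        for (int id : onBookie) {
            if (!active.contains(id)) {
                doomed.add(id);
            }
        }
        return doomed;
    }

    public static void main(String[] args) {
        // Node 0003 was removed by the cleanup job; 0004 and 0005 still have metadata.
        SortedSet<Integer> zk = new TreeSet<>(Arrays.asList(1, 2, 4, 5));
        SortedSet<Integer> bookie = new TreeSet<>(Arrays.asList(1, 2, 4, 5));
        SortedSet<Integer> active = scanStoppingAtFirstGap(zk, 1, 9999);
        System.out.println(ledgersToDelete(bookie, active)); // prints [4, 5]
    }
}
```

In this model, ledgers 4 and 5 are selected for deletion even though their metadata still exists, which is the data loss described above.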

So, we have seen data loss multiple times, which could be prevented if GC performed a sanity check before cleaning a ledger from the bookie.

Modifications

Add sanity check before cleaning up the ledger.
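
A minimal sketch of what such a check could look like, assuming a hypothetical `MetadataStore` interface with an asynchronous existence check and a hypothetical `OK` return code. It mirrors the latch-based shape of the diff under review but is not the patch itself. The ledger is deleted only when the check succeeds and reports that the metadata is gone; on any error or interruption the ledger is kept.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.BiConsumer;
import java.util.function.LongConsumer;

// Hypothetical sketch; names and return codes are assumptions, not BookKeeper API.
class SafeGcSketch {
    static final int OK = 0; // assumed "success" return code

    /** Stand-in for the ledger manager's asynchronous existence check. */
    interface MetadataStore {
        void existsLedgerMetadata(long ledgerId, BiConsumer<Integer, Boolean> callback);
    }

    /** Deletes the ledger only if the store confirms its metadata is gone. Returns true if deleted. */
    static boolean gcLedgerSafely(MetadataStore store, LongConsumer cleaner, long ledgerId) {
        CountDownLatch latch = new CountDownLatch(1);
        AtomicBoolean safeToDelete = new AtomicBoolean(false);
        store.existsLedgerMetadata(ledgerId, (rc, exists) -> {
            // A failed or ambiguous check never marks the ledger as deletable.
            if (rc == OK && !exists) {
                safeToDelete.set(true);
            }
            latch.countDown();
        });
        try {
            latch.await(); // block until the callback has run
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false; // interrupted: keep the ledger
        }
        if (safeToDelete.get()) {
            cleaner.accept(ledgerId);
            return true;
        }
        return false;
    }

    public static void main(String[] args) {
        // Ledger 4 still has metadata; every other ledger is reported as missing.
        MetadataStore store = (id, cb) -> cb.accept(OK, id == 4L);
        System.out.println(gcLedgerSafely(store, id -> { }, 4L)); // prints false
        System.out.println(gcLedgerSafely(store, id -> { }, 9L)); // prints true
    }
}
```

The design choice in this sketch is to fail closed: keeping a ledger that could have been collected only costs disk space until the next GC run, while deleting an active ledger loses data.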

Result

It can prevent possible data loss due to any bug in GC.

eolivelli · 2018-01-10T19:51:03Z

bookkeeper-server/src/main/java/org/apache/bookkeeper/meta/AbstractZkLedgerManager.java

@@ -350,6 +348,23 @@ public void readLedgerMetadata(final long ledgerId, final GenericCallback<Ledger
        readLedgerMetadata(ledgerId, readCb, null);
    }

+    @Override
+    public void existLedgerMetadata(final long ledgerId, final GenericCallback<Boolean> callback) {


Typo: 'exists' instead of 'exist'?

revans2

Overall this looks good. I am a little confused about one of the design choices. I would also like to see a corresponding pull request go into open source before I merge it in here.

revans2 · 2018-01-10T21:30:26Z

...keeper-server/src/main/java/org/apache/bookkeeper/bookie/ScanAndCompareGarbageCollector.java

+                LOG.warn("Fail to check {} exists in zk {}", ledgerId, BKException.getMessage(rc));
+            }
+            latch.countDown();
+            semaphore.release();


I don't think we need a semaphore at all. Except during unit tests, the gc method is called by scheduleAtFixedRate, which guarantees that later calls to gc will not happen until the previous ones finish.

If garbageCleaner.clean were inside the callback then the semaphore would make sense, but with the CountDownLatch this is a blocking call, so it really doesn't.

Yes, I forgot to remove it after adding the latch. Will fix it.
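
The non-overlap guarantee mentioned above comes from `ScheduledExecutorService.scheduleAtFixedRate` in the JDK: if a run takes longer than the period, later runs start late but do not execute concurrently. The following is a small sketch that observes this; the class name and timings are illustrative assumptions, and this is not the bookie's scheduling code.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical demo class, not part of BookKeeper.
class FixedRateNoOverlapSketch {

    /** Runs a slow task at a short fixed rate and returns the highest number of concurrent runs seen. */
    static int maxConcurrentRuns(long periodMs, long taskMs, long totalMs) {
        AtomicInteger running = new AtomicInteger();
        AtomicInteger maxSeen = new AtomicInteger();
        // Several threads are available, yet runs of the same task still do not overlap.
        ScheduledExecutorService scheduler = Executors.newScheduledThreadPool(4);
        scheduler.scheduleAtFixedRate(() -> {
            int now = running.incrementAndGet();
            maxSeen.accumulateAndGet(now, Math::max);
            try {
                Thread.sleep(taskMs); // simulate a GC pass that outlasts the period
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                running.decrementAndGet();
            }
        }, 0, periodMs, TimeUnit.MILLISECONDS);
        try {
            Thread.sleep(totalMs);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        scheduler.shutdownNow();
        return maxSeen.get();
    }

    public static void main(String[] args) {
        System.out.println(maxConcurrentRuns(10, 50, 300)); // prints 1
    }
}
```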

rdhabalia · 2018-01-10T21:57:01Z

+@merlimat @sijie

sijie · 2018-01-11T01:55:44Z

FYI, I think this is similar to the change Salesforce made in apache/bookkeeper@1f8b26d.

saandrews · 2018-01-11T17:35:33Z

@sijie That change still does not address cases where due to some other bug a valid ledger is deleted.

saandrews

Looks good

saandrews · 2018-01-11T17:43:15Z

...keeper-server/src/main/java/org/apache/bookkeeper/bookie/ScanAndCompareGarbageCollector.java

+    private void gcLedgerSafely(GarbageCleaner garbageCleaner, long ledgerId, LedgerManager ledgerManager) throws InterruptedException {
+        CountDownLatch latch = new CountDownLatch(1);
+        AtomicBoolean ledgerDeleted = new AtomicBoolean(false);
+        ledgerManager.existsLedgerMetadata(ledgerId, (rc, exists) -> {


Do we need the second param exists?

merlimat · 2018-01-11T17:56:37Z

> @sijie That change still does not address cases where due to some other bug a valid ledger is deleted.

Actually, I think it's the same behavior, because it forces a read of the ledger metadata, which means reading the znode.

saandrews · 2018-01-11T18:08:18Z

I see. Did we merge that pull request? I see it got closed, so not sure if it got approved eventually.

sijie · 2018-01-11T18:31:57Z

apache#876 was approved and merged. The git SHA I pointed out is the change that already exists in the latest apache/master. I think the change in apache and the change here are almost the same; either merging this one to yahoo or cherry-picking the apache change works for me.

rdhabalia · 2018-01-11T18:49:53Z

Thanks for pointing out the commit. I think just checking whether the znode exists might be a more efficient approach than reading the znode and transferring the metadata over the network for each ledger. But I am not sure if it would be a big deal. So, @revans2, should we cherry-pick the apache/bookkeeper#876 commit to stay in sync and close this PR?

revans2 · 2018-02-07T16:43:56Z

@rdhabalia could you take a look at #22 and see if it is good enough to replace this? It is a backport of the change that went into the apache repo that you suggested we look at instead.

Add sanity check in GC to prevent active ledger deletion

085ff76

eolivelli reviewed Jan 10, 2018

Fix typo for existLedgerMetadata(..)

037b9ba

revans2 reviewed Jan 10, 2018

remove semaphore

aaf386e

rdhabalia requested a review from merlimat January 10, 2018 22:09

saandrews approved these changes Jan 11, 2018

rdhabalia pushed a commit to rdhabalia/bookkeeper that referenced this pull request Sep 25, 2018

Improve the client api to a type-safe api (YahooArchive#15)

ef6f871
