
Add snapshots for model summaries #106

Closed · wants to merge 20 commits

Conversation

@jneeven (Contributor) commented Feb 21, 2020

TODO:

  • Try this directly from zookeeper without subprocess

tests/models_test.py: outdated review thread (resolved)
@jneeven jneeven marked this pull request as ready for review February 21, 2020 13:35
@jneeven (Contributor, Author) commented Feb 21, 2020

RIP TFDS; I'm gonna try to get that generic dummy data up and running.

@jneeven (Contributor, Author) commented Feb 27, 2020

@AdamHillier I know this is my PR, but could you update this to use #119? I can also do it myself, but probably not today.

@leonoverweel (Contributor) commented Feb 27, 2020

I don't think I like having the model summary strings in tests/snapshots/snap_models_test.py. If we put them all in separate .txt files (named by their models), it'll be easier to track model changes over time and find/link to individual model summaries.

@jneeven (Contributor, Author) commented Feb 27, 2020

I don't think I like having the model summary strings in tests/snapshots/snap_models_test.py. If we put them all in separate .txt files (named by their models), it'll be easier to track model changes over time and find/link to individual model summaries.

This is not a choice I've made; it's just how snapshots work. I'm not sure whether what you suggest is possible without creating an explicit Python test file for every model separately...

Edit: obviously it is possible, but then we'd have to drop snapshottest and manually save, load, and compare the files. Sounds like a lot of hassle to me, but it could be okay. I also don't like the way snapshottest reports failures; if a comparison fails, it just prints the entire string in red without much indication of what's actually wrong.

@leonoverweel (Contributor)

Ah, I see. Since we're literally just comparing the string output of model.summary() to the text contents of a .txt file though, it shouldn't be too hard to implement without the snapshottest framework right?

@jneeven (Contributor, Author) commented Feb 27, 2020

Ah, I see. Since we're literally just comparing the string output of model.summary() to the text contents of a .txt file though, it shouldn't be too hard to implement without the snapshottest framework right?

Yes true, I think I like this suggestion

@AdamHillier (Contributor)

The nice thing about the snapshot module is that there is a single pytest command to update the snapshot. As long as there is a nice way to replicate that with a single command, I'm happy to have individual txt files.

@jneeven (Contributor, Author) commented Feb 27, 2020

The nice thing about the snapshot module is that there is a single pytest command to update the snapshot. As long as there is a nice way to replicate that with a single command, I'm happy to have individual txt files.

As long as we keep these summaries in a specific folder, we can just delete the folder and re-generate the snapshots that way?

@AdamHillier (Contributor)

As long as we keep these summaries in a specific folder, we can just delete the folder and re-generate the snapshots that way?

Sure, if that's how it works then that sounds good.
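
A minimal sketch of the .txt-file approach being discussed, assuming one snapshot file per model; the directory layout and helper names below are hypothetical, not taken from this PR:

```python
import pathlib

import tensorflow as tf

# Hypothetical snapshot location; deleting this folder re-generates all snapshots.
SNAPSHOT_DIR = pathlib.Path(__file__).parent / "snapshots"


def get_summary(model: tf.keras.Model) -> str:
    """Capture the output of model.summary() as a single string."""
    lines = []
    model.summary(print_fn=lines.append)
    return "\n".join(lines) + "\n"


def assert_summary_matches_snapshot(model: tf.keras.Model, name: str) -> None:
    snapshot_file = SNAPSHOT_DIR / f"{name}.txt"
    summary = get_summary(model)
    if not snapshot_file.exists():
        # Missing snapshots are written on the first run, so removing the
        # folder (or a single file) re-generates them on the next test run.
        snapshot_file.parent.mkdir(parents=True, exist_ok=True)
        snapshot_file.write_text(summary)
    assert summary == snapshot_file.read_text()
```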

tests/models_test.py: outdated review thread (resolved)
larq_zoo/train.py: outdated review thread (resolved)
@AdamHillier (Contributor)

I like this; once the isort linting is fixed, it should be good to go.

@leonoverweel (Contributor)

😍 love this!

tests/models_test.py: outdated review thread (resolved)
@jneeven (Contributor, Author) commented Apr 6, 2020

@larq/core For some reason, keras.backend.clear_session() (called in parametrize) no longer seems to have any effect; the second model to be tested will have layer names like input_2 etc. Does anyone have any idea what might be the issue here? I'm on TF 2.0.0 and manually calling clear_session() makes no difference...

On February 27th this was working fine, so something must have changed.
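
For reference, a common way to force fresh auto-generated layer names for every parametrized test case is an autouse fixture along these lines (a sketch, not necessarily what this PR uses):

```python
import pytest
import tensorflow as tf


@pytest.fixture(autouse=True)
def fresh_keras_session():
    # Reset global Keras state so auto-generated layer names
    # (input_1, conv2d_1, ...) start from scratch for every test case.
    tf.keras.backend.clear_session()
    yield
```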

@leonoverweel (Contributor)

It also looks like it's failing on fixture 'snapshot' not found. Maybe some requirements/imports got messed up when you merged master?

@jneeven (Contributor, Author) commented Apr 6, 2020

It also looks like it's failing on fixture 'snapshot' not found. Maybe some requirements/imports got messed up when you merged master?

Yeah that's indeed interesting, though snapshottest is still in the requirements and pytest is imported so I don't see why that wouldn't work...

@AdamHillier (Contributor)

Hopefully CI for this will pass now that #166 / #168 are merged.

@AdamHillier (Contributor)

There's surely something wrong with 018ed29; Larq Zoo shouldn't be on its own line, should it?

@jneeven (Contributor, Author) commented May 1, 2020

There's surely something wrong with 018ed29; Larq Zoo shouldn't be on its own line, should it?

Why not? It's not a third-party library in this case

@AdamHillier (Contributor)

Why not? It's not a third-party library in this case

Sorry, you're absolutely right; that was a dumb moment on my part 😂

@leonoverweel (Contributor)

So it looks like this is still failing because the auto-named layers have the wrong indices, since previously built models aren't being properly cleared out? Same problem as before :(

@AdamHillier (Contributor)

Haha, it turns out this code actually works just fine; the snapshots must have been generated while it wasn't working, because the layer names in the .txt files are definitely wrong. They just need to be re-generated :)

I'm about to push a commit that does that.

@AdamHillier (Contributor) commented May 5, 2020

Well, that's progress, but unfortunately it looks like TF 2.2 changes a default layer name, tf_op_layer_Mul -> tf_op_layer_mul. Not sure how we can get around that without setting layer names explicitly...

@jneeven (Contributor, Author) commented May 6, 2020

Well, that's progress, but unfortunately it looks like TF 2.2 changes a default layer name, tf_op_layer_Mul -> tf_op_layer_mul. Not sure how we can get around that without setting layer names explicitly...

I compare the summary strings manually, so we could just use assert snapshot.lower() == current.lower()... It's not extremely pretty, but I don't think it's necessarily a bad solution either; in general this makes the snapshots less brittle (I don't foresee any scenario in which this would cause issues).
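
A sketch of what that case-insensitive comparison could look like as a small helper (names are hypothetical):

```python
import pathlib


def assert_matches_snapshot(current: str, snapshot_file: pathlib.Path) -> None:
    snapshot = snapshot_file.read_text()
    # Case-insensitive compare, so TF-version-dependent auto layer names
    # (tf_op_layer_Mul vs tf_op_layer_mul) don't break the snapshot.
    assert snapshot.lower() == current.lower()
```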

@leonoverweel requested a review from lgeiger May 6, 2020 10:09
@leonoverweel (Contributor) commented May 6, 2020

Nice, exciting that this is ready now!

@AdamHillier (Contributor) left a review

Looks good to me 👍

@jneeven (Contributor, Author) commented May 7, 2020

@lgeiger I think this should be good to go, could you review this another time? Merging is currently blocked.

@jneeven (Contributor, Author) commented Aug 17, 2020

@larq/core do we still want this? If so, I'll resolve the conflicts and we can get this merged. If not, let's close this PR.

@AdamHillier (Contributor)

@larq/core do we still want this? If so, I'll resolve the conflicts and we can get this merged. If not, let's close this PR.

I don't have a strong desire for this, and it seems like it'd be a bunch of work to get it working, so I'd be minded to close it.

@lgeiger (Member) commented Aug 17, 2020

@larq/core do we still want this? If so, I'll resolve the conflicts and we can get this merged. If not, let's close this PR.

I'm fairly neutral on this, although I fear that these tests might be tricky to maintain across the multiple versions of TensorFlow that we support, since the snapshots might be slightly different depending on the TensorFlow version used to generate them.

@jneeven (Contributor, Author) commented Aug 18, 2020

Would it make sense to just compare the ModelProfiles instead? Then we wouldn't need to care about the layer names etc., but would at least be notified if, e.g., the number of MACs suddenly changes unintentionally.

@koenhelwegen (Contributor)

Would it make sense to just compare the ModelProfiles instead? Then we wouldn't need to care about the layer names etc., but would at least be notified if, e.g., the number of MACs suddenly changes unintentionally.

Would this catch issues that are not already covered by the unit tests for the ModelProfile itself? (https://github.com/larq/larq/blob/master/larq/models_test.py#L66)

@jneeven (Contributor, Author) commented Aug 19, 2020

Would this catch issues that are not already covered by the unit tests for the ModelProfile itself? (https://github.com/larq/larq/blob/master/larq/models_test.py#L66)

It would do that check for all the models, rather than just for the toy model used in the unit tests. That test is a good way to check that the calculations made in ModelProfile are still correct, but the idea behind the tests here is that we'd want to catch subtle things like accidentally enabling/disabling biases somewhere while updating some code.
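
A rough sketch of what a profile-style snapshot could record, using plain Keras attributes rather than larq's ModelProfile; all names and fields below are illustrative assumptions, not the actual implementation:

```python
import json

import tensorflow as tf


def model_profile(model: tf.keras.Model) -> dict:
    # Layer names are deliberately excluded so the snapshot stays robust to
    # auto-generated name changes across TensorFlow versions.
    return {
        "num_layers": len(model.layers),
        "trainable_params": int(
            sum(tf.keras.backend.count_params(w) for w in model.trainable_weights)
        ),
        "non_trainable_params": int(
            sum(tf.keras.backend.count_params(w) for w in model.non_trainable_weights)
        ),
        "layers_with_bias": sum(
            1 for layer in model.layers if getattr(layer, "use_bias", False)
        ),
    }


# A snapshot would then be one small JSON file per model, e.g.
# json.dumps(model_profile(model), indent=2)
```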

@jneeven (Contributor, Author) commented Aug 27, 2020

We decided that at this time, it's not worth the extra effort to change to a ModelProfile snapshot solution. I'll close this PR for now, but we can re-open it in the future if we do decide this is necessary.

@jneeven closed this Aug 27, 2020
Development

Successfully merging this pull request may close these issues:

  • Snapshot tests of model summaries

5 participants