printing data frames with validation function information #115

viktorpm · 2024-01-31T12:41:35Z

Description

Implementing a new way of printing validation function results

What is this PR

Bug fix
Addition of a new feature
Other

Why is this PR needed?

Currently, only the lists of valid and invalid atlases are printed. One atlas can be in both lists as it might pass one validation function but not the other.

What does this PR do?

To get more information on why an atlas is invalid and which validation function it did not pass additional information is stored and printed in data frames.

How has this PR been tested?

It was tested locally and on the HPC

Is this a breaking change?

No

Does this PR require an update to the documentation?

No

Checklist:

The code has been tested locally
Tests have been added to cover all new functionality (unit & integration)
The documentation has been updated to reflect any changes
The code has been formatted with pre-commit

…frame

for more information, see https://pre-commit.ci

…re implemented on another branch

codecov · 2024-01-31T12:53:05Z

Codecov Report

Attention: 30 lines in your changes are missing coverage. Please review.

Comparison is base (a4acdb4) 0.00% compared to head (56686e1) 0.00%.
Report is 3 commits behind head on main.

❗ Current head 56686e1 differs from pull request most recent head dea87e4. Consider uploading reports for the commit dea87e4 to get more accurate results

Files	Patch %	Lines
bg_atlasgen/validate_atlases.py	0.00%	30 Missing ⚠️

Additional details and impacted files

@@          Coverage Diff          @@
##            main    #115   +/-   ##
=====================================
  Coverage   0.00%   0.00%           
=====================================
  Files         24      24           
  Lines       1943    1952    +9     
=====================================
- Misses      1943    1952    +9

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

alessandrofelder

Could you motivate the reason for needing better printing more (in the "Why is this PR needed" section of the description)?

pass additional information is stored

I don't understand what extra information is stored? I think all the info required is in our output now (the outputs are dictionaries, not lists, now, I think) but it may not be displayed nicely when printing.

Furthermore, I am not sure adding pandas as an extra dependency is necessary what you'd like achieve here.

We may want to move away from printing to writing a structured text file as output anyway - what do you think?

viktorpm · 2024-01-31T15:53:13Z

Thanks @alessandrofelder for the quick review. I'm a bit confused. I'm on the main branch and as far as I understand our outputs at the moment are the valid_atlases and the invalid_atlases lists: print(valid_atlases) and print(invalid_atlases).
They were defined earlier in the code as valid_atlases = [] and invalid_atlases = []
We also have access to the successful_validations and failed_validations dictionaries but we are not printing those. I wanted to have two tabular outputs with the list of atlases and the functions they passed/failed. Something like this:

Atlas	Function
allen_mouse_100um	validate_atlas_files
allen_mouse_100um	validate_mesh_matches_image_extents

I just made this branch to help me debug the test functions as I don't fully understand their behaviour and it would be nice to see in a lookup table what validation functions pass or fail on each atlas.

viktorpm · 2024-01-31T16:44:30Z

Also, writing a structured text file is a great idea! I'm fully on board 🙂

alessandrofelder · 2024-01-31T18:18:37Z

They were defined earlier in the code as valid_atlases = [] and invalid_atlases = []

Sorry, yes, you're right - they are lists (of dictionaries).

We also have access to the successful_validations and failed_validations dictionaries but we are not printing those.

We are, indirectly, right, because we append them to the aforementioned lists?

Maybe we actually want one object, containing all the necessary info to be able to generate:

Atlas	Function	Passed
allen_mouse_100um	validate_atlas_files	True
allen_mouse_100um	validate_mesh_matches_image_extents	False
...	...	...

…y the name of the successful and failed functions (not the function object) to lists in validate_atlases function

for more information, see https://pre-commit.ci

…smetics

for more information, see https://pre-commit.ci

viktorpm · 2024-02-01T17:53:28Z

Looks like it works as expected.
successful_validations.json
failed_validations.json

@alessandrofelder, is there a convention on where to save output files? For now, they are in the bg-atlasgen folder.

alessandrofelder · 2024-02-02T11:43:27Z

I think they should go in ~/.brainglobe/atlases/validation or something along those lines in the BrainGlobe user data folder?
This would be in line with our aspirations detailed in brainglobe/BrainGlobe#26

for more information, see https://pre-commit.ci

alessandrofelder

Two tiny comments for you to consider:

discussion point for a possible refactoring we could add on here
I think we can delete some variables that we've stopped using?

Looks great otherwise!

bg_atlasgen/validate_atlases.py

alessandrofelder · 2024-02-05T13:28:20Z

bg_atlasgen/validate_atlases.py

+            successful_validations[atlas_name].append(
+                validation_function.__name__
+            )


Suggested change

successful_validations[atlas_name].append(

validation_function.__name__

)

validations[atlas_name].append(

validation_function.__name__, None

)

(:arrow_up: Just a sketch... and just a discussion point)

Should we combine the two lists while we're refactoring this, and have them have the same format (str(error) if failed, and None if valid)? The advantage would be simplicity

just one file

shorter and easier to read code

But it's entirely possible there's value in keeping the things separate? What do you think?

Thanks for this suggestion! I was also thinking about doing this but wanted to have a chat first, as I wasn't sure how to implement it. I think it would be much better to have only one file with all the necessary information.

@alessandrofelder, I implemented these changes and tested them locally and on the HPC. Could you check them, please? 🙂
I think it's ready to be merged if you approve the changes.

result file:
validation_results.json

Thanks @viktorpm - As per our developer's guide around PRs, you have the liberty to merge if your reviewer approves with optional comments without asking for another round of review :) I have had another look though and looks great!

Co-authored-by: Alessandro Felder <[email protected]>

removing unused variables Co-authored-by: Alessandro Felder <[email protected]>

for more information, see https://pre-commit.ci

…smetics

viktorpm and others added 5 commits January 29, 2024 16:47

first test functions for validate_mesh_structure_pairs

d6d481b

storing atlases and successful/failed validation functions in a data …

512cd63

…frame

[pre-commit.ci] auto fixes from pre-commit.com hooks

3aaf90a

for more information, see https://pre-commit.ci

restoring test_validation.py to the original merged version. Chages a…

4e29467

…re implemented on another branch

restoring test_validation.py to the original merged version. Chages a…

84de0d1

…re implemented on another branch

viktorpm self-assigned this Jan 31, 2024

viktorpm requested a review from alessandrofelder January 31, 2024 13:14

alessandrofelder reviewed Jan 31, 2024

View reviewed changes

viktorpm and others added 2 commits February 1, 2024 16:25

validate_atlases.py: going back to the version on main, appending onl…

f6b0dc0

…y the name of the successful and failed functions (not the function object) to lists in validate_atlases function

[pre-commit.ci] auto fixes from pre-commit.com hooks

fd8a6a8

for more information, see https://pre-commit.ci

viktorpm mentioned this pull request Feb 1, 2024

Writing the results of failed and successful validation functions to a JSON file #116

Closed

viktorpm and others added 3 commits February 1, 2024 17:14

populating dictionaries in for loop, writing JSON files

a0ba87b

Merge branch 'cosmetics' of github.com:brainglobe/bg-atlasgen into co…

6d9707b

…smetics

[pre-commit.ci] auto fixes from pre-commit.com hooks

0bd3163

for more information, see https://pre-commit.ci

viktorpm and others added 4 commits February 2, 2024 14:12

saving JSON files to ~/.brainglobe/atlases/validation

a980f67

[pre-commit.ci] auto fixes from pre-commit.com hooks

eec486a

for more information, see https://pre-commit.ci

printing where to find the result files

398dccc

[pre-commit.ci] auto fixes from pre-commit.com hooks

56686e1

for more information, see https://pre-commit.ci

viktorpm requested a review from alessandrofelder February 2, 2024 14:41

viktorpm marked this pull request as ready for review February 2, 2024 16:03

alessandrofelder approved these changes Feb 5, 2024

View reviewed changes

viktorpm and others added 3 commits February 5, 2024 14:27

Update bg_atlasgen/validate_atlases.py

576d8d2

Co-authored-by: Alessandro Felder <[email protected]>

Update bg_atlasgen/validate_atlases.py

d7a7d3b

removing unused variables Co-authored-by: Alessandro Felder <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

defaa29

for more information, see https://pre-commit.ci

viktorpm and others added 6 commits February 6, 2024 13:23

Merge branch 'main' into cosmetics

16dee78

[pre-commit.ci] auto fixes from pre-commit.com hooks

8773286

for more information, see https://pre-commit.ci

saving only one JSON file with all the information

da4cc8f

[pre-commit.ci] auto fixes from pre-commit.com hooks

6c00f6d

for more information, see https://pre-commit.ci

uncommenting test functions

58722fd

Merge branch 'cosmetics' of github.com:brainglobe/bg-atlasgen into co…

dea87e4

…smetics

viktorpm requested a review from alessandrofelder February 9, 2024 17:49

alessandrofelder merged commit 03392f3 into main Feb 12, 2024
7 checks passed

alessandrofelder deleted the cosmetics branch February 12, 2024 10:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

printing data frames with validation function information #115

printing data frames with validation function information #115

viktorpm commented Jan 31, 2024 •

edited

Loading

codecov bot commented Jan 31, 2024 •

edited

Loading

alessandrofelder left a comment •

edited

Loading

viktorpm commented Jan 31, 2024

viktorpm commented Jan 31, 2024

alessandrofelder commented Jan 31, 2024 •

edited

Loading

viktorpm commented Feb 1, 2024

alessandrofelder commented Feb 2, 2024

alessandrofelder left a comment

alessandrofelder Feb 5, 2024

viktorpm Feb 5, 2024

viktorpm Feb 9, 2024 •

edited

Loading

alessandrofelder Feb 12, 2024

printing data frames with validation function information #115

printing data frames with validation function information #115

Conversation

viktorpm commented Jan 31, 2024 • edited Loading

Description

How has this PR been tested?

Is this a breaking change?

Does this PR require an update to the documentation?

Checklist:

codecov bot commented Jan 31, 2024 • edited Loading

Codecov Report

alessandrofelder left a comment • edited Loading

Choose a reason for hiding this comment

viktorpm commented Jan 31, 2024

viktorpm commented Jan 31, 2024

alessandrofelder commented Jan 31, 2024 • edited Loading

viktorpm commented Feb 1, 2024

alessandrofelder commented Feb 2, 2024

alessandrofelder left a comment

Choose a reason for hiding this comment

alessandrofelder Feb 5, 2024

Choose a reason for hiding this comment

viktorpm Feb 5, 2024

Choose a reason for hiding this comment

viktorpm Feb 9, 2024 • edited Loading

Choose a reason for hiding this comment

alessandrofelder Feb 12, 2024

Choose a reason for hiding this comment

viktorpm commented Jan 31, 2024 •

edited

Loading

codecov bot commented Jan 31, 2024 •

edited

Loading

alessandrofelder left a comment •

edited

Loading

alessandrofelder commented Jan 31, 2024 •

edited

Loading

viktorpm Feb 9, 2024 •

edited

Loading