Numerical stability is better when array sizes are powers of two #38

Open
syncrostone opened this issue Jul 13, 2022 · 0 comments

syncrostone (Collaborator) commented Jul 13, 2022

Problem: when running the same code multiple times with the same seeds, small numerical differences arise over the course of training. These differences disappear when array sizes are powers of two.

Suggestion: use array sizes that are powers of two for now.

Eventually I would like to implement a workaround (if TensorFlow doesn't have a built-in way to do this) where, if an array size is not a power of two, an array whose dimensions are powers of two is created in the background and the unneeded entries are set to 0, as in the sketch below. If this is relevant to you and you want to work on that workaround, please do (and drop a comment here so people don't duplicate work).
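A minimal sketch of what that padding step could look like, assuming TensorFlow tensors with static shapes; the helper names (`next_power_of_two`, `pad_to_power_of_two`) are hypothetical and not part of this repo:

```python
import tensorflow as tf

def next_power_of_two(n: int) -> int:
    """Smallest power of two >= n."""
    return 1 if n <= 0 else 2 ** (n - 1).bit_length()

def pad_to_power_of_two(x: tf.Tensor) -> tf.Tensor:
    """Zero-pad every dimension of `x` up to the next power of two."""
    paddings = [[0, next_power_of_two(d) - d] for d in x.shape]
    return tf.pad(x, paddings, mode="CONSTANT", constant_values=0)

# Example: a (3, 5) tensor becomes (4, 8); the extra entries are 0.
x = tf.reshape(tf.range(15, dtype=tf.float32), (3, 5))
print(pad_to_power_of_two(x).shape)  # (4, 8)
```

Downstream code would presumably slice results back to the original shape and take care that the zero-padded entries don't leak into reductions such as means or norms.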
