
Brainstorm about speeding up TJLF #4

Open
orso82 opened this issue Oct 26, 2023 · 5 comments

Comments

@orso82
Member

orso82 commented Oct 26, 2023

For typical use-cases each eigensolve tends to be fast (cost grows with the number of species and the number of basis functions). The main issue is that TJLF does a lot of eigensolves (on the order of ~200)!

Many of the eigensolves are used to find the widths of the modes. Could such widths be determined for only a few Ky's rather than for each one of them? For example, when the width is entered by the user, the same width is used for all Ky's. Perhaps we could calculate only a few widths (e.g. at the lowest and highest Ky) and interpolate between them?
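The interpolation idea above could be sketched as follows. This is only an illustration: `find_width` stands in for TJLF's expensive eigensolve-based width search, and the sampling choice (evenly spaced indices, piecewise-linear interpolation) is an assumption, not the package's actual method.

```julia
# Hypothetical sketch: run the expensive width search at only a few ky points
# and linearly interpolate onto the rest of the grid.
function interpolated_widths(ky_grid::AbstractVector, find_width::Function; nsamples::Int=3)
    # pick a few sample indices spanning the ky grid (log-spacing would also work)
    idx = round.(Int, range(1, length(ky_grid); length=nsamples))
    ky_s = ky_grid[idx]
    w_s = [find_width(ky) for ky in ky_s]  # the expensive part, done nsamples times
    # piecewise-linear interpolation onto the full grid
    widths = similar(float.(ky_grid))
    for (i, ky) in pairs(ky_grid)
        j = clamp(searchsortedlast(ky_s, ky), 1, length(ky_s) - 1)
        t = (ky - ky_s[j]) / (ky_s[j+1] - ky_s[j])
        widths[i] = (1 - t) * w_s[j] + t * w_s[j+1]
    end
    return widths
end
```

Whether a handful of samples is enough depends on how smoothly the optimal width varies with ky, which is exactly the physics question raised below.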

@bclyons12
Member

Ah, okay, this is the stuff I don't know anything about. Each call to eigen allocates some workspace (I believe) and then calls a non-allocating version:

function eigen(A::AbstractMatrix{TA}, B::AbstractMatrix{TB}; kws...) where {TA,TB}
    S = promote_type(eigtype(TA), TB)
    eigen!(eigencopy_oftype(A, S), eigencopy_oftype(B, S); kws...)
end

If we can preallocate the workspace and call eigen! directly, maybe that would yield some improvement. I'm not sure, though. It does look like most of the time is spent in ggev!
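One way the preallocation could look in practice: keep a pair of work matrices alive across calls, copy each new problem into them, and call `LAPACK.ggev!` directly (which overwrites its inputs). This is a sketch under the assumption of complex matrices (`ggev!` returns a 4-tuple `(alpha, beta, vl, vr)` for complex element types; the real-valued method returns a different tuple), and `EigenWorkspace`/`solve!` are illustrative names, not TJLF API.

```julia
using LinearAlgebra
using LinearAlgebra: LAPACK

# Sketch: reuse preallocated buffers across many generalized eigensolves,
# avoiding the copies that eigen(A, B) makes on every call.
struct EigenWorkspace{T}
    Awork::Matrix{T}
    Bwork::Matrix{T}
end
EigenWorkspace{T}(n::Int) where {T} =
    EigenWorkspace{T}(Matrix{T}(undef, n, n), Matrix{T}(undef, n, n))

function solve!(ws::EigenWorkspace{T}, A::AbstractMatrix, B::AbstractMatrix) where {T}
    # ggev! destroys its inputs, so copy each problem into the work buffers
    copyto!(ws.Awork, A)
    copyto!(ws.Bwork, B)
    # 'N', 'V': skip left eigenvectors, compute right eigenvectors
    alpha, beta, _, vr = LAPACK.ggev!('N', 'V', ws.Awork, ws.Bwork)
    return alpha ./ beta, vr  # generalized eigenvalues λ = α/β and eigenvectors
end
```

Note this only removes the matrix copies that `eigen` makes; `ggev!` itself still allocates its LAPACK scratch arrays internally, which is consistent with most of the profiled time showing up there.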

@tomneiser
Collaborator

The width depends on the type of mode. Above a certain ky we can expect to be dealing only with ETGs, so there could be some improvement for, say, k_y > 10. Otherwise we would need to run a form of mode-ID routine that guesses the best width based on inputs or initial eigenvalues.
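The ky-threshold shortcut could be as simple as the following. Everything here is hypothetical: the threshold value, the single ETG width, and the `find_width` stand-in for the eigensolve-based search are assumptions for illustration, not values from TJLF.

```julia
# Hypothetical mode-ID shortcut: above an assumed ky threshold, reuse one
# precomputed ETG width; below it, fall back to the expensive width search.
function width_for_ky(ky::Real, find_width::Function; ky_etg::Real=10.0, etg_width::Real=0.3)
    return ky > ky_etg ? etg_width : find_width(ky)
end
```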

@orso82
Member Author

orso82 commented Oct 27, 2023

@bclyons12 there could certainly be some allocation savings if we preallocated the workspace. As you know, we just need to be careful that, if we eventually want to multi-thread the calculation per k_y, each thread uses its own work matrix.
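One sketch of the per-thread-workspace pattern, assuming complex matrices and illustrative names throughout. A `Channel` acting as a workspace pool is used here instead of indexing buffers by `threadid()`, since on recent Julia versions tasks can migrate between threads mid-execution.

```julia
using LinearAlgebra
using Base.Threads

# Sketch: a pool of preallocated work matrices shared via a Channel, so each
# parallel ky eigensolve checks out its own buffers and returns them when done.
function threaded_eigensolves(problems::Vector{<:Tuple{Matrix{ComplexF64},Matrix{ComplexF64}}})
    n = size(first(problems)[1], 1)
    pool = Channel{NTuple{2,Matrix{ComplexF64}}}(nthreads())
    for _ in 1:nthreads()
        put!(pool, (Matrix{ComplexF64}(undef, n, n), Matrix{ComplexF64}(undef, n, n)))
    end
    results = Vector{Vector{ComplexF64}}(undef, length(problems))
    @threads for i in eachindex(problems)
        Awork, Bwork = take!(pool)  # check out a workspace; blocks if none free
        copyto!(Awork, problems[i][1])
        copyto!(Bwork, problems[i][2])
        # 'N', 'N': eigenvalues only; ggev! overwrites the work buffers
        alpha, beta, _, _ = LinearAlgebra.LAPACK.ggev!('N', 'N', Awork, Bwork)
        results[i] = alpha ./ beta
        put!(pool, (Awork, Bwork))  # return the workspace to the pool
    end
    return results
end
```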

@orso82
Member Author

orso82 commented Oct 27, 2023

@tomneiser should we expect those widths to change significantly between iterations of the FluxMatcher? If not, we could happily pay the price of calculating the optimal width once, and then use those same widths for all subsequent FluxMatcher iterations.

@tomneiser
Collaborator

tomneiser commented Oct 28, 2023

I think this is a good idea. The widths don't change every iteration; maybe every ~30 iterations we can re-evaluate them.
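The caching scheme agreed on above could be sketched like this. The type and function names (`WidthCache`, `get_widths!`, `compute_widths`) are hypothetical, not FluxMatcher or TJLF API; only the refresh-every-~30-iterations policy comes from the discussion.

```julia
# Hypothetical cache: compute the widths on the first call, then reuse them,
# refreshing via the expensive width search every refresh_every calls.
mutable struct WidthCache
    widths::Vector{Float64}
    iter::Int
    refresh_every::Int
end
WidthCache(refresh_every::Int=30) = WidthCache(Float64[], 0, refresh_every)

function get_widths!(cache::WidthCache, compute_widths::Function)
    if isempty(cache.widths) || cache.iter >= cache.refresh_every
        cache.widths = compute_widths()  # expensive eigensolve-based search
        cache.iter = 0
    end
    cache.iter += 1
    return cache.widths
end
```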
