
Support Multipass shaders (buffer A ...) #30

Open · wants to merge 49 commits into main

Conversation

@Vipitis (Collaborator) commented May 9, 2024

part of #4

Approximately 17.5% of public Shadertoys are multipass. Multipass allows up to four buffers (A through D) to be rendered to textures. These can also be used to store data and enable quite a few more kinds of experiences.

Some of the challenges include timing as well as cross inputs.
Buffer passes can seemingly take the exact same inputs as the main "Image" render pass, including other buffers (and themselves?).

This PR is starting to bloat a little and contains some refactoring of the whole channel-input concept... still in flux.

Instead, I will try to implement BufferTexture as a ShadertoyChannel subclass so it can hold, for example, the sampler settings.
Additionally, there will likely be a RenderPass base class and subclasses for Image, Buffer (A-D) and later Cube and Sound.
So the main Shadertoy class contains several render passes, and each of these gets its inputs (channels) attached.
I even started to try and sketch it out, but will have to sleep on this for a few more days... my concepts change every day, but I need to just try and work on the ideas for a bit.
(image: sketch of the planned render pass and channel structure)

The render order should be Buffer A through D and then Image. That way you can keep temporal data by using a buffer as its own input.
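
To make the intended structure a bit more concrete, here is a rough Python sketch of the class layout described above; all class, attribute and method names are placeholders for illustration, not the actual code in this PR:

```python
class ShadertoyChannel:
    """Base class for channel inputs; holds e.g. the sampler settings."""
    def __init__(self, sampler_settings=None):
        self.sampler_settings = sampler_settings or {}


class BufferTexture(ShadertoyChannel):
    """A buffer pass used as a channel input for another pass."""
    def __init__(self, buffer_pass, sampler_settings=None):
        super().__init__(sampler_settings)
        self.buffer_pass = buffer_pass


class RenderPass:
    """Base class for Image and Buffer passes (later also Cube and Sound)."""
    def __init__(self, code, channels=()):
        self.code = code
        self.channels = list(channels)  # up to 4 inputs (iChannel0..iChannel3)

    def draw(self):
        raise NotImplementedError


class BufferRenderPass(RenderPass):
    """Renders to an offscreen texture (Buffer A..D)."""
    def draw(self):
        ...


class ImageRenderPass(RenderPass):
    """Renders to the canvas."""
    def draw(self):
        ...


class Shadertoy:
    """Owns all render passes and draws them in order."""
    def __init__(self, image_pass, buffer_passes=()):
        self.buffer_passes = list(buffer_passes)  # Buffer A through D
        self.image_pass = image_pass

    def _draw_frame(self):
        # Buffers first, then Image, so a buffer can sample its own previous frame.
        for buffer_pass in self.buffer_passes:
            buffer_pass.draw()
        self.image_pass.draw()
```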

TODOs:

  • Refactor common code to ShadertoyChannel base class
  • additional test cases for inferred input types and empty channels (caching conflict with pytest)
  • dynamic headers
  • test coverage for examples in the README! (different PR)
  • Working buffer
  • working multiple buffers!
  • tests for buffers
  • multipass examples
  • (maybe) some debug mode where you can render the buffers to canvas? (you can use RenderDoc with "capture child processes")

@Vipitis (Collaborator, Author) commented May 24, 2024

Little update:
It seems to sorta work, but a bunch of stuff is still broken:

  • missing sampler filters fail to recreate some behavior, example: https://www.shadertoy.com/view/MfyXzV
  • having unsupported channels breaks the layout, example: https://www.shadertoy.com/view/ssjyWc (layout fixed by detecting channels in the common code; shader still broken; fixed in float32)
  • resizing the window breaks the nbytes of the previous frame - so maybe there needs to be a hook to the on_resize function to fill the new space (similar to behaviour when going fullscreen on the website)

New breaking examples found that might be unrelated to this PR, but I will note them down for later reference:

@Vipitis (Collaborator, Author) commented Jun 7, 2024

I think I finally fixed the compatibility issue. There are some small visual issues which look like precision problems to me (not sure yet). And the performance seems horrible... Please let me know if you find any shaders that are broken (not due to missing features or wgpu bugs).

Will work on tests, examples and documentation to hopefully get this ready for next week.

E: found this one seemingly broken: wgpu-shadertoy https://www.shadertoy.com/view/tsKXR3
A detailed example where the precision of the alpha channel is different: https://www.shadertoy.com/view/wsjSDt

@Vipitis marked this pull request as ready for review on June 21, 2024, 19:13
@Vipitis (Collaborator, Author) commented Jun 21, 2024

I think this is finally ready for review - and I welcome some feedback.

  • This PR refactors the whole inputs/channels to be easily extendable with the missing channel types.
  • Multipass shaders are quite complex, and I learned quite a lot to get this running, but I think my implementation is what is minimally required for wgpu (including redoing the sampler and pipeline).
  • resizing is really janky, since you download the texture and pad it with numpy only to upload it again, but I wanted to mirror the website's behaviour (this includes breaking quite a few shaders); see the sketch after this list
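
As an illustration, a minimal sketch of that download/pad/upload step with wgpu-py and numpy, assuming an rgba32float buffer texture; the function and variable names (old_texture, new_texture, old_size, new_size) are hypothetical, and row-alignment details are glossed over:

```python
import numpy as np

def resize_buffer(device, old_texture, old_size, new_texture, new_size):
    # old_size / new_size are (width, height); rgba32float = 16 bytes per pixel
    bpp = 16
    old_w, old_h = old_size
    new_w, new_h = new_size

    # 1. Download the previous frame from the GPU.
    data = device.queue.read_texture(
        {"texture": old_texture, "mip_level": 0, "origin": (0, 0, 0)},
        {"offset": 0, "bytes_per_row": old_w * bpp, "rows_per_image": old_h},
        (old_w, old_h, 1),
    )
    frame = np.frombuffer(data, dtype=np.float32).reshape(old_h, old_w, 4)

    # 2. Pad (or crop) with numpy so the data matches the new texture size.
    padded = np.zeros((new_h, new_w, 4), dtype=np.float32)
    h, w = min(old_h, new_h), min(old_w, new_w)
    padded[:h, :w] = frame[:h, :w]

    # 3. Upload into the (re-created) buffer texture.
    device.queue.write_texture(
        {"texture": new_texture, "mip_level": 0, "origin": (0, 0, 0)},
        padded,
        {"offset": 0, "bytes_per_row": new_w * bpp, "rows_per_image": new_h},
        (new_w, new_h, 1),
    )
```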

@Vipitis changed the title from "[WIP] Support Multipass shaders (buffer A ...)" to "Support Multipass shaders (buffer A ...)" on Jun 24, 2024
@hmaarrfk (Contributor) commented:

Cool stuff!

@Vipitis (Collaborator, Author) commented Aug 13, 2024

@Korijn will you be able to help with a review of this? It would be great to get this merged and get a v0.2 released in the next couple of weeks.

.github/workflows/ci.yml (outdated review thread, resolved)
README.md (review thread, resolved)

# shadertoy source: https://www.shadertoy.com/view/ssjyWc by FabriceNeyret2 (CC-BY-NC-SA-3.0?)

# current "bug": the string kinda floats off to the upper right corner, without any inputs... ? Likely to be some issue with the implementation of buffers.
Contributor (review comment):
Still applicable?

Collaborator (Author, review comment):

still happens with the wgpu22.1 branch of wgpu-py

It could be a variety of causes, so I am not even sure if it's something with this implementation.
We could search for an example that seems to be working as expected. I wanted to have one example that includes multiple buffers and complex interactions between them.

@Korijn (Contributor) commented Aug 21, 2024

This branch is pretty huge, so I'll resume the review later. Sorry, I'm still getting used to my new work rhythm and finding a place for pygfx. Let me know if there are any specific parts of the diff I should focus my attention on first.

@Vipitis (Collaborator, Author) commented Aug 21, 2024

No worries, this is sorta a large rewrite. Perhaps others can help too, time permitting.

Let me know if there are any specific parts of the diff I should focus my attention on first.

The part I am most unsure about is _update_textures, since it feels really inefficient to make a new texture for every single frame. That includes a new binding and sampler too. I tried to use TextureView instead, which worked much better, but I couldn't get it to work when the buffer passes also sample buffer inputs. I feel like I am missing something.
The added overhead makes the example from the API tests run at like 45 fps, for example.

@Korijn (Contributor) commented Sep 2, 2024


Maybe @almarklein can weigh in on that issue?

@almarklein (Member) commented:

Is _update_textures called every frame? It seems to rebuild everything from scratch, from descriptors all the way to the pipeline. I don't have a clear understanding of what happens and the path that leads up to _update_textures, but ideally you want to re-use the textures. If that's not possible, you can probably at least re-use the layouts.
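
To illustrate the "re-use the layouts" suggestion: the bind group layout can be created once per pass, and only the bind group needs to be rebuilt when a channel's texture view changes. A hedged wgpu-py sketch with placeholder names, not the code in this PR:

```python
import wgpu

def create_channel_layout(device):
    # Created once per render pass: one sampled texture plus one sampler per channel.
    return device.create_bind_group_layout(
        entries=[
            {
                "binding": 0,
                "visibility": wgpu.ShaderStage.FRAGMENT,
                "texture": {"sample_type": wgpu.TextureSampleType.float},
            },
            {
                "binding": 1,
                "visibility": wgpu.ShaderStage.FRAGMENT,
                "sampler": {"type": wgpu.SamplerBindingType.filtering},
            },
        ]
    )

def rebuild_bind_group(device, layout, texture_view, sampler):
    # Only this part needs to run when the channel's texture view changes.
    return device.create_bind_group(
        layout=layout,
        entries=[
            {"binding": 0, "resource": texture_view},
            {"binding": 1, "resource": sampler},
        ],
    )
```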

@Vipitis (Collaborator, Author) commented Sep 4, 2024

Is _update_textures called every frame?

Yeah, it will be called for each render pass, every frame, and then also iterate through all channels... I think this whole method doesn't actually need to exist.
I will try to make some changes that simply use a texture view for all the channels, and then have the buffer render to a temporary render target texture before overwriting the old texture. This was likely the cause of the usage conflicts I had on the previous attempt, as it's common to have the previous frame as one of the inputs.
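
A rough sketch of that idea in wgpu-py terms, assuming each buffer pass keeps a persistent texture (sampled by channels) plus a same-sized temporary texture used as the render target; all names (temp_texture, target_texture, pipeline, bind_group) are placeholders:

```python
import wgpu

def draw_buffer_pass(device, buffer_pass):
    encoder = device.create_command_encoder()

    # Render into the temporary texture; the persistent texture can still be
    # sampled as a channel input (e.g. this buffer's own previous frame).
    render_pass = encoder.begin_render_pass(
        color_attachments=[
            {
                "view": buffer_pass.temp_texture.create_view(),
                "load_op": wgpu.LoadOp.clear,
                "store_op": wgpu.StoreOp.store,
                "clear_value": (0.0, 0.0, 0.0, 1.0),
            }
        ]
    )
    render_pass.set_pipeline(buffer_pass.pipeline)
    render_pass.set_bind_group(0, buffer_pass.bind_group)
    render_pass.draw(3)  # full-screen triangle (placeholder)
    render_pass.end()

    # After rendering, overwrite the persistent texture with the new frame.
    size = (buffer_pass.width, buffer_pass.height, 1)
    encoder.copy_texture_to_texture(
        {"texture": buffer_pass.temp_texture, "mip_level": 0, "origin": (0, 0, 0)},
        {"texture": buffer_pass.target_texture, "mip_level": 0, "origin": (0, 0, 0)},
        size,
    )
    device.queue.submit([encoder.finish()])
```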

This already works well, but I will try to run some more examples before I push the commits. It will likely break resizing, for which there are no tests in CI (but resizing in this PR is horrible too).

@Vipitis (Collaborator, Author) commented Sep 23, 2024

Resizing now works again as it should. It's not the cleanest solution, but it works.
I do feel like the performance is really bad again, but I need a proper way to test that, as I am also using wgpu-py@main currently...

The two CI failures are sorta unrelated: one is deprecated actions and the other is Python 3.8 not being happy with the excessive typing. Python 3.8 will be EOL in a few days, so I no longer care.
Will look at the PRs in wgpu-py that updated all the CI stuff and open something separate tomorrow.

Comment on lines 245 to 249
- device=self._device, format=wgpu.TextureFormat.bgra8unorm
+ device=self._device, format=self._format
Member (review comment):

Can also set it to None to have it select the preferred format. Less code, unless you need self._format elsewhere. Note that the format is also accessible as texture.format on the texture obtained via get_current_texture().
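
For illustration, a small sketch of that suggestion, assuming an existing canvas and device; variable names are placeholders:

```python
present_context = canvas.get_context()
# Passing format=None lets the context pick the preferred format (per the comment above).
present_context.configure(device=device, format=None)

frame_texture = present_context.get_current_texture()
frame_format = frame_texture.format  # e.g. "bgra8unorm-srgb", usable instead of self._format
```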

Collaborator (Author, review comment):

Hm, I had a look, and the only other solution I can think of is to make it a property of the ImageRenderPass, as it's needed to create the render pipeline. The awkward part is that at init time for the Image class, the canvas (_present_context) might not be accessible via the parent Shadertoy instance.
It could be returned by this method instead of being passed via an attribute. That could be useful to have available when translating the snapshot back into RGB.
