Implement caching system to be used for page load functions #403

Macludde · 2024-07-09T14:23:08Z

This PR adds a naive in-memory caching system which works like this:

You have an async method you would like to cache. In 99% of cases this is a database request, but it also works for things like GitHub commit data.

You have to decide whether this data is user-specific or global. (For example, alerts are global because we want every visitor to see the same alerts, while most data is user-specific due to our access system.)

Depending on the answer, you wrap your async call in either globallyCached(...) or userLevelCached(...). This saves the return value in an in-memory cache on the server, and the next time the function is called, the cache is checked first.
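To make the wrapping idea concrete, here is a minimal sketch of how the two wrappers could look. It is not the code from this PR: only the name globallyCached and its (key, async function) call shape appear in the diff, while the userLevelCached signature, the key prefixes, and the cached helper are assumptions made for illustration.

```typescript
// Hypothetical sketch; the PR's real implementation may differ.
type CacheEntry = { value: unknown; expiresAt: number };

const DEFAULT_TTL_MS = 5 * 60 * 1000; // assumed default lifetime of 5 minutes
const entries = new Map<string, CacheEntry>();

// Shared helper: return a fresh cached value, or call the fetcher and store the result.
async function cached<T>(
  cacheKey: string,
  fetcher: () => Promise<T>,
  ttlMs: number = DEFAULT_TTL_MS,
): Promise<T> {
  const hit = entries.get(cacheKey);
  if (hit !== undefined && hit.expiresAt > Date.now()) {
    return hit.value as T; // fresh entry: skip the expensive call
  }
  const value = await fetcher();
  entries.set(cacheKey, { value, expiresAt: Date.now() + ttlMs });
  return value;
}

// One shared entry for all visitors.
function globallyCached<T>(
  key: string,
  fetcher: () => Promise<T>,
  ttlMs?: number,
): Promise<T> {
  return cached(`global:${key}`, fetcher, ttlMs);
}

// One entry per user (assumed signature), so users never read each other's data.
function userLevelCached<T>(
  key: string,
  userId: string,
  fetcher: () => Promise<T>,
  ttlMs?: number,
): Promise<T> {
  return cached(`user:${userId}:${key}`, fetcher, ttlMs);
}
```

In this sketch a call such as globallyCached("alerts", loadAlerts) only invokes loadAlerts when no fresh entry exists, and two different user IDs never share an entry.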

The cache is pruned periodically, and cache lifetime is set per resource (with a default of 5 minutes, IIRC).
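As a rough illustration of per-resource lifetimes and periodic pruning, the sketch below stores an expiry time with each entry and removes everything that has expired. The names put and pruneExpired and the explicit now parameter are assumptions for the example, not taken from the PR.

```typescript
// Hypothetical sketch of TTL bookkeeping; names are illustrative.
type TimedEntry = { value: unknown; expiresAt: number };

const timedStore = new Map<string, TimedEntry>();

// Store a value with its own lifetime. `now` is injectable to make the logic easy to test.
function put(key: string, value: unknown, ttlMs: number, now: number = Date.now()): void {
  timedStore.set(key, { value, expiresAt: now + ttlMs });
}

// Delete every entry whose lifetime has passed; returns how many were removed.
function pruneExpired(now: number = Date.now()): number {
  let removed = 0;
  timedStore.forEach((entry, key) => {
    if (entry.expiresAt <= now) {
      timedStore.delete(key); // deleting during Map.forEach is well-defined
      removed += 1;
    }
  });
  return removed;
}
```

In a long-lived server process, pruneExpired would typically be driven by a timer (for example setInterval), which keeps entries that are never read again from accumulating in memory.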


vercel · 2024-07-09T14:23:12Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
web	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jul 9, 2024 2:23pm

danieladugyan · 2024-07-14T21:40:50Z

I'll paste this comment from #400 here as well:

I'm not a huge fan of caching Prisma calls like this unless strictly necessary.

A longer explanation: leaving this as is kind of implies that this pattern is "correct" from an access control standpoint, and I will argue that it's not. Here's why:

AdminSetting has the following read policy: @@allow("read", has(auth().policies, "admin:settings:read"))

A user may or may not have this access policy. In this specific case, it doesn't matter because this function is currently only called with an authorized Prisma client (i.e. no access control). This could change in the future though – there's nothing enforcing this.

In general, caching data that's protected by access control policies requires careful consideration since it completely bypasses our access policy system. A privileged user's permissions could be used to fill the cache, and subsequent requests will succeed regardless of whether the user is authorized. Equally, a non-privileged user's permissions could be used to fill the cache (with empty data), and subsequent requests will fail regardless of whether the user is authorized. Neither of these cases is good.
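The failure mode can be reproduced in a few lines. The sketch below is hypothetical: readAdminSettings stands in for a policy-protected query (it is not a Prisma call), and cachedByKey is a deliberately naive cache with no expiry. It contrasts a shared cache key, which leaks whatever the first caller was allowed to see, with a key that includes the viewer.

```typescript
// Hypothetical sketch; none of these names come from the codebase.
type Viewer = { id: string; policies: string[] };

// Stand-in for a policy-protected query: rows are returned only to authorized viewers.
async function readAdminSettings(viewer: Viewer): Promise<string[]> {
  const allowed = viewer.policies.indexOf("admin:settings:read") !== -1;
  return allowed ? ["secret-setting"] : [];
}

const sharedValues = new Map<string, unknown>();

// Naive cache without expiry: the first caller's result is reused for the key.
async function cachedByKey<T>(key: string, fetcher: () => Promise<T>): Promise<T> {
  if (sharedValues.has(key)) {
    return sharedValues.get(key) as T;
  }
  const value = await fetcher();
  sharedValues.set(key, value);
  return value;
}

// Unsafe: the key ignores who is asking, so the first viewer decides what everyone sees.
function settingsSharedKey(viewer: Viewer): Promise<string[]> {
  return cachedByKey("adminSettings", () => readAdminSettings(viewer));
}

// Safer: the key includes the viewer, so results are never shared across users.
function settingsPerViewer(viewer: Viewer): Promise<string[]> {
  return cachedByKey(`adminSettings:${viewer.id}`, () => readAdminSettings(viewer));
}
```

Even the per-viewer variant keeps serving a stale answer if that viewer's policies change before the entry expires, so it narrows the problem rather than removing it.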

Macludde · 2024-07-18T16:02:20Z

I see your comment Daniel and guess it's disputing that perhaps ALL data is user-level, because even Alerts might some day not be the same for everyone. At some point I think we have to think about performance over potential future problems (which can be solved then), and right now our landing page takes almost a second to load, every time.

At least implementing user-level caching improves performance a lot for someone when you click around within the app, or go back and forth between pages.

Isak-Kallini · 2024-07-24T20:35:51Z

src/lib/server/loadHomeData.ts


 // COMMIT DATA
- const commitPromise = fetch("/api/home").then((res) => res.json());
+ const commitPromise = globallyCached("commitData", async () => {
+ const res = await fetch("/api/home");


The /api/home endpoint already caches the commit data to avoid getting rate limited by GitHub. We should probably extract that logic and remove the endpoint entirely.

danieladugyan · 2024-08-15T19:58:39Z

I've given this a lot of thought over the past weeks and frankly I haven't really reached any sort of conclusion. But I'll put down some thoughts here while we await opinions from other DWWW-members.

I see your comment Daniel and guess it's disputing that perhaps ALL data is user-level, because even Alerts might some day not be the same for everyone.

This is a great point that really puts our entire model for access control into question. I pushed for the idea of database-level access control because I hoped it would make it easy to enforce access control everywhere. There's no risk of ever forgetting to enable access control on a specific server function, because it's handled at a lower layer.

However, that starts to break down when we have operations that should ignore access control. We can easily spot these operations since they occur wherever authorizedClient is used. In principle, that's equivalent to this caching PR – it sidesteps our access control code. Needing to sidestep the access control model is a sign of a flawed access control model and to me it's a typical case of a leaky abstraction.

At some point I think we have to think about performance over potential future problems (which can be solved then), and right now our landing page takes almost a second to load, every time.

This is a very reasonable take, but not one I care much about (that's just a matter of priorities). I think a one second loading time is fine. Obviously it's not ideal, but I think it's unlikely to affect usage of our web page. Caching (as implemented in this PR), on the other hand, is famously error-prone (see especially this, or this quote), so I don't think it's worth the trade-off.

At least implementing user-level caching improves performance a lot for someone when you click around within the app, or go back and forth between pages.

It does, and I have always wanted user-level caching. However, my idea was to implement it using service workers on the client side. This seems less error-prone, but I haven't tried implementing it yet.

Summary:

Maybe we should rethink the way we handle access control? We could revert to defining access control rules manually in JavaScript for each operation. If we do then server-side caching could be done without overriding our conceptual model for access control.
I sometimes care more about DX than UX 😮
There are other approaches to caching. I think it's worth exploring some other options that might result in a much simpler developer experience: service workers, client-side data fetching, Prisma-level caching, Redis, etc.
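The first summary point can be sketched as follows: if access rules are checked explicitly in application code, the check runs on every request, including cache hits, so a shared server-side cache no longer bypasses it. Everything here (requirePolicy, ForbiddenError, loadSettingsFromDb, the cache map) is a hypothetical illustration, not existing code.

```typescript
// Hypothetical sketch of explicit, per-operation access control in front of a shared cache.
type Requester = { policies: string[] };

class ForbiddenError extends Error {}

// Explicit rule, evaluated in application code on every request.
function requirePolicy(requester: Requester, policy: string): void {
  if (requester.policies.indexOf(policy) === -1) {
    throw new ForbiddenError(`missing policy: ${policy}`);
  }
}

const settingsCache = new Map<string, string[]>();

// Stand-in for an unrestricted database read.
async function loadSettingsFromDb(): Promise<string[]> {
  return ["theme=dark"];
}

async function getAdminSettings(requester: Requester): Promise<string[]> {
  requirePolicy(requester, "admin:settings:read"); // runs even when the cache is warm
  const hit = settingsCache.get("adminSettings");
  if (hit !== undefined) {
    return hit;
  }
  const fresh = await loadSettingsFromDb();
  settingsCache.set("adminSettings", fresh);
  return fresh;
}
```

The trade-off is the one described above: every operation has to remember to call the check, which is exactly what database-level access control was meant to avoid.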

Macludde added 6 commits July 7, 2024 20:26

Implement custom in-memory caching system and implement it on home sc…

9f4d878

…reen

Add cache to app layout load function

b16b4f3

Prune cache every so often

c1f3fca

Add caching for committee pages

c4bbcb9

Cleanup a bit and mark where cache might be wanted

bc69d4f

Fix type error on board page

1489657

github-actions bot assigned Macludde Jul 9, 2024

danieladugyan mentioned this pull request Jul 14, 2024

Hide "staben" by denying reads of positions and mandates of them #400

Merged

Macludde requested a review from danieladugyan July 18, 2024 16:02

Isak-Kallini reviewed Jul 24, 2024


danieladugyan force-pushed the main branch from 8cce009 to 83e3661 Compare August 24, 2024 17:53

Macludde marked this pull request as draft August 25, 2024 14:09
