feat: Matching algo performance + Exclude duplicate matchings #1142
Conversation
Run the matching algorithm every n days while simulating match requests of real users.
As the diff shows, this has only a small impact on the matched subjects while improving the state match rate from 10% to 30%.
Nice work!
Keeps pupils a bit longer in the match pool to find a better match. This increases the average wait time from 4 days to 17 days, but increases both the number of subjects and states matched. Not sure if the trade-off is worth it, but now we have a mechanism for it that can be tuned.
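Such a retention knob could look roughly like the sketch below. All names here (`PoolEntry`, `MIN_DAYS_IN_POOL`, `splitPool`) are illustrative assumptions, not the actual implementation in this PR:

```typescript
// Hypothetical sketch: hold recent requests back from matching so the
// algorithm can wait for a better counterpart. Tuning MIN_DAYS_IN_POOL
// trades wait time against match quality.
interface PoolEntry {
  id: string;
  requestedAt: Date;
}

const MIN_DAYS_IN_POOL = 7; // tunable: higher = longer waits, better matches

function daysInPool(entry: PoolEntry, now: Date): number {
  return (now.getTime() - entry.requestedAt.getTime()) / (1000 * 60 * 60 * 24);
}

// Split the pool into entries that are matched now and entries that wait
// at least one more run of the algorithm.
function splitPool(pool: PoolEntry[], now: Date) {
  const ready = pool.filter((e) => daysInPool(e, now) >= MIN_DAYS_IN_POOL);
  const waiting = pool.filter((e) => daysInPool(e, now) < MIN_DAYS_IN_POOL);
  return { ready, waiting };
}
```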
    'new',
    1,
    {
      matchCountSum: 1044,
As the new algorithm is able to create more matches at about the same "quality" (roughly the same number of matching subjects and states), I think it is worth switching.
    'old',
    1000,
    {
      matchCountSum: 883,
It's interesting that the old algorithm prioritizes the number of matching states over the number of matching subjects and the number of matches created ...
    [
      'new',
      1000,
      {
Using such a fixed time series as a benchmark is not ideal - it can easily lead to overfitting on this particular series. Some sort of shuffling of the input would be good, but on the other hand that might lose time effects that are actually there (e.g. pupils / students signing up in bulk after marketing campaigns).
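One way to implement the suggested shuffling is a Fisher–Yates shuffle over the simulated requests, with an injectable random source so benchmark runs stay reproducible. This is a generic sketch, not code from this PR:

```typescript
// Fisher–Yates shuffle. Pass a seeded `random` function (returning [0, 1))
// to make benchmark runs deterministic; Math.random is only the default.
function shuffle<T>(items: T[], random: () => number = Math.random): T[] {
  const result = [...items];
  for (let i = result.length - 1; i > 0; i--) {
    const j = Math.floor(random() * (i + 1));
    [result[i], result[j]] = [result[j], result[i]];
  }
  return result;
}
```

Note the trade-off raised above still applies: shuffling removes real temporal clustering (e.g. sign-up spikes after campaigns), so it may make sense to benchmark both the original and the shuffled series.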
Splits the tests into small unit tests that are fast to run (`npm run unit`) and perf tests that might take longer (`npm run perf`); `npm run test` runs all of them.
Adds a performance test for the matching algorithm, which simulates real-world matching: pupils and students continuously request matches, every N days the matching algorithm is run on the current pools, and matched users are removed from the pools. At the end, it reports how many matches were created, how many subjects and states match, and how long pupils and students had to wait for their match.
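The simulation loop described above can be sketched as follows. This is a simplified illustration with made-up names and a trivial pairing step standing in for the real algorithm, assuming constant sign-up rates:

```typescript
// Simplified benchmark loop: users join the pools every day, the matcher
// runs every N days, and matched users leave the pools. The real perf test
// additionally scores subject and state overlap per match.
type User = { id: number; signupDay: number };

function simulate(days: number, runEveryNDays: number, signupsPerDay: number) {
  let pupilPool: User[] = [];
  let studentPool: User[] = [];
  let matches = 0;
  let totalWaitDays = 0;
  let nextId = 0;

  for (let day = 0; day < days; day++) {
    // Pupils and students continuously request matches:
    for (let i = 0; i < signupsPerDay; i++) {
      pupilPool.push({ id: nextId++, signupDay: day });
      studentPool.push({ id: nextId++, signupDay: day });
    }
    // Every N days, run the matching over the current pools
    // (here: naive first-come-first-served pairing):
    if (day % runEveryNDays === 0) {
      const pairs = Math.min(pupilPool.length, studentPool.length);
      for (let i = 0; i < pairs; i++) {
        totalWaitDays += day - pupilPool[i].signupDay;
        totalWaitDays += day - studentPool[i].signupDay;
      }
      matches += pairs;
      // Matched users are removed from the pools:
      pupilPool = pupilPool.slice(pairs);
      studentPool = studentPool.slice(pairs);
    }
  }
  return { matches, avgWaitDays: matches > 0 ? totalWaitDays / (2 * matches) : 0 };
}
```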
Extends the Matching Algorithm to exclude duplicate matches (as the old implementation does). This is currently implemented as a post-filter after the matching; I don't think it happens often enough to be worth considering in the assignment itself.
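A duplicate-exclusion post-filter along these lines could work as sketched below. The types and function names are assumptions for illustration, not the PR's actual code:

```typescript
// Post-filter: drop proposed matches whose pupil/student pair was already
// matched before. Runs after the assignment, as described in the PR.
interface Match {
  pupilId: number;
  studentId: number;
}

function excludeDuplicates(proposed: Match[], previous: Match[]): Match[] {
  // Encode each pair as a string key for O(1) lookup.
  const seen = new Set(previous.map((m) => `${m.pupilId}:${m.studentId}`));
  return proposed.filter((m) => !seen.has(`${m.pupilId}:${m.studentId}`));
}
```

A post-filter is the simpler design: the assignment stays unchanged, at the cost that a filtered-out pair leaves both users unmatched until the next run, which is acceptable if duplicates are rare.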
https://github.com/corona-school/project-user/issues/1316