🧪⚡ Load Testing & Discussion on Scalability and Performance #340

ff137 · 2023-05-31T08:08:31Z

What is the scalability of the system and where are the bottlenecks:
- if the cloud-api and the cloud-controller are in tip-top shape, then the bottlenecks should ultimately be on the agent side.
- we should simplify and optimise the processes in cloudapi as much as possible, and perhaps generate another cloud-controller that runs with a different framework -- that way we can compare controller performance as well, and get a clearer idea of what sort of performance improvements can be eked out.
- it also becomes important to define benchmark metrics: what is an acceptable duration for onboarding, issuing or verifying a credential, etc. "As fast as possible" is not useful because there will have to be trade-offs, so we must be able to say, e.g. "a minute is fine but slower than 2 minutes it will feel sluggish". That way we have a target to measure and scale against.
- can we predict where the main bottlenecks will be? At least the ones in our control: webhooks, trust registry, endorsing ... What can we start improving now to be prepared for scaling out?
What are the limitations of running a single multi-tennant instance, and how can this be scaled out with multiple instances? What are the unknowns, or further development that is required to achieve this? e.g. How can we ensure load balancing across instances?
- How many instances will we need in order to provision wallets for and transactions between 100'000 agents (while meeting our benchmark goals)? What about 1 million, or 10 million?
What are the different scenarios we should cover for proper load testing? Create a million wallets, create a million schemas, issue a million credentials, verify a million credentials, etc ... we need a checklist of these different categories to evaluate performance.
What is the fundamental bottleneck in the aries-cloud-agents that we can a) provide open source contribution to for improvement, or b) what alternative agent providers should we look out for that we want to be able to swap in and out of our api?

These are just some initial questions to spur further discussion, in the spirit of "start by getting things down on paper". And they aren't trivial, so we should answer what we can: defining our benchmark metrics, defining our test scenarios, etc, and getting down on what other unknowns still need to be answered.

ff137 · 2024-01-16T09:31:43Z

@rblaine95 it may be worth tracking your load testing results here. Unless it's already logged online elsewhere

ff137 added the question Further information is requested label Jan 16, 2024

ff137 changed the title ~~Load Testing & Discussion on Scalability and Performance~~ 🧪⚡ Load Testing & Discussion on Scalability and Performance Jul 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🧪⚡ Load Testing & Discussion on Scalability and Performance #340

🧪⚡ Load Testing & Discussion on Scalability and Performance #340

ff137 commented May 31, 2023

ff137 commented Jan 16, 2024

🧪⚡ Load Testing & Discussion on Scalability and Performance #340

🧪⚡ Load Testing & Discussion on Scalability and Performance #340

Comments

ff137 commented May 31, 2023

ff137 commented Jan 16, 2024