Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mark pod as failed if it is in ContainerCreating state for "x" amount of time #911

Open
ashnamehrotra opened this issue Nov 13, 2023 · 2 comments
Labels
enhancement New feature or request

Comments

@ashnamehrotra
Copy link
Contributor

ashnamehrotra commented Nov 13, 2023

Describe the solution you'd like
[A clear and concise description of what you want to happen.]
AKS ImageCleaner user request:
In order to allow for cleanup and new ImageJobs to be scheduled, allow users to set a timeout for ImageJob to be marked as failed with pods stuck in ContainerCreating state for "x" amount of time.

Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]

Environment:

  • Eraser version:
  • Kubernetes version: (use kubectl version):
@ashnamehrotra ashnamehrotra added the enhancement New feature or request label Nov 13, 2023
@ashnamehrotra
Copy link
Contributor Author

If pod is stuck due to an issue on the node, a work around here can be to use the eraser.sh/cleanup.filter to skip that node so it doesn't run into the issue on each repeat period of ImageJob.

@ashnamehrotra
Copy link
Contributor Author

cc @WilliamRockwellEvans

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant