Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adding cosine rewarmed scheduler #243

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

Tomerporian
Copy link
Contributor

image

Adding cosine rewarmed scheduler. Rewarming to where cosine would have been if running for the total number of steps - of both original and rewarmed runs.

There are two arguments that are used:

--cosine-rewarmed-target-steps - set the total number of steps.
--cosine-rewarmed-original-warmup - number of warmup steps in the runs before rewarming. default: 1000.

Choose base_lr to be the base lr you would use in the run with total number of steps. The new base_lr is computed within the scheduler

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant