-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Poor performance on a simple dataset #62
Comments
I had an interesting further finding. The problem might lie in the predict function of autoxgboost. If I extract the parameters using Following Kodi's code above, if we continue to run:
we get An extra issue is that to use 'dart' correctly, we need to pass argument
Output:
After setting |
@ja-thomas , I am not very familiar with mlr... I am curious how is the |
Hi, sorry for the late reply, I was away for a few days. thanks for the issue, this is indeed very surprising and I found the problem to be that we call For now I'll drop this step from the preprocessing, until it is fixed in mlrCPO. see here: mlr-org/mlrCPO#59 |
I'm not familiar with the statistical approach taken by mlrMBO, so excuse me if I'm missing something. Anyway, I was going to ask a question about overfitting in autoxgboost, but it looks like I actually have an underfitting problem. Below is a simple example using DART and mostly default settings. Training and test error for the autoxgboost model hovers near the SD and is much worse than that of linear regression. Increasing the number of iterations to 500 didn't seem to help. What am I doing wrong?
Output:
CCing my cow-orkers: @allanjust, @liuyanguu
The text was updated successfully, but these errors were encountered: