Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Garbled BIO tags in training files #2

Open
yvesscherrer opened this issue Sep 24, 2024 · 1 comment
Open

Garbled BIO tags in training files #2

yvesscherrer opened this issue Sep 24, 2024 · 1 comment

Comments

@yvesscherrer
Copy link

Hi! The en.train.conll as well as the projectedTrain files contain varying amounts of the three tags Orecurring_datetime, B-ecurring_datetime and I-ecurring_datetime. All versions seem affected, and both the fixed and unfixed files.

@robvanderg
Copy link
Collaborator

Thanks for the notification, I had already just heard about these, and assume that these were already in the original training data!, I have fixed them now in version 0.6.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants