How to train a model with the given dataset? #8

Open
nashid opened this issue Jan 13, 2022 · 0 comments

nashid commented Jan 13, 2022

Thanks for sharing the artefact. However, the steps are not clear enough for replication.

I have downloaded the dataset from https://github.com/lin-tan/CoCoNut-Artifact/releases/tag/training_data_1.0.0

Dataset for Python:

coconut/coconut-dataset/python/2010$ ls -rlt
add.txt
context.txt
meta.txt
rem.txt
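
As a first sanity check before any training step, the four files can be read side by side. This is only a sketch: it assumes the files are line-aligned (line i of rem.txt, add.txt, context.txt and meta.txt all describe the same example), which the artifact does not state explicitly. The function names `line_counts` and `read_examples` are hypothetical and are not part of the CoCoNut scripts.

```python
from contextlib import ExitStack
from pathlib import Path
from typing import Dict, Iterator

# File stems as they appear in the dataset directory listing.
FIELDS = ("rem", "add", "context", "meta")


def line_counts(year_dir: str) -> Dict[str, int]:
    """Count lines per file; differing counts mean the alignment assumption fails."""
    counts = {}
    for name in FIELDS:
        path = Path(year_dir) / f"{name}.txt"
        with open(path, encoding="utf-8", errors="replace") as fh:
            counts[name] = sum(1 for _ in fh)
    return counts


def read_examples(year_dir: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per example, ASSUMING the four files are line-aligned."""
    with ExitStack() as stack:
        handles = [
            stack.enter_context(
                open(Path(year_dir) / f"{name}.txt", encoding="utf-8", errors="replace")
            )
            for name in FIELDS
        ]
        # zip stops at the shortest file, so check line_counts() first.
        for lines in zip(*handles):
            yield dict(zip(FIELDS, (line.rstrip("\n") for line in lines)))
```

For example, `line_counts("coconut/coconut-dataset/python/2010")` should return the same number for all four keys if the assumption holds, and `next(read_examples(...))` shows what a single removed/added/context/meta record looks like.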

What needs to be done next?

There are many scripts under the ./source/ folder, but no instructions on how to train a model with the given dataset. It appears to me that the first step is to run ./source/tokenization/generate_data.py. Is that right?

Should we then run ./source/training/prepare_data.py?

Can you provide some steps, please? Looking forward to your help.
