
Are test_eval and test_llama the same data? #109

Open
Camellia-hz opened this issue Aug 9, 2024 · 6 comments

Comments
@Camellia-hz

Dear Author, hello. When I followed the data preparation in the challenge/readme documentation, I noticed that test_eval.json and test_llama.json are essentially the same data (both derived from test.json).
If I train my model on test_llama.json, generate output.json, and then evaluate it according to the documented method (using output.json and test_eval.json), wouldn't that be equivalent to assessing my model on its own training set? Is my understanding wrong?

@Camellia-hz
Author

@DevLinyan

@Camellia-hz
Author

@ChonghaoSima

@DevLinyan
Contributor

The files test_eval.json and test_llama.json contain the same data but in different formats. The evaluation can only be conducted using the specific format in test_eval.json.
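For anyone who wants to sanity-check this locally, here is a minimal sketch (not from the repo) that compares which samples the two files cover. The flat-list layout and the `id` field name are assumptions made for illustration only; adjust them to the actual schema used in the challenge data.

```python
import json

def load_ids(path):
    """Load a JSON file and collect its sample identifiers.

    Assumption: the file is a flat list of records, each carrying an "id"
    field. Adapt this to the real structure of test_eval.json / test_llama.json.
    """
    with open(path, "r") as f:
        data = json.load(f)
    return {str(item["id"]) for item in data}

llama_ids = load_ids("test_llama.json")
eval_ids = load_ids("test_eval.json")

# If the two files are the same data in different formats, the id sets
# should match exactly.
print("test_llama.json samples:", len(llama_ids))
print("test_eval.json samples:", len(eval_ids))
print("only in test_llama.json:", len(llama_ids - eval_ids))
print("only in test_eval.json:", len(eval_ids - llama_ids))
```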

@Camellia-hz
Author

Thanks for your reply. If so, is the evaluation valid? Since I am using test_llama.json to train my model, if I then use test_eval.json to evaluate it, doesn't that mean the training set and the evaluation set are the same?

@Camellia-hz
Author

@DevLinyan

@ChonghaoSima
Contributor

Not sure what you mean by "the training set and validation set are the same".

The evaluation is valid as long as you use our provided test file and submit to our official test server.
