About DST on delexicalized response #11

Open
Leezekun opened this issue Apr 21, 2022 · 3 comments

Comments

@Leezekun

Leezekun commented Apr 21, 2022

Hi, this is great work. Congratulations on the acceptance to ACL 2022.

I have a question about the DST model, though. It seems that the DST model is trained and evaluated on delexicalized responses. However, some slot values are mentioned in the non-delexicalized system responses. How can the model predict these slots correctly if it is trained and evaluated on delexicalized responses?
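
To make the concern concrete, here is a minimal sketch of what delexicalization does to a system response. The utterance, the slot names, the `[value_*]` placeholder format, and the `delexicalize` helper are all hypothetical illustrations, not code or data from this repository.

```python
import re
from typing import Dict


def delexicalize(response: str, slot_values: Dict[str, str]) -> str:
    """Replace each slot value in the response with a [value_<slot>] placeholder.

    Hypothetical helper for illustration only; the placeholder format is an assumption.
    """
    for slot, value in slot_values.items():
        response = re.sub(
            re.escape(value), f"[value_{slot}]", response, flags=re.IGNORECASE
        )
    return response


# Hypothetical system turn and the slot values it mentions.
response = "Pizza Hut City Centre is a cheap restaurant in the centre."
slot_values = {
    "name": "Pizza Hut City Centre",
    "pricerange": "cheap",
    "area": "centre",
}

delex = delexicalize(response, slot_values)
print(delex)

# If the user now says "book that one", the belief state needs the restaurant
# name, but that string survives only in the non-delexicalized response.
print("pizza hut city centre" in response.lower())
print("pizza hut city centre" in delex.lower())
```

In this sketch the restaurant name is present in the original response but absent from the delexicalized one, so a model that only sees the delexicalized context has no surface string to copy the value from.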

Thanks!

@yxuansu
Contributor

yxuansu commented Apr 21, 2022

Hi,

Thank you for your interest in our work. Following previous studies, we focus only on the delexicalized part of DST prediction. Given how the model is trained and evaluated, I assume its accuracy on non-delexicalized slots cannot be guaranteed. One way to improve this might be to switch training and evaluation to the non-delexicalized format of the data.

Best,

Yixuan

@Leezekun
Author

Hi, thanks for the reply.

Do you mean that the results comparing your model with other models (Tables 4 and 5) all come from training and evaluating on delexicalized responses? I have tried training and evaluating the model on non-delexicalized responses, and the performance seems better.

Thanks

@yxuansu
Contributor

yxuansu commented Apr 21, 2022

Yes, that's right. Our model is evaluated on the delexicalized responses. It is quite interesting to learn that the model performs better on non-delexicalized responses :-)
