Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

3.1: Dataset: Add a "language" field #744

Open
bact opened this issue Apr 29, 2024 · 0 comments
Open

3.1: Dataset: Add a "language" field #744

bact opened this issue Apr 29, 2024 · 0 comments
Labels
Profile:AI Artificial intelligence profile Profile:Dataset
Milestone

Comments

@bact
Copy link
Collaborator

bact commented Apr 29, 2024

AI team meeting 2024-04-24 made a decision to explore about the addition of a field/fields about language/linguistic properties of a dataset and a model.

While some repository may made this language filed a mandatory metadata,
we should keep in mind that not every datasets are having language aspect (or even human-associated data)
and not every models are language models.

  • No Assertion or Not Relevant could be a possible value.
  • IETF BCP 47 language tag can be considered as a standardized code

More discussion to follow.

@goneall goneall added the Profile:AI Artificial intelligence profile label Jun 10, 2024
@goneall goneall added this to the 3.1 milestone Jun 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Profile:AI Artificial intelligence profile Profile:Dataset
Projects
None yet
Development

No branches or pull requests

2 participants