Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enums for modelExplainability and anonymizationMethodUsed #868

Open
bact opened this issue Aug 29, 2024 · 0 comments
Open

Enums for modelExplainability and anonymizationMethodUsed #868

bact opened this issue Aug 29, 2024 · 0 comments
Labels
Profile:AI Artificial intelligence profile Profile:Dataset
Milestone

Comments

@bact
Copy link
Collaborator

bact commented Aug 29, 2024

From this snippet

{
    "type": "dataset_DatasetPackage",
    "dataset_anonymizationMethodUsed" : "pseudonymization",
    "description": "replace direct identifiers (such as name or social security number) with artificial identifiers to prevent the data from being directly linked back to the individual"
}
  • The anonymizationMethodUsed field is only used for a method name (possible to be from a list/enum).
  • Details of how it has been used is in the description filed.
  • Similar style of usage like this can be applied with modelExplainability field in AI Profile

This is the area that in 3.1 we can have new vocabularies:
AnonymizationMethod and ModelExplainabilityAlgorithm,
to be used with anonymizationMethodUsed and modelExplainability.

They may look similar to HashAlgorithm.

  • dataCollectionProcess and dataPreprocessing (all free-form text currently) are also potentially able to have these kind of enums.
  • We may even able to borrow steps from data processing "pipelines" in framework like langchain and huggingface for dataPreprocessing
@bact bact added Profile:AI Artificial intelligence profile Profile:Dataset labels Aug 29, 2024
@bact bact added this to the 3.1 milestone Aug 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Profile:AI Artificial intelligence profile Profile:Dataset
Projects
None yet
Development

No branches or pull requests

1 participant