Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Validator error - Dataset #207

Open
lopierra opened this issue Oct 23, 2024 · 0 comments
Open

Validator error - Dataset #207

lopierra opened this issue Oct 23, 2024 · 0 comments
Assignees
Labels
linkml Issues that require linkml development

Comments

@lopierra
Copy link
Member

Hi @madanucd - I am attempting to use the validator on some Dataset metadata and got the following error:

Validating dataset data from file: ..\..\INCLUDE\Study & Dataset\dataset_draft_2024-10-09.csv
Traceback (most recent call last):
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\venv\Scripts\\validate-data", line 6, in <module>
    sys.exit(main())
             ^^^^^^
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\src\data_validation\cli.py", line 36, in main
    validation_function(args.input_file, args.output)
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\src\data_validation\validation.py", line 40, in validate_dataset
    return validate_data(file_path, string_columns, validate_dataset_entry, output_path)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\src\data_validation\validation_utils.py", line 68, in validate_data
    clean_dataframe_strings(df, string_columns)
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\src\data_validation\validation_utils.py", line 17, in clean_dataframe_strings
    df[string_columns] = df[string_columns].map(clean_string)
                         ~~^^^^^^^^^^^^^^^^
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\venv\Lib\site-packages\pandas\core\frame.py", line 4108, in __getitem__
    indexer = self.columns._get_indexer_strict(key, "columns")[1]
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\venv\Lib\site-packages\pandas\core\indexes\base.py", line 6200, in _get_indexer_strict
    self._raise_if_missing(keyarr, indexer, axis_name)
  File "C:\Users\lopi\OneDrive - The University of Colorado Denver\Documents\INCLUDE_LinkML\include-linkml\venv\Lib\site-packages\pandas\core\indexes\base.py", line 6252, in _raise_if_missing
    raise KeyError(f"{not_found} not in index")
KeyError: "['data access'] not in index"
@lopierra lopierra assigned lopierra and madanucd and unassigned lopierra Oct 23, 2024
@lopierra lopierra added the linkml Issues that require linkml development label Oct 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
linkml Issues that require linkml development
Projects
None yet
Development

No branches or pull requests

2 participants