Human-in-the-loop Feature Discovery with AutoFeat

AutoTDA is a Python library that implements our human-in-the-loop methodology for feature discovery and is designed to run in any notebook environment. It selects relevant tables from a dataset collection and joins them to the base table, producing an augmented table. It then applies heuristic-based feature selection strategies to remove redundant or irrelevant features from this augmented table. The goal is to improve the efficiency and accuracy of subsequent machine learning operations.
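The two ideas in the paragraph above, joining a related table onto the base table and then filtering the added features, can be illustrated with a small standard-library sketch. This is not AutoTDA's implementation: the helper names (`left_join`, `pearson`, `select_features`), the thresholds, and the toy data are all assumptions made for illustration, and absolute Pearson correlation stands in for whatever heuristics the library actually uses.

```python
from math import sqrt


def left_join(base, other, key):
    """Left-join two lists of row dicts on `key` (assumes `key` is unique in `other`)."""
    lookup = {row[key]: row for row in other}
    other_columns = [c for c in other[0] if c != key]
    joined = []
    for row in base:
        match = lookup.get(row[key], {})
        # Unmatched base rows get None for every column of the joined table.
        joined.append({**row, **{c: match.get(c) for c in other_columns}})
    return joined


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences (0.0 if constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y) if std_x and std_y else 0.0


def select_features(rows, target, base_features, new_features,
                    min_relevance=0.3, max_redundancy=0.95):
    """Keep new features that are relevant to the target and not redundant.

    Illustrative heuristic only: relevance is |corr(feature, target)| and
    redundancy is |corr(feature, already kept feature)|.
    """
    def column(name):
        return [row[name] for row in rows]

    kept = list(base_features)
    # Consider the most relevant candidates first.
    ranked = sorted(new_features,
                    key=lambda f: abs(pearson(column(f), column(target))),
                    reverse=True)
    for feature in ranked:
        relevance = abs(pearson(column(feature), column(target)))
        if relevance < min_relevance:
            continue  # irrelevant to the target
        if any(abs(pearson(column(feature), column(k))) >= max_redundancy
               for k in kept):
            continue  # redundant with a feature we already have
        kept.append(feature)
    return kept


# Hypothetical toy data: a base table and one candidate table sharing `student_id`.
base = [
    {"student_id": 1, "hours": 2.0, "class": 0},
    {"student_id": 2, "hours": 4.0, "class": 0},
    {"student_id": 3, "hours": 6.0, "class": 1},
    {"student_id": 4, "hours": 8.0, "class": 1},
]
other = [
    {"student_id": 1, "minutes": 120.0, "attendance": 0.5, "shoe_size": 40},
    {"student_id": 2, "minutes": 240.0, "attendance": 0.6, "shoe_size": 42},
    {"student_id": 3, "minutes": 360.0, "attendance": 0.9, "shoe_size": 42},
    {"student_id": 4, "minutes": 480.0, "attendance": 0.8, "shoe_size": 40},
]

augmented = left_join(base, other, key="student_id")
selected = select_features(augmented, target="class",
                           base_features=["hours"],
                           new_features=["minutes", "attendance", "shoe_size"])
print(selected)
```

In this toy example `minutes` is dropped because it duplicates `hours`, and `shoe_size` is dropped because it is uncorrelated with the target, so only `attendance` is added to the base features.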

An overview of the AutoTDA pipeline is shown below.

Installing

Install and update using pip:

$ pip install -U autotda

Automatic Data Augmentation

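from autotda import TDA
autofeat = TDA()
# Base table to augment and the column to predict.
autofeat.set_base_table(base_table="school/base.csv", target_column="class")
# Dataset collection in which to look for tables to join.
autofeat.set_dataset_repository(dataset_repository=["school"])
# Run the augmentation end to end, without user intervention.
autofeat.augment_dataset()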

Human-in-the-loop Data Augmentation

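from autotda import TDA
autofeat = TDA()
# Same setup as the automatic mode: base table, target column, dataset collection.
autofeat.set_base_table(base_table="school/base.csv", target_column="class")
autofeat.set_dataset_repository(dataset_repository=["school"])
# Run the pipeline one stage at a time so each result can be inspected before the next call.
autofeat.find_relationships()
autofeat.compute_join_trees()
autofeat.evaluate_paths()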
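The staged calls above suggest a workflow in which relationships between tables are discovered first, join paths are derived from them, and the user can intervene between stages. The following standard-library sketch illustrates that idea only; it is not AutoTDA's algorithm. The function `enumerate_join_paths`, the `(left, right, column)` relationship format, and the table names are hypothetical.

```python
from collections import deque


def enumerate_join_paths(relationships, base, max_depth=2):
    """List join paths that start at `base` and use at most `max_depth` joins.

    `relationships` is an iterable of (left_table, right_table, join_column)
    tuples; this format is an assumption made for the sketch.
    """
    # Build an undirected adjacency list: a relationship can be followed both ways.
    graph = {}
    for left, right, column in relationships:
        graph.setdefault(left, []).append((right, column))
        graph.setdefault(right, []).append((left, column))

    paths = []
    queue = deque([[base]])
    while queue:  # breadth-first, so shorter paths are listed first
        path = queue.popleft()
        if len(path) > 1:
            paths.append(path)
        if len(path) - 1 >= max_depth:
            continue  # this path already uses the maximum number of joins
        for neighbour, _column in sorted(graph.get(path[-1], [])):
            if neighbour not in path:  # never revisit a table within one path
                queue.append(path + [neighbour])
    return paths


# Hypothetical relationships, as a discovery step might report them.
relationships = [
    ("base", "students", "student_id"),
    ("students", "teachers", "teacher_id"),
    ("base", "clubs", "club_id"),
]

all_paths = enumerate_join_paths(relationships, base="base")

# Human-in-the-loop step: the user rejects a relationship they consider
# spurious, and the paths are recomputed from the remaining ones.
reviewed = [r for r in relationships if r != ("base", "clubs", "club_id")]
reviewed_paths = enumerate_join_paths(reviewed, base="base")

print(all_paths)
print(reviewed_paths)
```

Recomputing paths after the user edits the relationship list is the design choice that keeps the person in control: every later stage sees only the relationships the user has accepted.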

Documentation

Documentation: https://autofeat.readthedocs.io/
Example Notebook: https://www.kaggle.com/zegermouw2/human-in-the-loop-tabular-data-augmentation
Demonstration Video: https://youtu.be/z3ZmR_A0nyE
