Welcome to the senselab
repo! This is a Python package for streamlining the processing and analysis of behavioral data, such as voice and speech patterns, with robust and reproducible methodologies.
Caution:: this package is still under development and may change rapidly over the next few weeks.
-
Modular design: Utilize a variety of task-specific transformations that can be easily integrated or used standalone, allowing for flexible data manipulation and analysis strategies.
-
Pre-built pipelines: Access pre-configured pipelines combining multiple transformations tailored for common analysis tasks, which help in reducing setup time and effort.
-
Reproducibility: Ensures consistent outputs through the use of fixed seeds and version-controlled processing steps, making your results verifiable and easily comparable.
-
Easy integration: Designed to fit into existing workflows with minimal configuration,
senselab
can be used alongside other data analysis tools and frameworks seamlessly. -
Extensible: Open to modifications and contributions, the package can be expanded with custom transformations and pipelines to meet specific research needs. Do you want to contribute? Please, reach out!
-
Comprehensive documentation: Comes with detailed documentation for all features and modules, including examples and guides on how to extend the package for other types of behavioral data analysis.
-
Performance Optimized: Efficiently processes large datasets with optimized code and algorithms, ensuring quick turnaround times even for complex analyses.
-
Interactive Examples: Includes Jupyter notebooks that provide practical examples of how
senselab
can be implemented to derive insights from real-world data sets.
Whether you're researching speech disorders, analyzing customer service calls, or studying communication patterns, senselab
provides the tools and flexibility needed to extract meaningful conclusions from your data.
Install this package via:
pip install senselab
Or get the newest development version via:
pip install git+https://github.com/sensein/senselab.git
from senselab.app import hello_world
hello_world()
Please see CONTRIBUTING.md before contributing.
-
Integrating more multi-modal tasks
- input_output
-
Integrating more audio tasks and moving functions from b2aiprep package:
- data_augmentation
- data_representation (embeddings and feats extraction)
- raw_signal_processing
- speaker_diarization [@Isaac]
- speech emotion recognition [@Jordan]
- speech enhancement
- speech_to_text
- text_to_speech
- voice conversion
-
Integrating more video tasks:
- input_output
-
Preparing some pipelines with pydra
-
Populating the CLI
- Check login with HF
- Setup project cache directory