Skip to content
forked from sensein/senselab

PipePal is a Python package that simplifies building pipelines for speech and voice analysis.

License

Notifications You must be signed in to change notification settings

ibevers/senselab

 
 

Repository files navigation

The senselab repo

Build codecov Ruff

PyPI Python Version License

pages

Welcome to the senselab repo! This is a Python package for streamlining the processing and analysis of behavioral data, such as voice and speech patterns, with robust and reproducible methodologies.

Caution:: this package is still under development and may change rapidly over the next few weeks.

Features

  • Modular design: Utilize a variety of task-specific transformations that can be easily integrated or used standalone, allowing for flexible data manipulation and analysis strategies.

  • Pre-built pipelines: Access pre-configured pipelines combining multiple transformations tailored for common analysis tasks, which help in reducing setup time and effort.

  • Reproducibility: Ensures consistent outputs through the use of fixed seeds and version-controlled processing steps, making your results verifiable and easily comparable.

  • Easy integration: Designed to fit into existing workflows with minimal configuration, senselab can be used alongside other data analysis tools and frameworks seamlessly.

  • Extensible: Open to modifications and contributions, the package can be expanded with custom transformations and pipelines to meet specific research needs. Do you want to contribute? Please, reach out!

  • Comprehensive documentation: Comes with detailed documentation for all features and modules, including examples and guides on how to extend the package for other types of behavioral data analysis.

  • Performance Optimized: Efficiently processes large datasets with optimized code and algorithms, ensuring quick turnaround times even for complex analyses.

  • Interactive Examples: Includes Jupyter notebooks that provide practical examples of how senselab can be implemented to derive insights from real-world data sets.

Whether you're researching speech disorders, analyzing customer service calls, or studying communication patterns, senselab provides the tools and flexibility needed to extract meaningful conclusions from your data.

Installation

Install this package via:

pip install senselab

Or get the newest development version via:

pip install git+https://github.com/sensein/senselab.git

Quick start

from senselab.app import hello_world

hello_world()

Contributing

Please see CONTRIBUTING.md before contributing.

To do (majors):

  • Integrating more multi-modal tasks

    • input_output
  • Integrating more audio tasks and moving functions from b2aiprep package:

    • data_augmentation
    • data_representation (embeddings and feats extraction)
    • raw_signal_processing
    • speaker_diarization [@Isaac]
    • speech emotion recognition [@Jordan]
    • speech enhancement
    • speech_to_text
    • text_to_speech
    • voice conversion
  • Integrating more video tasks:

    • input_output
  • Preparing some pipelines with pydra

  • Populating the CLI

To do (minors):

  • Check login with HF
  • Setup project cache directory

About

PipePal is a Python package that simplifies building pipelines for speech and voice analysis.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 76.8%
  • CSS 23.2%