Meta Spirit LM: Interleaved Spoken and Written Language Model

This repository contains the model weights, inference code and evaluation scripts for the Spirit LM paper. You can find more generation samples on our demo page.

Spirit LM Model Overview

Installation Setup

Conda

conda env create -f env.yml
pip install -e '.[eval]'

Pip

pip install -r requirements.txt
pip install -e '.[eval]'

Dev

(Optional. Install this only if you want to run the tests.)

pip install -e '.[dev]'
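After either install route, a quick sanity check can confirm that the editable install is visible to the active Python environment. The snippet below is a minimal sketch rather than part of the repository: the helper `is_installed` is hypothetical, and the package name `spiritlm` is an assumption, so substitute the top-level name that the install actually provides.

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if the import system can locate the top-level `package`."""
    # find_spec returns None when a top-level module cannot be found.
    return importlib.util.find_spec(package) is not None


if __name__ == "__main__":
    # ASSUMPTION: "spiritlm" is a guessed package name, not taken from this README.
    name = "spiritlm"
    status = "found" if is_installed(name) else "not found"
    print(f"{name}: {status}")
```

If the package is reported as not found, the most likely cause is that the install commands were run in a different environment than the one running the check.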

Checkpoints Setup

See checkpoints/README.md

Quick Start

Model Card

More details about the model can be found in MODEL_CARD.md.

License

The present code is provided under the FAIR Noncommercial Research License found in LICENSE.

Citation

@misc{nguyen2024spiritlminterleavedspokenwritten,
      title={SpiRit-LM: Interleaved Spoken and Written Language Model},
      author={Tu Anh Nguyen and Benjamin Muller and Bokai Yu and Marta R. Costa-jussa and Maha Elbayad and Sravya Popuri and Paul-Ambroise Duquenne and Robin Algayres and Ruslan Mavlyutov and Itai Gat and Gabriel Synnaeve and Juan Pino and Benoit Sagot and Emmanuel Dupoux},
      year={2024},
      eprint={2402.05755},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2402.05755},
}
