tts : implement mimi decoder #12636

ngxson · 2025-03-29T00:33:04Z

llama.cpp/example/mimi

Related to #12392

This demonstrates running Kyutai's Mimi model via GGML.

TODO:

implement decode_frame
see how long generation goes
test with audio codes from Sesame
abstract the decode into a function decode(codes) that returns std::vector<float>

Quickstart

Convert model to GGUF (no need to download, the script will automatically download the safetensors file)

python examples/tts/convert_mimi_to_gguf.py

# output file: kyutai-mimi.gguf

# optionally, use q8_0 quantization for faster speed
python examples/tts/convert_mimi_to_gguf.py --outtype q8_0

Then compile, run it:

cmake --build build -j --target llama-mimi

./build/bin/llama-mimi kyutai-mimi.gguf codes.txt

# output: output.wav

# alternatively, use "dummy1" to get a "hey hello there" sample output file
./build/bin/llama-mimi kyutai-mimi.gguf dummy1

Example of code file (one code per line):

ngxson · 2025-03-30T11:51:09Z

Close and merge with #12648

tts : implement mimi decoder

24a07ab

github-actions bot added examples python python script changes labels Mar 29, 2025

ngxson added 3 commits March 29, 2025 09:06

fix llama-tts

efeaa57

put mimi_model into a shared header

a98f199

mimi : non-transposed input codes

891273c

This was referenced Mar 29, 2025

csm : implement Sesame-based conversation example #12392

Closed

tts : implement sesame CSM + Mimi decoder #12648

Open

ngxson added 6 commits March 30, 2025 10:50

add mimi_model::transpose_input

eae5f0e

fix build

43bf237

fix build (2)

e618405

fix build (3)

e185e0a

fix strcmp

ce83041

fix compilation on linux

61d8ad6

ngxson closed this Mar 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

tts : implement mimi decoder #12636

tts : implement mimi decoder #12636

Uh oh!

ngxson commented Mar 29, 2025 •

edited

Loading

Uh oh!

ngxson commented Mar 30, 2025

Uh oh!

Uh oh!

tts : implement mimi decoder #12636

tts : implement mimi decoder #12636

Uh oh!

Conversation

ngxson commented Mar 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

llama.cpp/example/mimi

Quickstart

Uh oh!

ngxson commented Mar 30, 2025

Uh oh!

Uh oh!

ngxson commented Mar 29, 2025 •

edited

Loading