Skip to content
This repository was archived by the owner on Aug 3, 2021. It is now read-only.

Commit 2e0b1d8

Browse files
committed
Updated docs
Signed-off-by: Vitaly Lavrukhin <[email protected]>
1 parent c4532a7 commit 2e0b1d8

File tree

2 files changed

+12
-9
lines changed

2 files changed

+12
-9
lines changed

docs/sources/source/installation.rst

+10-9
Original file line numberDiff line numberDiff line change
@@ -84,20 +84,21 @@ Installation of OpenSeq2Seq for speech recognition
8484
CTC-based speech recognition models can use the following decoders to get a transcription out of a model's state:
8585

8686
* greedy decoder, the fastest, but might yield spelling errors (can be enabled with ``"use_language_model": False``)
87-
* beam search decoder with language model rescoring, the most accurate, but the slowest (can be enabled with ``"use_language_model": True``)
87+
* beam search decoder with language model (LM) rescoring, the most accurate, but the slowest
8888

89-
You can find more information about these decoders at :doc:`DeepSpeech 2 page </speech-recognition/deepspeech2>`.
89+
You can find more information about these decoders at :ref:`decoders-ref` section.
9090

9191
CTC beam search decoder with language model rescoring is an optional component and might be used for speech recognition inference only.
9292

93-
Since TensorFlow does not support it by default, you will need to build TensorFlow
94-
from sources with a custom CTC decoder operation. In order to do that, follow
95-
the steps below. Alternatively, you can disable language model by setting
96-
"use_language_model" parameter of decoder to False, but that will lead to a
97-
worse model accuracy.
93+
There are two implementations of CTC beam search decoder with LM rescoring in OpenSeq2Seq:
9894

99-
How to install a CTC decoder with language model to TensorFlow (optional)
100-
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
95+
* Baidu CTC decoder (the recommended). It can be installed with ``scripts/install_decoders.sh`` command.
96+
To test the installation please run ``python scripts/ctc_decoders_test.py``.
97+
98+
* Custom native TF op (rather deprecated). See installation instructions below.
99+
100+
How to build a custom native TF op for CTC decoder with language model (optional)
101+
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
101102

102103
First of all, make sure that you installed CUDA >= 10.0, cuDNN >= 7.4, NCCL >= 2.3.
103104

docs/sources/source/speech-recognition.rst

+2
Original file line numberDiff line numberDiff line change
@@ -96,6 +96,8 @@ The recommended pipeline is the following (in order to get the best accuracy, th
9696
To enable `librosa`, please make sure that there is a line ``"backend": "librosa"`` in ``"data_layer_params"``.
9797

9898

99+
.. _decoders-ref:
100+
99101
Decoders
100102
~~~~~~~~
101103

0 commit comments

Comments
 (0)