DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行。

趋势

Project DeepSpeech

Documentation macOS builds Linters Docker Images

DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.

Documentation for installation, usage, and training models are available on deepspeech.readthedocs.io.

For the latest release, including pre-trained models and checkpoints, see the latest release on GitHub.

For contribution guidelines, see CONTRIBUTING.rst.

For contact and support information, see SUPPORT.rst.

关于
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
26.0 k
4.0 k
676
语言
C++
Python
C
Shell
C#
Swift
Java
Makefile
CMake
TypeScript
SWIG
JavaScript
Starlark
Awk
Ruby
Objective-C
46.94%
21.39%
11.23%
10.76%
2.77%
1.76%
1.32%
0.91%
0.9%
0.56%
0.46%
0.45%
0.39%
0.12%
0.04%
0.01%