当前位置：首页 > 软件库 > 程序开发 > TTS/语音合成和处理 >

DeepSpeech

端到端自动语音识别

授权协议 Apache-2.0

开发语言 C/C++ Python

所属分类程序开发、 TTS/语音合成和处理

软件类型开源软件

地区国产

投递者祝宏放

操作系统跨平台

开源组织百度

适用人群未知

软件官网

软件文档

官方下载

软件概览

DeepSpeech 是一个采用 PaddlePaddle 平台的端到端自动语音识别（ASR）引擎的开源项目，具体原理参考这篇论文 Baidu's Deep Speech 2 paper。我们的愿景是为语音识别在工业应用和学术研究上，提供易于使用、高效和可扩展的工具，包括训练，推理，测试模块，以及 demo 部署。同时，我们还将发布一些预训练好的英语和普通话模型。

下载安装命令

## CPU版本安装命令
pip install -f https://paddlepaddle.org.cn/pip/oschina/cpu paddlepaddle

## GPU版本安装命令
pip install -f https://paddlepaddle.org.cn/pip/oschina/gpu paddlepaddle-gpu

使用案例

DeepSpeech的使用尝试Linux环境下

环境：Ubuntu18.0.4 python3.6 安装DeepSpeech：会自动安装最新的版本 pip install deepspeech 或者，也可以指定版本： pip install deepspeech~=0.9.3 ● 首先wget获取deepspeech的model：这里选取最新的0.9.3 wget https://github.com/mozilla/DeepSpeec
DeepSpeech 怎么下载模型

可以在 DeepSpeech 的 GitHub 页面上找到模型下载链接。可以选择下载预训练模型或者训练自己的模型。还可以使用 pip 安装 DeepSpeech。命令： pipinstall deepspeech
使用java 调用 DeepSpeech 的代码

使用 Java 调用 DeepSpeech 的代码需要使用 DeepSpeech 的 Java 绑定。使用方法如下：下载并安装 DeepSpeech 的 Java 绑定。在 Java 代码中导入相应的类，如：org.mozilla.deepspeech.libdeepspeech.DeepSpeechModel。创建 DeepSpeechModel 对象，并使用 loadModel() 方法
deepspeech 1 （百度 2014 论文解读）

论文：https://arxiv.org/pdf/1412.5567.pdf 题目：Deep Speech: Scaling up end-to-end speech recognition 摘要我们提出了使用端到端深度学习开发的最先进的语音识别系统。我们的体系结构比传统的语音系统要简单得多，传统的语音系统依靠费力地设计的处理管道。当在嘈杂的环境中使用时，这些传统系统的性能也往往很差。相反，我们
mozilla 源码_使用mozilla deepspeech自动生成字幕

mozilla 源码 In the age of OTT platforms, there are still some who prefer to download movies/videos from YouTube/Facebook/Torrents (shush 狼) over streaming. I am one of them and on one such occasion, I
Paddlpaddle+DeepSpeech2自动语音识别部署

Paddlpaddle+DeepSpeech2自动语音识别部署背景语音识别环境 DeepSpeech2 Paddlpaddle1.8.5 Python 2.7 Nvidia-docker ubuntu1~18.04 安装与配置可以不使用nvidia-docker，直接跳到第五步 1.首先安装nvidia-docker curl https://get.docker.com | sh s
Deepspeech v2版本deepspeech.pytorch中文语音识别笔记

代码地址https://github.com/SeanNaren/deepspeech.pytorch 中文语音数据库采用thchs30 （1）首先提取data文件下的trn翻译文本，生成包含空格在内的生字表并保存为json格式lexicon.json，是汉字字典，不是拼音，我在这一步卡了很久，后来发现data_loader只能读取单个字符，所以中文识别的词汇表是翻译文本的汉字生字表（2）生成t

DeepSpeech

同类工具

相关阅读

相关文章

相关问答

相关文档