问题：

MS Azure上麦克风的连续语音识别

慕容晔

2023-03-14

我想使用Azure Speech服务从麦克风进行语音识别。我有一个使用recognize_once_async（）在Python中顺利运行的程序，但它只能识别具有15秒音频限制的第一个话语。我对这个主题做了一些研究，并检查了MS（https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py）的示例代码，但找不到任何可以从麦克风进行连续语音识别的东西...有什么提示吗？

共有1个答案

米俊晤

2023-03-14

您可以尝试以下代码：

import azure.cognitiveservices.speech as speechsdk
import os
import time

 
path = os.getcwd()
# Creates an instance of a speech config with specified subscription key and service region.
# Replace with your own subscription key and region identifier from here: https://aka.ms/speech/sdkregion
speech_key, service_region = "6.....9", "eastus"
speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region)

# Creates a recognizer with the given settings
speech_config.speech_recognition_language="en-US"
#source_language_config = speechsdk.languageconfig.SourceLanguageConfig("en-US", "The Endpoint ID for your custom model.")
speech_recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config)

done = False 
def stop_cb(evt):
    print('CLOSING on {}'.format(evt))
    speech_recognizer.stop_continuous_recognition()
    global done
    done= True
    

#Connect callbacks to the events fired by the speech recognizer    
speech_recognizer.recognizing.connect(lambda evt: print('RECOGNIZING: {}'.format(evt)))
speech_recognizer.recognized.connect(lambda evt: print('RECOGNIZED: {}'.format(evt)))
speech_recognizer.session_started.connect(lambda evt: print('SESSION STARTED: {}'.format(evt)))
speech_recognizer.session_stopped.connect(lambda evt: print('SESSION STOPPED {}'.format(evt)))
speech_recognizer.canceled.connect(lambda evt: print('CANCELED {}'.format(evt)))
# stop continuous recognition on either session stopped or canceled events
speech_recognizer.session_stopped.connect(stop_cb)
speech_recognizer.canceled.connect(stop_cb)

speech_recognizer.start_continuous_recognition()

while not done:
    time.sleep(.5)

说明：默认情况下，当您不提供audioconfig时，默认输入源是麦克风。

如果您想配置/自定义-可以使用audioconfig类

在连续识别中，有各种各样的回调事件，如-识别、识别、取消。

类似资料：

语音语音识别android移除谷歌屏幕和麦克风按钮

我正在使用Android API的语音识别。我成功地遵循了以下教程:http://code4reference.com/2012/07/tutorial-android-voice-refactionation/#comment-335
使用麦克风检测音量

使用光环板制作一个音量检测计，通过光环板的麦克风检测音量大小，并通过可视化的形式呈现出来，音量越大，LED灯环亮起的灯就越多。 1. 从事件类积木拖取一个当按钮被按下时积木。 2. 从控制类积木拖取一个重复执行积木。 3. 从灯光类积木拖取一个显示LED环形图（）% 积木，再添加一个传感器类积木麦克风响度。 4. 试着拍下桌子，看光环板LED环形图的变化吧！下载代码
启用识别连续语音

如何使用REST API（带javascript SDK）Bing语音API实现连续语音识别？使用do Javascript SDK示例：https://github.com/Microsoft/Cognitive-Speech-STT-JavaScript只能用麦克风转录短句
基于phonegap的连续语音识别

我想在phonegap中创建应用程序，在Android和IOS中使用连续语音识别。我的应用程序应该等待用户的声音，当他/她说“下一步”时，应用程序应该更新屏幕并执行一些操作。我发现这个插件：https://github.com/macdonst/SpeechRecognitionPlugin而且它工作得非常快。但在语音识别启动几秒钟后，语音识别器停止工作，但并没有语音。是否有任何方法或标志，如i
从麦克风识别蟒蛇口袋斯芬克斯

我已经在python中安装和安装了pocketsphinx和sphinxbase包。我还为github获取了语音识别代码，并根据需要更改了数据和模式目录，但当我试图通过“python test.py”运行它时，它仍然无法通过语音进行流式传输。下面是代码：请告诉我如何执行它。
Android-录制视频时静音麦克风

在遵循类似的教程之后，我正在使用MediaRecorder类用相机录制视频 http://androidcookbook.com/Recipe.seam;jsessionid=40151FCD26222877E151C3EEFB406EED？recipeId=1375 我希望在录音时能够使麦克风静音/取消静音。怎么可能呢？我正在开始设置音频源但是如果我想在某个时候录制没有声音的音乐呢？

MS Azure上麦克风的连续语音识别

共有1个答案

相关问答

相关文章

相关阅读

相关工具

相关文档