Question:

Google Cloud Speech-to-Text API

聂溪叠

2023-03-14

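I am using the Google Cloud Speech-to-Text API and trying to transcribe a long audio file. However, the audio file in the bucket cannot be found. I get an error stating: IOError: [Errno 2] No such file or directory: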

def transcribe_gcs(gcs_uri):
    """Asynchronously transcribes the audio file specified by the gcs_uri."""
    # NOTE: `time` is not defined or imported anywhere in this snippet.
    time(gcs_uri)

    from google.cloud import speech
    from google.cloud.speech import enums
    from google.cloud.speech import types
    client = speech.SpeechClient()

    audio = types.RecognitionAudio(uri=gcs_uri)
    config = types.RecognitionConfig(
        encoding=enums.RecognitionConfig.AudioEncoding.FLAC,
        sample_rate_hertz=16000,
        language_code='en-US')

    operation = client.long_running_recognize(config, audio)

    print('Waiting for operation to complete...')
    response = operation.result(timeout=90)

    # Each result is for a consecutive portion of the audio. Iterate through
    # them to get the transcripts for the entire audio file.
    for result in response.results:
        # The first alternative is the most likely one for this portion.
        print(u'Transcript: {}'.format(result.alternatives[0].transcript))
        print('Confidence: {}'.format(result.alternatives[0].confidence))
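IOError [Errno 2] is what Python raises when it tries to open a local path that does not exist. The post shows neither the traceback nor how transcribe_gcs is called, so the cause here is a guess. Two things worth ruling out are a gcs_uri that is not actually in gs://bucket/object form, and a credentials file path that does not exist on disk. A minimal sketch of a check for the first case follows; the helper name parse_gcs_uri is hypothetical and is not part of the Google client library.

```python
def parse_gcs_uri(gcs_uri):
    """Split a gs://bucket/object URI into (bucket, object_name).

    Hypothetical helper for illustration only. Raises ValueError for
    anything that is not of the form gs://bucket/object, such as a
    local file path or an https:// URL.
    """
    prefix = "gs://"
    if not gcs_uri.startswith(prefix):
        raise ValueError(
            "expected a URI like gs://bucket/object, got {!r}".format(gcs_uri)
        )
    # Everything up to the first "/" is the bucket; the rest is the object.
    bucket, _, object_name = gcs_uri[len(prefix):].partition("/")
    if not bucket or not object_name:
        raise ValueError(
            "expected a URI like gs://bucket/object, got {!r}".format(gcs_uri)
        )
    return bucket, object_name
```

Calling parse_gcs_uri(gcs_uri) at the top of transcribe_gcs would fail early, with a clearer message, if a local path was passed by mistake.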

1 answer

乐华晖

2023-03-14

Try this:

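import json

import requests

# Replace <apiaccesskey> with your own API key.
url = "https://speech.googleapis.com/v1/speech:longrunningrecognize?key=<apiaccesskey>"

# The encoding must match the audio file in the bucket (FLAC for a .flac file).
payload = {
    "config": {
        "encoding": "FLAC",
        "sample_rate_hertz": 8000,
        "language_code": "en-IN",
    },
    "audio": {"uri": "gs://bucketname/file.flac"},
}

# Start the long-running recognition job.
r = requests.post(url, data=json.dumps(payload))

json_resp = r.json()
# The operation name identifies the job for later lookups.
token_resp = json_resp['name']

# Fetch the operation by name to read its current state.
url = ("https://speech.googleapis.com/v1/operations/" + str(token_resp) +
       "?key=<apiaccesskey>")

content_response = requests.get(url)
content_json = content_response.json()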

Your response is in the content_json variable.
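A single GET on the operation may come back before the job has finished, in which case the returned JSON has no "response" field yet. The sketch below polls until the operation reports "done" and then pulls out the top transcript for each result. It assumes the standard long-running operation shape ("done", "error", "response"); the function names poll_operation and extract_transcripts and the default interval and attempt count are hypothetical choices for illustration.

```python
import time


def poll_operation(fetch_operation, interval_seconds=5.0, max_attempts=60):
    """Call fetch_operation() until the returned dict has done == True.

    fetch_operation is any zero-argument callable that returns the
    operation JSON as a dict. Raises RuntimeError if the finished
    operation carries an error, TimeoutError if it never finishes.
    """
    for attempt in range(max_attempts):
        operation = fetch_operation()
        if operation.get("done"):
            if "error" in operation:
                raise RuntimeError(
                    "operation failed: {}".format(operation["error"])
                )
            return operation
        # Do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)
    raise TimeoutError(
        "operation not done after {} attempts".format(max_attempts)
    )


def extract_transcripts(operation):
    """Return (transcript, confidence) for the top alternative of each result."""
    results = operation.get("response", {}).get("results", [])
    transcripts = []
    for result in results:
        alternatives = result.get("alternatives", [])
        if alternatives:
            top = alternatives[0]
            transcripts.append(
                (top.get("transcript", ""), top.get("confidence"))
            )
    return transcripts
```

With the code above, this could be used as poll_operation(lambda: requests.get(url).json()), followed by extract_transcripts on the returned dict. Passing the fetch step in as a callable keeps the polling logic independent of the HTTP library.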
