【Python实用API】语音转文本-whisper

作者：System 时间：2024年08月28日分类：所有,AIGC 字数：1232

这篇文章距离上次修改已过499天，其中的内容可能已经有所变动。




from google.cloud import speech_v1p1beta1
from google.cloud.speech_v1p1beta1 import enums
from google.cloud.speech_v1p1beta1 import types
from google.oauth2 import service_account
 
# 使用服务账户的凭证创建认证对象
credentials = service_account.Credentials.from_service_account_file('service_account.json')
 
# 创建speech客户端
client = speech_v1p1beta1.SpeechClient(credentials=credentials)
 
# 语音文件的路径
audio_file_path = 'audio.wav'
 
# 读取语音文件内容
with open(audio_file_path, 'rb') as audio_file:
    content = audio_file.read()
 
# 创建一个SyncRecognizeRequest对象，设置配置参数
request = types.SyncRecognizeRequest(
    config=types.RecognitionConfig(
        encoding=enums.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=16000,
        language_code='en-US',
        model='command_and_search',
    ),
    audio=types.RecognitionAudio(content=content),
)
 
# 执行识别并打印结果
response = client.recognize(request)
 
for result in response.results:
    print('Transcript: {}'.format(result.alternatives[0].transcript))

这段代码展示了如何使用Google的Speech-to-Text API进行语音转文本的操作。它首先配置了认证信息，然后创建了一个Speech客户端用于发送请求。接着，它读取了一个语音文件并准备了请求，最后调用了client的recognize方法来执行语音识别，并打印出了识别的文本结果。这个例子使用了服务账户的凭证进行认证，并设置了请求的配置参数，包括编码格式、采样率、语言代码和使用的模型。

【Python实用API】语音转文本-whisper

评论已关闭

推荐阅读