ChatGPT 的 stt 是用的 whisper 吗？感觉比所有其他的语音输入都要强

reDesign · 2024-10-18T11:10:37Z

中英文混输比讯飞强，纯中文和讯飞差不多说的是这个东西 https://i.imgur.com/zLIvD7Q.png

This topic created in 607 days ago, the information mentioned may be changed or developed.

中英文混输比讯飞强，纯中文和讯飞差不多
说的是这个东西

Supplement 1 · Oct 18, 2024

这个跟多模肽有关系么？我记得 GPT 3.5 的时候就有这个功能，现在选择 GPT4 也可以用这个功能。

whisper

stt

语音输入

4 replies • 2024-10-19 19:18:22 +08:00

malusama

Oct 18, 2024

这玩意估计就是模型支持语音的输入输出。。毕竟早就是多模态的了

kyor0

Oct 18, 2024

4o 是多模台的

cyp0633

Oct 19, 2024

如果是 whisper ，效果会远不如讯飞

FlashEcho

Oct 19, 2024

官方文档里就有： https://platform.openai.com/docs/guides/speech-to-text

The Audio API provides two speech to text endpoints, transcriptions and translations, based on our state-of-the-art open source large-v2 Whisper model.

ChatGPT 的 stt 是用的 whisper 吗？ 感觉比所有其他的语音输入都要强

ChatGPT 的 stt 是用的 whisper 吗？感觉比所有其他的语音输入都要强