CryptoCommunity

OpenAI open-sources Whisper, a multilingual speech recognition system

admin Published September 21, 2022
Last updated: 2022/09/21 at 10:16 PM

Speech recognition remains a challenging problem in AI and machine learning. In a step toward solving it, OpenAI today open-sourced Whisper, an automatic speech recognition system that the company claims enables “robust” transcription in multiple languages as well as translation from those languages into English.

Countless organizations have developed highly capable speech recognition systems, which sit at the core of software and services from tech giants like Google, Amazon and Meta. But what makes Whisper different, according to OpenAI, is that it was trained on 680,000 hours of multilingual and “multitask” data collected from the web, which leads to improved recognition of unique accents, background noise and technical jargon.

“The primary intended users of [the Whisper] models are AI researchers studying robustness, generalization, capabilities, biases and constraints of the current model. However, Whisper is also potentially quite useful as an automatic speech recognition solution for developers, especially for English speech recognition,” OpenAI wrote in the GitHub repo for Whisper, where several versions of the system can be downloaded. “[The models] show strong ASR results in ~10 languages. They may exhibit additional capabilities … if fine-tuned on certain tasks like voice activity detection, speaker classification or speaker diarization but have not been robustly evaluated in these areas.”
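For developers who want to try the two tasks described above, transcription and translation into English, the following is a minimal sketch rather than an official recipe. It assumes the open-source `whisper` Python package from OpenAI's repo is installed along with its dependencies. The helper names `build_decode_options` and `run_whisper`, the `"base"` model size and the audio file name are illustrative choices, not anything specified in the article.

```python
from typing import Optional


def build_decode_options(translate_to_english: bool = False,
                         language: Optional[str] = None) -> dict:
    """Build keyword arguments for Whisper's transcribe() call.

    Hypothetical helper: it only selects between the two tasks the
    article mentions (same-language transcription, or translation
    into English) and optionally pins the spoken language.
    """
    options = {"task": "translate" if translate_to_english else "transcribe"}
    if language is not None:
        # Language code such as "fr"; if omitted, Whisper detects it.
        options["language"] = language
    return options


def run_whisper(audio_path: str, model_name: str = "base", **kwargs) -> str:
    """Transcribe (or translate) one audio file and return the text.

    The import is deferred so that the option helper above can be used
    without the whisper package being installed.
    """
    import whisper  # assumed to be installed from OpenAI's repo

    model = whisper.load_model(model_name)  # e.g. "tiny", "base", "small"
    result = model.transcribe(audio_path, **build_decode_options(**kwargs))
    return result["text"]


# Example call (hypothetical file name):
# print(run_whisper("interview.mp3", translate_to_english=True))
```

Smaller model sizes load and run faster at some cost in accuracy, so a sketch like this would typically start with a small model and move up only if the transcripts are not good enough.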

Whisper has its limitations, particularly in the area of text prediction. Because the system was trained on a large amount of “noisy” data, OpenAI cautions Whisper might include words in its transcriptions that weren’t actually spoken — possibly because it’s both trying to predict the next word in audio and trying to transcribe the audio itself. Moreover, Whisper doesn’t perform equally well across languages, suffering from a higher error rate when it comes to speakers of languages that aren’t well-represented in the training data.
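The article does not say how the error rate is measured. A common metric for speech recognition is word error rate (WER): the number of word substitutions, deletions and insertions needed to turn the system's output into the reference transcript, divided by the number of words in the reference. The sketch below assumes that metric and uses a simple lowercase, whitespace-based tokenization. The function name and sample sentences are illustrative only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# A transcript with one extra word that was never spoken counts as
# one insertion against a six-word reference.
spoken = "the cat sat on the mat"
transcribed = "the cat sat on the mat today"
print(round(word_error_rate(spoken, transcribed), 3))  # 0.167
```

Under this metric, a word the system adds that was never spoken counts as an insertion error, so the hallucination behavior OpenAI describes would raise the score directly.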

That last point is nothing new to the world of speech recognition, unfortunately. Biases have long plagued even the best systems: a 2020 Stanford study found that systems from Amazon, Apple, Google, IBM and Microsoft made far fewer errors with users who are white (an average word error rate of about 19%) than with users who are Black (about 35%).

Despite this, OpenAI sees Whisper’s transcription capabilities being used to improve existing accessibility tools.

“While Whisper models cannot be used for real-time transcription out of the box, their speed and size suggest that others may be able to build applications on top of them that allow for near-real-time speech recognition and translation,” the company continues on GitHub. “The real value of beneficial applications built on top of Whisper models suggests that the disparate performance of these models may have real economic implications … While we hope the technology will be used primarily for beneficial purposes, making automatic speech recognition technology more accessible could enable more actors to build capable surveillance technologies or scale up existing surveillance efforts, as the speed and accuracy allow for affordable automatic transcription and translation of large volumes of audio communication.”

The release of Whisper isn’t necessarily indicative of OpenAI’s future plans. While increasingly focused on commercial efforts like DALL-E 2 and GPT-3, the company is pursuing several purely theoretical research threads, including AI systems that learn by observing videos.

© 2022 Cryptos Community All Rights Reserved. All logos and images used on this website are registered trademarks of their respective companies. All Rights Reserved. Cryptos Community is not liable for inaccuracies, errors, or omissions found herein. For the removal of copyrighted images, trademarks, or other issues, Contact Us. 

