Yahoo Web Search

Search results

  1. 5 days ago · Ever wondered what the people around you are really thinking? Whisper is an online community where millions of people around the world share real thoughts, trade advice, and get the inside scoop.

    • (244.3K)
    • Social
    • Teen
    • MediaLab.AI, Inc-Whisper
    • Overview
    • Approach
    • Setup
    • Available models and languages
    • Command-line usage
    • Python usage
    • More examples
    • License
    • GeneratedCaptionsTabForHeroSec

    [Blog] [Paper] [Model card] [Colab example]

    Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

    A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented as a sequence of tokens to be predicted by the decoder, allowing a single model to replace many stage...

    We used Python 3.9.9 and PyTorch 1.10.1 to train and test our models, but the codebase is expected to be compatible with Python 3.8-3.11 and recent PyTorch versions. The codebase also depends on a few Python packages, most notably OpenAI's tiktoken for their fast tokenizer implementation. You can download and install (or update to) the latest release of Whisper with the following command:

    Alternatively, the following command will pull and install the latest commit from this repository, along with its Python dependencies:

    To update the package to the latest version of this repository, please run:

    It also requires the command-line tool ffmpeg to be installed on your system, which is available from most package managers:

    There are five model sizes, four with English-only versions, offering speed and accuracy tradeoffs. Below are the names of the available models and their approximate memory requirements and inference speed relative to the large model; actual speed may vary depending on many factors including the available hardware.

    The .en models for English-only applications tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.

    The following command will transcribe speech in audio files, using the medium model:

    The default setting (which selects the small model) works well for transcribing English. To transcribe an audio file containing non-English speech, you can specify the language using the --language option:

    Adding --task translate will translate the speech into English:

    Run the following to view all available options:

    Transcription can also be performed within Python:

    Internally, the transcribe() method reads the entire file and processes the audio with a sliding 30-second window, performing autoregressive sequence-to-sequence predictions on each window.

    Please use the 🙌 Show and tell category in Discussions for sharing more example usages of Whisper and third-party extensions such as web demos, integrations with other tools, ports for different platforms, etc.

    Whisper's code and model weights are released under the MIT License. See LICENSE for further details.

    Whisper is a Transformer model that can perform multilingual speech recognition, speech translation, and language identification. It is trained on a large dataset of diverse audio and can be installed and used with Python and ffmpeg.

  2. Sep 21, 2022 · Whisper is an open-source system that can transcribe and translate speech in multiple languages from a large and diverse web dataset. It uses a simple encoder-decoder Transformer architecture and outperforms existing models on zero-shot tasks.

  3. Download Whisper - Share, Express, Meet and enjoy it on your iPhone, iPad, and iPod touch. ‎Ever wondered what the people around you are really thinking? Whisper is an online community where millions of people around the world share real thoughts, trade advice, and get the inside scoop.

    • Social Networking
    • 17+
    • WhisperText LLC.
  4. People also ask

  5. Whisper is a proprietary mobile app available without charge. It is a form of anonymous social media, allowing users to post and share photo and video messages anonymously, although this claim has been challenged with privacy concerns over Whisper's handling of user data.

    • 9.58.0 / May 18, 2022; 16 months ago
    • Proprietary
    • March 31, 2012
  6. Aug 28, 2023 · Whisper is an app for people to anonymously post their secrets. It's a great way to get something off your chest, read other people's secrets, and even meet people online. This wikiHow guide will help you get started on the app. Download...

    • 76.2K
  7. Learn the meaning, synonyms, examples, and history of the word whisper as a verb and a noun. Find out how to use whisper in a sentence and how to pronounce it correctly.

  1. People also search for