News
Meta's Voicebox AI promises to do for the spoken word what ChatGPT and Dall-E, respectfully, did for text and image generation.
Turn your favourite book or document into a podcast with narration, voices, and effects using Google NotebookLM. Here’s how it works.
By leveraging the power of Googles NotebookLM app, you can transform any book into a rich, immersive podcast experience.
OpenAI unveils cutting-edge speech-to-text audio AI models API to help developers build accurate, reliable, and engaging voice-driven apps ...
Transformers.js, the JavaScript counterpart to the Python Transformers library, is designed for running Transformers models directly within web browsers, eliminating the necessity for external ...
OpenAI’s latest speech-to-text models, such as GPT-4 Transcribe and GPT-4 Mini Transcribe, deliver significant improvements in transcription accuracy and processing speed.
ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...
ElevenLabs had developed the speech-to-text component for its AI conversational agent platform, which was released last year. However, this is the first time the company is releasing a stand-alone ...
Meta’s “massively multilingual” AI model translates up to 100 languages, speech or text Meta aims for a universal translator like "Babel Fish" from Hitchhiker’s Guide.
For the text-to-speech functionality itself, there are a few customizations you can do, such as changing the speed, volume, and pitch, skipping in-text citations, and adding a sleep timer.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results