Skip to content(if available)orjump to list(if available)

Edge TTS

Edge TTS

22 comments

·January 23, 2025

BrunoJo

I wouldn't use Edge TTS for commercial projects since it's using an internal Microsoft API that was reverse engineered.

If you are looking for a commercial API, I just launched a TTS API powered by the the best performing open source model Kokoro: https://www.lemonfox.ai/text-to-speech-api. The API is compatible with OpenAI and ElevenLabs and up to 25x cheaper.

gigel82

This is dubious, I'm surprised MS hasn't locked down those APIs yet.

I'm curious, would this be legal equivalent of a "cracked" software in terms of piracy?

slyn

I like to use Edge on occasion when I need to read something dry but necessary because I find following along with the TTS it’s auto-highlight of text helps me stay focused and retain better as well.

Is there any equivalent program for ebooks? If not can someone build one? The dream would be to plop in an arbitrary document (pdf, docs, tex, epub, and so on) and have it read to me by a reasonable TTS at a speed of my choosing and have words / lines highlighted as the TTS goes along. Bonus points if you can regularly identify and skip things that are not necessarily relevant like page numbers, headers, footnote markers, and so on, which is something that Edge TTS within Edge struggles with when reading PDFs.

lf-non

The ReadEra app for android supports this, and I use it for reading/listening to ebooks during commute. It works well.

gostsamo

You can use a screen reader. Most of them have a focus highlight feature and use local tts.

modeless

Why would you pirate a TTS service when there are so many great options for local open source TTS now? Models like Fish and Kokoro and StyleTTSv2 are great and very fast.

Click the leaderboard tab here: https://huggingface.co/spaces/TTS-AGI/TTS-Arena

itake

The models you shared only support the top ~10 languages / english only.

I believe the Edge API supports more models:

https://gist.github.com/BettyJJ/17cbaa1de96235a7f5773b8690a2...

Do you know any commercial licensed TTS that support 50+ languages and are relatively small (e.g. many small models, not 1 big model)? Meta's open models supports like 300 languages, but the license doesn't permit commercial use :-/

archerx

I have been experimenting with piper TTS recently, it's free, open source, fast and has a lot of voices in different languages but the quality is not the best but it's still good enough for most cases.

https://rhasspy.github.io/piper-samples/

modeless

I don't know, but the Edge API is not licensed for any use, commercial or otherwise (outside of Edge itself).

userbinator

"pirate"? This was always free.

modeless

The API endpoint was clearly intended for use only by Edge. Yes, reverse engineering the authentication (even if trivial) and using it for other applications, knowing that was not its intended use, I consider a form of piracy.

userbinator

That is a very hazardous slope to go down. We are already seeing user-agent discrimination and this is no different than using Bing from a browser that isn't Edge.

dcre

Not sure if the CLI does this directly, but here's a command that takes text either as an arg or through stdin.

    function tts() {
      if [ -p /dev/stdin ]; then
        edge-playback --file -
      else
        edge-playback --text "$*"
      fi
    }

chopete3

Its not running on the edge. A hack to use MS online tts.

>> edge-tts is a Python module that allows you to use Microsoft Edge's online text-to-speech service from within your Python code or using the provided edge-tts or edge-playback command.

wiradikusuma

Edge = Microsoft Edge, a browser

croes

I guess parent wanted just to clarify that it’s using Edge not running on the edge.

caseyy

So is this entirely offline? If so, it could have quite many useful applications, if not for copyleft of course.

userbinator

Entirely online.

caseyy

Ah, thanks.