gigel82
This is dubious, I'm surprised MS hasn't locked down those APIs yet.
I'm curious, would this be legal equivalent of a "cracked" software in terms of piracy?
slyn
I like to use Edge on occasion when I need to read something dry but necessary because I find following along with the TTS it’s auto-highlight of text helps me stay focused and retain better as well.
Is there any equivalent program for ebooks? If not can someone build one? The dream would be to plop in an arbitrary document (pdf, docs, tex, epub, and so on) and have it read to me by a reasonable TTS at a speed of my choosing and have words / lines highlighted as the TTS goes along. Bonus points if you can regularly identify and skip things that are not necessarily relevant like page numbers, headers, footnote markers, and so on, which is something that Edge TTS within Edge struggles with when reading PDFs.
lf-non
The ReadEra app for android supports this, and I use it for reading/listening to ebooks during commute. It works well.
gostsamo
You can use a screen reader. Most of them have a focus highlight feature and use local tts.
modeless
Why would you pirate a TTS service when there are so many great options for local open source TTS now? Models like Fish and Kokoro and StyleTTSv2 are great and very fast.
Click the leaderboard tab here: https://huggingface.co/spaces/TTS-AGI/TTS-Arena
itake
The models you shared only support the top ~10 languages / english only.
I believe the Edge API supports more models:
https://gist.github.com/BettyJJ/17cbaa1de96235a7f5773b8690a2...
Do you know any commercial licensed TTS that support 50+ languages and are relatively small (e.g. many small models, not 1 big model)? Meta's open models supports like 300 languages, but the license doesn't permit commercial use :-/
archerx
I have been experimenting with piper TTS recently, it's free, open source, fast and has a lot of voices in different languages but the quality is not the best but it's still good enough for most cases.
modeless
I don't know, but the Edge API is not licensed for any use, commercial or otherwise (outside of Edge itself).
homarp
or directly try https://kokorotts.com/ or https://huggingface.co/spaces/hexgrad/Kokoro-TTS
userbinator
"pirate"? This was always free.
modeless
The API endpoint was clearly intended for use only by Edge. Yes, reverse engineering the authentication (even if trivial) and using it for other applications, knowing that was not its intended use, I consider a form of piracy.
userbinator
That is a very hazardous slope to go down. We are already seeing user-agent discrimination and this is no different than using Bing from a browser that isn't Edge.
dcre
Not sure if the CLI does this directly, but here's a command that takes text either as an arg or through stdin.
function tts() {
if [ -p /dev/stdin ]; then
edge-playback --file -
else
edge-playback --text "$*"
fi
}
chopete3
Its not running on the edge. A hack to use MS online tts.
>> edge-tts is a Python module that allows you to use Microsoft Edge's online text-to-speech service from within your Python code or using the provided edge-tts or edge-playback command.
wiradikusuma
Edge = Microsoft Edge, a browser
croes
I guess parent wanted just to clarify that it’s using Edge not running on the edge.
caseyy
So is this entirely offline? If so, it could have quite many useful applications, if not for copyleft of course.
I wouldn't use Edge TTS for commercial projects since it's using an internal Microsoft API that was reverse engineered.
If you are looking for a commercial API, I just launched a TTS API powered by the the best performing open source model Kokoro: https://www.lemonfox.ai/text-to-speech-api. The API is compatible with OpenAI and ElevenLabs and up to 25x cheaper.