Re: Suggestions for Text-To-Speech (TTS) from Org sources?

emacs-orgmode

[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Suggestions for Text-To-Speech (TTS) from Org sources?

From:	briangpowell
Subject:	Re: Suggestions for Text-To-Speech (TTS) from Org sources?
Date:	Sat, 9 Sep 2023 17:20:28 -0400

I've turned OrgMode files into audio desktops

It was pretty simple

Just find the code that reveals what an icon is when you hover over it & pipe it to some text-to-speech engine & then on to usual routes

On Sat, Sep 9, 2023 at 2:06 PM Jens Lechtenboerger <lechten@wi.uni-muenster.de> wrote:

Dear all,

does someone here produce audio via Text-To-Speech (TTS) from Org
sources? I plan to do that in the context of emacs-reveal to
generate voice-over for reveal.js presentations, with open questions
[1] concerning my initial, experimental approach.

Currently, I like the default model of Coqui-AI TTS [2] and
Microsoft SpeechT5 [3] best. Any suggestions for free and open TTS
implementations that produce even better results? Other models of
Coqui-AI? The solution should work without GPU support, which seems
to rule out Suno Bark [4].

The above models do not pronounce numbers/digits, and they fail to
pronounce most acronyms. In a preprocessing step I could replace
those. I use preprocessing anyways to get rid of Org markup that
might confuse the language models. Anyone here who did that
already? Maybe gruut [5] in conjunction with SSML [6] handling?

Any other suggestions?

Best wishes
Jens

[1] https://gitlab.com/oer/emacs-reveal/-/issues/20
[2] https://github.com/coqui-ai/TTS/
[3] https://huggingface.co/microsoft/speecht5_tts
[4] https://github.com/suno-ai/bark
[5] https://github.com/rhasspy/gruut
[6] https://www.w3.org/TR/speech-synthesis11/

[Prev in Thread]

Current Thread

[Next in Thread]

Suggestions for Text-To-Speech (TTS) from Org sources?, Jens Lechtenboerger, 2023/09/09
- Re: Suggestions for Text-To-Speech (TTS) from Org sources?, briangpowell <=
  - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Jens Lechtenboerger, 2023/09/10
- Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Ihor Radchenko, 2023/09/10
  - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Jens Lechtenboerger, 2023/09/10
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Christian Thäter, 2023/09/11
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Jens Lechtenboerger, 2023/09/11
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, briangpowell, 2023/09/11
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Jude DaShiell, 2023/09/11
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, tomas, 2023/09/11
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, Jude DaShiell, 2023/09/11
    - Re: Suggestions for Text-To-Speech (TTS) from Org sources?, tomas, 2023/09/11

Prev by Date: Org mode version 9.7-pre (9.7-pre-n/a-g2030c1 @ /home/n/.emacs.d/elpaca/builds/org/); org--get-expected-indentation: useless cond clause
Next by Date: [BUG] org-export-with-broken-links not respected for (org-icalendar-combine-agenda-files t) [9.6.9 ( @ /home/jorge/.config/emacs/elpa/org-9.6.9/)]
Previous by thread: Suggestions for Text-To-Speech (TTS) from Org sources?
Next by thread: Re: Suggestions for Text-To-Speech (TTS) from Org sources?
Index(es):
- Date
- Thread