All you do is give it your book in any ebook format, and it’ll let you select a voice for each character it finds in it. So far I’ve gotten it to work on Intel Mac, Linux, Windows, and Steam Deck.

For a 5+ hour audiobook (the example being the first book in the Guardians of Ga’Hoole series):

Generating on CPU: 1050 min
Generating on a 3060 in Ubuntu natively: 130 min
Generating on Windows with a 3060: 250 min

You can even set the chapter delimiter, which is the keyword it uses to detect the start of a new chapter, so the final files are chap1, chap2, … and so on.
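To illustrate the idea of a chapter delimiter, here is a rough sketch of keyword-based splitting. This is not VoxNovel's actual code: the function name `split_chapters` and the default keyword `"CHAPTER"` are made up for the example, and how the real tool handles text before the first keyword is an assumption.

```python
import re


def split_chapters(text: str, delimiter: str = "CHAPTER") -> dict[str, str]:
    """Split text into chunks that each begin at an occurrence of `delimiter`.

    Hypothetical helper for illustration only. Any text before the first
    delimiter (front matter) ends up as its own first chunk.
    """
    # A zero-width lookahead splits *before* each keyword, so the keyword
    # stays at the start of its chapter instead of being thrown away.
    pattern = re.compile(rf"(?={re.escape(delimiter)})")
    chunks = [chunk.strip() for chunk in pattern.split(text) if chunk.strip()]
    # Name the pieces chap1, chap2, ... in reading order.
    return {f"chap{i}": chunk for i, chunk in enumerate(chunks, start=1)}
```

A keyword that also shows up in ordinary sentences would trigger false splits, so a distinctive delimiter works best with this kind of approach.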

The whole thing runs locally for free. I’ve been working on this project for quite a while, and I’d love to hear your thoughts!

  • corporat@alien.topB
    1 year ago

    Your demo audio is fascinating. The narrator voice is great, the Clud(?) voice is awful, the father is pretty good, and the mother is pretty bad. I think I caught that it made a mistake attributing narration to the wrong voice at one point. But overall I like the way it sounds :)

    How do you recommend making fixes like “oops, wrong voice here, use the narrator” or “change this voice here?”

    • Impossible_Belt_7757@alien.topOPB
      1 year ago

      Hm, I never thought about those cases; I just suspected no one would ever want to do fine-tuning like that. I suppose I’ll have to integrate a way to change it in the GUI. At the moment, the only way I know of is to manually modify book.csv and type in the speaker for the quote you’re looking at.

      • Impossible_Belt_7757@alien.topOPB
        1 year ago

        The book.csv file is located in VoxNovel/working_files/Book/. It’s just a CSV file where each row is a quote from a character or the narrator talking: the “text” column contains the quote and the “Speaker” column contains the name of the character talking.
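If editing the file by hand gets tedious, the same fix can be scripted. Below is a minimal sketch, assuming book.csv is a comma-separated UTF-8 file with a header row containing the “text” and “Speaker” columns described above. The helper name `reassign_speaker` is made up for this example and is not part of VoxNovel.

```python
import csv
from pathlib import Path


def reassign_speaker(csv_path, quote_fragment, new_speaker):
    """Set the Speaker of every row whose text contains `quote_fragment`.

    Hypothetical helper: assumes a header row with "text" and "Speaker"
    columns. Returns the number of rows that were changed.
    """
    path = Path(csv_path)

    # Read everything into memory first so the file can be rewritten in place.
    with path.open(newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        fieldnames = reader.fieldnames
        rows = list(reader)

    changed = 0
    for row in rows:
        if quote_fragment in (row.get("text") or ""):
            row["Speaker"] = new_speaker
            changed += 1

    # Write the rows back with the original column order.
    with path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)

    return changed
```

For example, `reassign_speaker("VoxNovel/working_files/Book/book.csv", "some words from the misattributed line", "Narrator")` would hand every matching quote to the narrator. The speaker name has to match one that already appears in the file, and it is worth keeping a backup copy of book.csv before overwriting it.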

    • Impossible_Belt_7757@alien.topOPB
      1 year ago

      Voice-wise though, it just uses voice cloning from audio samples I gave it of celebrities talking, so you can add your own voices if you want. I just haven’t added a button for it yet. Right now you have to go into the tortoise voices directory, manually create a new folder for that voice, and put a reference mp3 or mp4 of them talking in it.
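The manual steps for adding a voice can be sketched as a small script. The location of the tortoise voices directory depends on the install, so it is passed in as a parameter rather than guessed here. The helper name `add_voice` is made up for this example.

```python
import shutil
from pathlib import Path


def add_voice(voices_dir, voice_name, sample_path):
    """Create a folder for a new voice and copy a reference clip into it.

    Hypothetical helper: `voices_dir` is wherever the tortoise voices
    directory lives on your machine (not specified here).
    Returns the path of the copied sample.
    """
    sample = Path(sample_path)
    if not sample.is_file():
        raise FileNotFoundError(f"Reference audio not found: {sample}")

    # One folder per voice, named after the voice.
    voice_dir = Path(voices_dir) / voice_name
    voice_dir.mkdir(parents=True, exist_ok=True)

    # Copy rather than move, so the original recording stays where it was.
    return Path(shutil.copy2(sample, voice_dir / sample.name))
```

Calling it with the voices directory, a name for the new voice, and the path to the reference mp3 or mp4 leaves the folder layout described above.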