A set of jupyter notebooks
-
Updated
Dec 18, 2024 - Jupyter Notebook
A set of jupyter notebooks
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
Free voice cloning for creators using Coqui XTTS-v2 on Google Colab. Clone your voice with just a few minutes of audio. Complete guide to build your own notebook.
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
Jupyter notebook for turning textual dialogue into voice audio.
Free voice cloning and TTS for creators using Qwen3-TTS on Google Colab. Clone your voice with just a few seconds of audio. Complete guide to build your own notebook.
A notebook created for training StableTTS models in Google Colab easily!
Bark - test notebook
AI-assisted vocabulary notebook for Android with Gemini-powered definitions and offline Room storage.
Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.
A personal sandbox of Python scripts and notebooks spanning data engineering, AI/GenAI, speech processing, web scraping, and data analysis — with an Indonesian context.
This is a notebook from kaggle I had made that allows user's to refine and clean up their tacotron 2 models in terms of the audio output. In order to use this notebook you'll require a Tacotron 2 model fully trained, with the dataset used and the transcription and/or validation file too.
Currently this is not working as the repo used to make the notebook work is under-development, This is a notebook from Kaggle I had made that allows user's to make their own AI voices using 16bit PCM, 22050 HZ WAV files on the Neural networks provided by NVIDIA's creation of Tacotron 2 which has been further developed and worked on by the team a…
✭ MAGNETRON ™ ✭: This is a Google Colab/Jupyter Notebook for developing a VOICE PROXIA (B) when working with ARTIFICIAL INTELLIGENCE 2.0 ™ (ARTIFICIAL INTELLIGENCE 2.0™ is part of MAGNETRON ™ TECHNOLOGY).
This is a notebook from kaggle I had made that allows user's to make their own AI voices using 16bit PCM, 22050 HZ WAV files on the Neural networks provided by NVIDIA's creation of Tacotron 2.
A ready-to-use Google Colab notebook for running the open-source VibeVoice TTS model from Microsoft, using the quantized Large Q8 variant (~12 GB VRAM) for multi-speaker long-form audio generation
This is a notebook from Kaggle I had made that allows user's to make their own AI voices using 16bit PCM, 22050 HZ WAV files on the Neural networks provided by NVIDIA's creation of Tacotron 2 that has been slightly modified to use arpabet to help the model enunciate words better when synthesizing.
This project showcases the use of IBM Watson Text to Speech API within a Google Colab notebook. It securely handles API credentials stored in Google Drive, reads input text files, converts text to spoken audio, and saves the resulting MP3 files directly to Google Drive for easy access.
Add a description, image, and links to the text-to-speech topic page so that developers can more easily learn about it.
To associate your repository with the text-to-speech topic, visit your repo's landing page and select "manage topics."