|
@@ -11,7 +11,9 @@
|
|
|
"\n",
|
|
|
"In this notebook, we will learn how to generate Audio using both `suno/bark` and `parler-tts/parler-tts-mini-v1` models first. \n",
|
|
|
"\n",
|
|
|
- "After that, we will use the output from Notebook 3 to generate our complete podcast"
|
|
|
+ "After that, we will use the output from Notebook 3 to generate our complete podcast.\n",
|
|
|
+ "\n",
|
|
|
+ "Note: Please feel free to extend this notebook with newer models. The above two were chosen after some tests using a sample prompt."
|
|
|
]
|
|
|
},
|
|
|
{
|
|
@@ -117,11 +119,7 @@
|
|
|
"id": "50b62df5-5ea3-4913-832a-da59f7cf8de2",
|
|
|
"metadata": {},
|
|
|
"source": [
|
|
|
- "Generally in life, you set your device to \"cuda\" and are happy. \n",
|
|
|
- "\n",
|
|
|
- "However, sometimes you want to compensate for things and set it to `cuda:7` to tell the system but even more-so the world that you have 8 GPUS.\n",
|
|
|
- "\n",
|
|
|
- "Jokes aside please set `device = \"cuda\"` below if you're using a single GPU node."
|
|
|
+ "Please set `device = \"cuda\"` below if you're using a single GPU node."
|
|
|
]
|
|
|
},
|
|
|
{
|
|
@@ -161,7 +159,7 @@
|
|
|
],
|
|
|
"source": [
|
|
|
"# Set up device\n",
|
|
|
- "device = \"cuda:7\" if torch.cuda.is_available() else \"cpu\"\n",
|
|
|
+ "device = \"cuda\" if torch.cuda.is_available() else \"cpu\"\n",
|
|
|
"\n",
|
|
|
"# Load model and tokenizer\n",
|
|
|
"model = ParlerTTSForConditionalGeneration.from_pretrained(\"parler-tts/parler-tts-mini-v1\").to(device)\n",
|
|
@@ -640,6 +638,19 @@
|
|
|
]
|
|
|
},
|
|
|
{
|
|
|
+ "cell_type": "markdown",
|
|
|
+ "id": "c7ce5836",
|
|
|
+ "metadata": {},
|
|
|
+ "source": [
|
|
|
+ "### Suggested Next Steps:\n",
|
|
|
+ "\n",
|
|
|
+ "- Experiment with the prompts: try modifying the `SYSTEM_PROMPT` in the notebooks\n",
|
|
|
+ "- Extend workflow beyond two speakers\n",
|
|
|
+ "- Test other TTS Models\n",
|
|
|
+ "- Experiment with speech enhancement models as a fifth step."
|
|
|
+ ]
|
|
|
+ },
|
|
|
+ {
|
|
|
"cell_type": "code",
|
|
|
"execution_count": null,
|
|
|
"id": "26cc56c5-b9c9-47c2-b860-0ea9f05c79af",
|