|
7 hónapja | |
---|---|---|
.. | ||
1B-chat-start.py | 7 hónapja | |
1B-debating-script.py | 7 hónapja | |
Bark-Testing.ipynb | 7 hónapja | |
Parler-Testing.ipynb | 7 hónapja | |
Prompt_testing.md | 7 hónapja | |
README.md | 7 hónapja | |
Step-1 PDF Pre-Processing Logic.ipynb | 7 hónapja | |
Step-2-70B-Rewriter.ipynb | 7 hónapja | |
Step-2-8B-Rewriter.ipynb | 7 hónapja | |
Step-5-TTS-Workflow.ipynb | 7 hónapja | |
clean_extracted_text.txt | 7 hónapja | |
gradio-app.py | 7 hónapja |
Steps:
Path:
Report written
2 Agents debate/interact? Podcast style -> Write a Transcript
TTS Engine (E25 or ) Make the podcast
Running 1B-Model: python 1B-chat-start.py --temperature 0.7 --top_p 0.9 --system_message "you are acting as an old angry uncle and will debate why LLMs are bad" --user_message "I love LLMs"
Running Debator: python 1B-debating-script.py --initial_topic "The future of space exploration" --system_prompt1 "You are an enthusiastic advocate for space exploration" --system_prompt2 "You are a skeptic who believes we should focus on Earth's problems first" --n_turns 4 --temperature 0.8 --top_p 0.9 --model_name "meta-llama/Llama-3.2-1B-Instruct"
Actually this IS THE MOST CONSISTENT PROMPT: Small:
description = """
Laura's voice is expressive and dramatic in delivery, speaking at a fast pace with a very close recording that almost has no background noise.
"""
Large:
description = """
Alisa's voice is consistent, quite expressive and dramatic in delivery, with a very close recording that almost has no background noise.
"""
Small:
description = """
Jenna's voice is consistent, quite expressive and dramatic in delivery, with a very close recording that almost has no background noise.
"""
Bark is cool but just v6 works great, I tried v9 but its quite robotic and that is sad.
So Parler is next-its quite cool for prompting
xTTS-v2 by coquai is cool, however-need to check the license-I think an example is allowed
Torotoise is blocking because it needs HF version that doesnt work with llama-3.2 models so I will probably need to make a seperate env-need to eval if its worth it
Side note: The TTS library is a really cool effort!
Bark-Tests: Best results for speaker/v6 are at speech_output = model.generate(**inputs, temperature = 0.9, semantic_temperature = 0.8)
Audio(speech_output[0].cpu().numpy(), rate=sampling_rate)
Tested sound effects:
Ignore/Delete this in final stages, right now this is a "vibe-check" for TTS model(s):
Starting with: Bark but if it falls apart, here is the order
Vibe check:
Higher Barrier to testing (In other words-I was too lazy to test):
Try later: