| 
					
				 | 
			
			
				@@ -97,7 +97,7 @@ Audio(speech_output[0].cpu().numpy(), rate=sampling_rate)``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 Promising directions to explore in future: 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-- [MeloTTS](huggingface.co/myshell-ai/MeloTTS-English) This is most popular (ever) on HuggingFace 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+- [MeloTTS](https://huggingface.co/myshell-ai/MeloTTS-English) This is most popular (ever) on HuggingFace 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - [WhisperSpeech](https://huggingface.co/WhisperSpeech/WhisperSpeech) sounded quite natural as well 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - [F5-TTS](https://github.com/SWivid/F5-TTS) was the latest release at this time, however, it felt a bit robotic 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - E2-TTS: r/locallama claims this to be a little better, however, it didn't pass the vibe test 
			 |