YouTube → diarization → D-Mel token pipeline for TTS fine-tuning, with LoRA and full fine-tune evaluation.