Daily Digest Podcast - PRD & Roadmap

Current State

DONE - What's Built & Working

TTS Generation:

  • Provider: macOS say (built-in, no external API)
  • Converts blog content to speech

Audio Mixing:

  • Music plays at 12% volume UNDER speech (continuous bed)
  • Music fades in at start (30%), fades out at end
  • Mixing is done with ffmpeg and works reliably
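
As a rough illustration of the mixing described above, the sketch below builds an ffmpeg argument list that loops a music file under the speech at 12% volume, with a fade-in at the start and a fade-out at the end. The function name, the 2-second fade length, and the file names are assumptions for illustration only, not the project's actual code. The "30%" figure on the fade-in is not modeled, since the notes do not say what it refers to. The `normalize=0` option on `amix` needs a reasonably recent ffmpeg build.

```python
from typing import List


def build_mix_command(
    speech_path: str,
    music_path: str,
    out_path: str,
    speech_seconds: float,
    music_volume: float = 0.12,  # 12% bed level, taken from the notes
    fade_seconds: float = 2.0,   # assumed; the fade length is not documented
) -> List[str]:
    """Return an ffmpeg argument list that mixes looping music under speech."""
    # Start the fade-out so that it finishes when the speech ends.
    fade_out_start = max(speech_seconds - fade_seconds, 0.0)
    bed = (
        f"[1:a]volume={music_volume},"
        f"afade=t=in:st=0:d={fade_seconds},"
        f"afade=t=out:st={fade_out_start}:d={fade_seconds}[bed]"
    )
    # duration=first ends the output when the speech (input 0) ends.
    mix = (
        "[0:a][bed]amix=inputs=2:duration=first:"
        "dropout_transition=0:normalize=0[out]"
    )
    return [
        "ffmpeg", "-y",
        "-i", speech_path,
        "-stream_loop", "-1", "-i", music_path,  # loop the music indefinitely
        "-filter_complex", f"{bed};{mix}",
        "-map", "[out]",
        out_path,
    ]
```

The returned list can be passed to `subprocess.run(cmd, check=True)`. Building the command as a list rather than a shell string avoids quoting problems with file paths.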

Files:

  • Intro: blog-creator/public/podcast-audio/intro.mp3 (71 sec)
  • Outro: blog-creator/public/podcast-audio/outro.mp3 (34 sec)

Tested: Audio generates successfully with music bed!


Configuration

# blog-backup .env.local
ENABLE_TTS=true
TTS_PROVIDER=macsay
ENABLE_PODCAST_MUSIC=true
INTRO_MUSIC_URL=/path/to/intro.mp3
OUTRO_MUSIC_URL=/path/to/outro.mp3
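
For orientation, here is a minimal sketch of how these variables might be read into a typed settings object. The `PodcastConfig` class, the `load_podcast_config` function, and the fallback provider value are hypothetical; the real application may parse its environment differently.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class PodcastConfig:
    enable_tts: bool
    tts_provider: str
    enable_music: bool
    intro_music_url: Optional[str]
    outro_music_url: Optional[str]


def _flag(value: Optional[str]) -> bool:
    """Treat only the literal string 'true' (any case) as enabled."""
    return (value or "").strip().lower() == "true"


def load_podcast_config(env: Mapping[str, str] = os.environ) -> PodcastConfig:
    """Build a PodcastConfig from an environment-like mapping."""
    return PodcastConfig(
        enable_tts=_flag(env.get("ENABLE_TTS")),
        # Falling back to "macsay" is an assumption, not documented behavior.
        tts_provider=env.get("TTS_PROVIDER", "macsay"),
        enable_music=_flag(env.get("ENABLE_PODCAST_MUSIC")),
        intro_music_url=env.get("INTRO_MUSIC_URL"),
        outro_music_url=env.get("OUTRO_MUSIC_URL"),
    )
```

Accepting a mapping instead of reading `os.environ` directly keeps the loader easy to exercise with a plain dictionary.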

What's Left to Do

1. Integrate TTS into Daily Digest Cron

  • The 7am cron job creates the digest but does not generate audio automatically
  • Add a TTS call to the digest workflow so audio is produced on each run
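
One possible shape for that TTS step is sketched below, assuming the digest text has already been written to a file. It only builds the macOS `say` command (`-f` reads text from a file, `-o` writes audio to a file, `-v` selects a voice). The function names and the `tts_enabled` switch are hypothetical; how the cron job invokes this step is not specified here.

```python
from typing import List, Optional


def build_say_command(
    text_path: str, audio_path: str, voice: Optional[str] = None
) -> List[str]:
    """Return a macOS `say` argument list that renders a text file to audio."""
    cmd = ["say", "-f", text_path, "-o", audio_path]
    if voice:
        cmd += ["-v", voice]  # optional; `say` uses the system voice otherwise
    return cmd


def digest_audio_step(
    digest_text_path: str, audio_path: str, tts_enabled: bool
) -> Optional[List[str]]:
    """Return the TTS command for a digest run, or None when TTS is disabled."""
    if not tts_enabled:
        return None
    return build_say_command(digest_text_path, audio_path)
```

A caller in the digest job could run the result with `subprocess.run(cmd, check=True)` after the digest is saved, and skip audio entirely when the function returns `None`.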

2. Pre-slice Intro/Outro (Optional Enhancement)

  • Create a 5-10 sec intro clip (the full 71 sec intro is currently used as the bed)
  • Create a 5-10 sec outro clip
  • Short clips would allow distinct intro / speech / outro segments
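
Slicing could look roughly like the sketch below, which builds an ffmpeg command that cuts a short clip and fades it out at the end. The 8-second length, the 1-second fade, the output file names, and the function name are assumptions chosen for illustration.

```python
from typing import List


def build_slice_command(
    src_path: str,
    dst_path: str,
    clip_seconds: float = 8.0,   # assumed; anywhere in the 5-10 sec range
    fade_seconds: float = 1.0,   # assumed fade-out length
    start_seconds: float = 0.0,  # offset into the source file
) -> List[str]:
    """Return an ffmpeg argument list that cuts a short, faded clip."""
    # With -ss before -i, clip timestamps restart at 0, so the fade start
    # is measured from the beginning of the clip.
    fade_start = max(clip_seconds - fade_seconds, 0.0)
    return [
        "ffmpeg", "-y",
        "-ss", str(start_seconds),
        "-i", src_path,
        "-t", str(clip_seconds),
        "-af", f"afade=t=out:st={fade_start}:d={fade_seconds}",
        dst_path,
    ]
```

For the 34 sec outro, taking the final 8 seconds would mean calling it with `start_seconds=26.0`.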

3. Transition Sounds (Optional)

  • Add a brief music bump between stories
  • Requires the pre-sliced clips from item 2
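
If each story were rendered to its own audio file, the bump could be interleaved and joined with ffmpeg's `concat` filter, as in the sketch below. Per-story audio files, the bump file, and both function names are assumptions; the current pipeline is not documented as producing separate story segments.

```python
from typing import List, Sequence


def interleave_with_bump(story_paths: Sequence[str], bump_path: str) -> List[str]:
    """Place the bump between consecutive stories, not before or after."""
    sequence: List[str] = []
    for index, story in enumerate(story_paths):
        if index > 0:
            sequence.append(bump_path)
        sequence.append(story)
    return sequence


def build_concat_command(segment_paths: Sequence[str], out_path: str) -> List[str]:
    """Return an ffmpeg argument list that joins audio segments in order."""
    if not segment_paths:
        raise ValueError("at least one segment is required")
    cmd: List[str] = ["ffmpeg", "-y"]
    for path in segment_paths:
        cmd += ["-i", path]  # the bump file is passed once per use
    labels = "".join(f"[{i}:a]" for i in range(len(segment_paths)))
    graph = f"{labels}concat=n={len(segment_paths)}:v=0:a=1[out]"
    cmd += ["-filter_complex", graph, "-map", "[out]", out_path]
    return cmd
```

Passing the bump as a separate input for each use keeps the filter graph simple, at the cost of ffmpeg opening the same small file several times.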

Documentation

Full audio mixing docs: ~/.openclaw/workspace/docs/AUDIO_MIXING.md


Test Result

Working audio generated:

  • URL: https://qnatchrjlpehiijwtreh.supabase.co/storage/v1/object/public/podcast-audio/tts-xxx.mp3
  • Duration: ~120 seconds (matches speech length)
  • Sound: Speech with background music bed throughout