test-repo/docs/PODCAST_PRD.md

66 lines
1.6 KiB
Markdown

# Daily Digest Podcast - PRD & Roadmap
## Current State
### ✅ DONE - What's Built & Working
**TTS Generation:**
- Provider: macOS `say` (built-in, no external API)
- Converts blog content to speech
**Audio Mixing:**
- Music plays at 12% volume UNDER speech (continuous bed)
- Music fades in at start (30%), fades out at end
- Works reliably with ffmpeg
**Files:**
- Intro: `blog-creator/public/podcast-audio/intro.mp3` (71 sec)
- Outro: `blog-creator/public/podcast-audio/outro.mp3` (34 sec)
**Tested:** Audio generates successfully with music bed!
---
## Configuration
```bash
# blog-backup .env.local
ENABLE_TTS=true
TTS_PROVIDER=macsay
ENABLE_PODCAST_MUSIC=true
INTRO_MUSIC_URL=/path/to/intro.mp3
OUTRO_MUSIC_URL=/path/to/outro.mp3
```
---
## What's Left to Do
### 1. Integrate TTS into Daily Digest Cron
- [ ] 7am cron creates digest but doesn't auto-generate audio
- [ ] Need to add TTS call to workflow
### 2. Pre-slice Intro/Outro (Optional Enhancement)
- [ ] Create 5-10 sec intro clip (currently using full 71 sec as bed)
- [ ] Create 5-10 sec outro clip
- [ ] This would enable distinct intro/speech/outro segments
### 3. Transition Sounds (Optional)
- [ ] Add brief music bump between stories
- [ ] Requires pre-sliced clips
---
## Documentation
**Full audio mixing docs:** `~/.openclaw/workspace/docs/AUDIO_MIXING.md`
---
## Test Result
**Working audio generated:**
- URL: `https://qnatchrjlpehiijwtreh.supabase.co/storage/v1/object/public/podcast-audio/tts-xxx.mp3`
- Duration: ~120 seconds (matches speech length)
- Sound: Speech with background music bed throughout