The Audio app is a voice-to-notes workflow. Users record or upload audio, get a transcript, then generate a structured summary with action items. That makes it far more useful than a plain “speech-to-text” page.
What Your Users Can Do
The app is built around a simple but practical workflow:
Generate a summary
OpenAI takes the transcript and produces a structured summary with a title and action items.
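The flow described above (audio in, transcript, then summary) can be sketched as a small pipeline. This is a minimal illustration, not the app's actual code: `audioToNotes`, `PipelineSteps`, and `NotesResult` are hypothetical names, and the transcription and summarization steps are injected as plain functions so the sketch does not depend on the Replicate or OpenAI SDKs.

```typescript
// Hypothetical step signatures; the real route handlers may look different.
interface PipelineSteps<S> {
  // Assumed to wrap the Whisper call and resolve to the transcript text.
  transcribe: (audioUrl: string) => Promise<string>;
  // Assumed to wrap the OpenAI call and resolve to a structured summary.
  summarize: (transcript: string) => Promise<S>;
}

interface NotesResult<S> {
  transcript: string;
  summary: S;
}

async function audioToNotes<S>(
  audioUrl: string,
  steps: PipelineSteps<S>
): Promise<NotesResult<S>> {
  const transcript = (await steps.transcribe(audioUrl)).trim();
  if (transcript.length === 0) {
    // Nothing to summarize, so fail early instead of sending an empty prompt.
    throw new Error("Transcription returned no text");
  }
  const summary = await steps.summarize(transcript);
  return { transcript, summary };
}
```

Injecting the two steps keeps the orchestration easy to test with fakes, and it lets you swap either provider without touching the flow itself.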
What Powers It
- Replicate Whisper for transcription
- OpenAI via the AI SDK for structured summaries
- Storage for uploaded audio files
- Better Auth + PostgreSQL for user accounts, recordings, and summaries
What You Need
- Better Auth + PostgreSQL: Auth, saved recordings, transcripts, and summaries
- Storage: Uploaded recordings
- OpenAI: Structured summary generation
- Replicate: Whisper transcription
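Before running the app, it can help to confirm that credentials for these services are present. The sketch below is one way to do that. The variable names are assumptions based on each library's common defaults (`OPENAI_API_KEY`, `REPLICATE_API_TOKEN`, `BETTER_AUTH_SECRET`, `DATABASE_URL`), so check them against your own `.env` file. Storage is left out because the page does not name a storage provider.

```typescript
// Assumed environment variable names; adjust to match your project.
const REQUIRED_ENV: { service: string; names: string[] }[] = [
  { service: "Better Auth + PostgreSQL", names: ["BETTER_AUTH_SECRET", "DATABASE_URL"] },
  { service: "OpenAI", names: ["OPENAI_API_KEY"] },
  { service: "Replicate", names: ["REPLICATE_API_TOKEN"] },
];

// Returns one "service: VARIABLE" entry for each variable that is unset or blank.
function findMissingEnv(env: Record<string, string | undefined>): string[] {
  const missing: string[] = [];
  for (const entry of REQUIRED_ENV) {
    for (const name of entry.names) {
      const value = env[name];
      if (value === undefined || value.trim() === "") {
        missing.push(`${entry.service}: ${name}`);
      }
    }
  }
  return missing;
}
```

In a Node.js project you would typically call `findMissingEnv(process.env)` at startup and log or throw if the returned list is not empty.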
How It Works Under the Hood
AI summarization
OpenAI generates a structured summary, title, and action items from the transcript.
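To make "structured" concrete, here is a minimal sketch of what the summary shape might look like and how a raw JSON response could be checked against it. The field names (`title`, `summary`, `actionItems`) and the `parseStructuredSummary` helper are assumptions for illustration. The app itself uses the AI SDK, which can enforce a schema for you, so this hand-rolled check only shows the idea.

```typescript
// Hypothetical shape; the real schema in the summarize route may differ.
interface StructuredSummary {
  title: string;
  summary: string;
  actionItems: string[];
}

// Parses a JSON string and verifies it matches the assumed summary shape.
function parseStructuredSummary(raw: string): StructuredSummary {
  const data: unknown = JSON.parse(raw);
  if (typeof data !== "object" || data === null) {
    throw new Error("Summary must be a JSON object");
  }
  const { title, summary, actionItems } = data as Record<string, unknown>;
  if (typeof title !== "string" || title.trim() === "") {
    throw new Error("Missing title");
  }
  if (typeof summary !== "string") {
    throw new Error("Missing summary");
  }
  if (!Array.isArray(actionItems) || !actionItems.every((item) => typeof item === "string")) {
    throw new Error("actionItems must be an array of strings");
  }
  return { title: title.trim(), summary, actionItems: actionItems as string[] };
}
```

Rejecting malformed output at this boundary means the UI and the database only ever see summaries with a title and a list of action items.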
Great Starting Point For…
- Meeting summarizers
- Founder voice memo tools
- Podcast notes
- Sales call summaries
- Interview analysis
- Private internal knowledge capture
Files to Customize
If you want to make this your own, start here:
- app/(apps)/audio/toolConfig.ts
- app/(apps)/audio/api/transcribe/route.ts
- app/(apps)/audio/api/summarize/route.ts
- app/(apps)/audio/components/*
Verification
Your setup is working if:
- You can upload or record audio
- Transcription completes successfully
- A summary is generated with a title and action items
- Recordings remain available after refresh

