The Audio app is a voice-to-notes workflow. Users can record or upload audio,
get a transcript, then generate a structured summary with action items. It is
a much more useful demo than a plain “speech-to-text only” page.
What Users Can Do
The current app is built around a simple but practical workflow:- record audio or upload a file
- transcribe it
- generate a summary
- review action items
- revisit previous recordings later
What Powers It
The app currently uses:- Replicate Whisper for transcription
- OpenAI via the AI SDK for structured summaries
- storage for uploaded audio files
- Supabase for recordings, transcripts, and summaries
What You Need
To run this app, set up Supabase, OpenAI, Replicate, and Storage.Supabase
Needed for auth, saved recordings, transcripts, and summaries.
Storage
Needed for uploaded recordings.
OpenAI
Needed for the structured summary step.
Replicate
Needed for the current Whisper transcription path.
How It Works
Here is the real flow in simple language:- the user records audio in the app or uploads a file
- the file is saved to object storage
- Replicate Whisper creates a transcript
- the transcript is saved in Supabase
- OpenAI generates a structured summary, title, and action items
- the user can reopen that recording later and review everything
Why This App Is Useful As A Template
You can repurpose the same pattern for:- meeting summarizers
- founder voice memo tools
- podcast notes
- sales call summaries
- interview analysis
- private internal knowledge capture
Files To Know
If you want to customize this app, start here:app/(apps)/audio/toolConfig.tsapp/(apps)/audio/api/transcribe/route.tsapp/(apps)/audio/api/summarize/route.tsapp/(apps)/audio/components/*
Verification
Your setup is working if:- you can upload or record audio
- transcription completes successfully
- a summary can be generated
- recordings remain available after refresh

