The Audio app is a voice-to-notes workflow. Users record or upload audio, get a transcript, then generate a structured summary with action items. That makes it far more useful than a plain “speech-to-text” page.

What Your Users Can Do

The app is built around a simple but practical workflow:
1. Record or upload audio

Users can record directly in the browser or upload an existing audio file.

2. Transcribe

The audio is sent to Replicate Whisper for accurate transcription.

3. Generate a summary

OpenAI takes the transcript and produces a structured summary with a title and action items.

4. Review and revisit

Everything is saved: users can come back to any recording, re-read the transcript, and check their action items.

This is a strong pattern for meeting notes, voice memos, interviews, and internal productivity tools.

What Powers It

  • Replicate Whisper for transcription
  • OpenAI via the AI SDK for structured summaries
  • Storage for uploaded audio files
  • Better Auth + PostgreSQL for user accounts, recordings, and summaries
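
To make "structured summary" concrete, here is a minimal sketch of the shape such a summary could take, plus a guard that rejects malformed model output before it is saved. The `AudioSummary` interface, its field names, and both helper functions are assumptions for illustration and are not taken from the template. The sketch validates plain JSON and does not call the AI SDK.

```typescript
// Hypothetical shape of a structured summary; field names are assumptions.
interface AudioSummary {
  title: string;
  summary: string;
  actionItems: string[];
}

// Type guard: true only when the value has a non-empty title, a summary
// string, and an array of string action items.
function isAudioSummary(value: unknown): value is AudioSummary {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  return (
    typeof v["title"] === "string" &&
    v["title"].trim().length > 0 &&
    typeof v["summary"] === "string" &&
    Array.isArray(v["actionItems"]) &&
    v["actionItems"].every((item) => typeof item === "string")
  );
}

// Parse raw JSON text; returns null for invalid JSON or an unexpected shape.
function parseAudioSummary(raw: string): AudioSummary | null {
  try {
    const parsed: unknown = JSON.parse(raw);
    return isAudioSummary(parsed) ? parsed : null;
  } catch {
    return null;
  }
}
```

Returning `null` instead of throwing lets the caller decide whether to retry the summary or show the transcript on its own.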

What You Need

  • Better Auth + PostgreSQL: auth, saved recordings, transcripts, and summaries
  • Storage: uploaded recordings
  • OpenAI: structured summary generation
  • Replicate: Whisper transcription
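
For orientation, this sketch shows roughly what a transcription call to Replicate looks like over plain HTTP. It is not necessarily how the template calls Replicate. The function name `buildWhisperRequest` is hypothetical, `modelVersion` is a placeholder you would copy from the Whisper model page, and the `audio` input field is an assumption about that model's input schema.

```typescript
// Shape of the HTTP request this sketch builds.
interface WhisperRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Build a request for Replicate's "create a prediction" endpoint.
// `modelVersion` is a placeholder; copy the real id from the model page.
function buildWhisperRequest(
  audioUrl: string,
  modelVersion: string,
  apiToken: string
): WhisperRequest {
  if (!/^https?:\/\//.test(audioUrl)) {
    throw new Error("audioUrl must be an http(s) URL that Replicate can fetch");
  }
  return {
    url: "https://api.replicate.com/v1/predictions",
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiToken}`,
        "Content-Type": "application/json",
        // Ask Replicate to hold the connection open until the prediction
        // finishes or the wait window elapses.
        Prefer: "wait",
      },
      // Assumption: the Whisper model takes the file under the `audio` key.
      body: JSON.stringify({ version: modelVersion, input: { audio: audioUrl } }),
    },
  };
}
```

You would pass the result to `fetch(url, init)` on the server and read the prediction's `output` from the response JSON. The exact output fields depend on the model version, so check them against the model page before relying on them.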

How It Works Under the Hood

1. Audio capture

The user records audio in-app or uploads a file.

2. Storage

The file is saved to object storage.

3. Transcription

Replicate Whisper creates a text transcript from the audio.

4. Database save

The transcript is saved in PostgreSQL, linked to the user’s account.

5. AI summarization

OpenAI generates a structured summary, title, and action items from the transcript.

6. Revisit anytime

The user can reopen any recording later and review everything in one place.

This app isn’t just transcription. It turns raw audio into something you can actually act on. That’s what makes it useful as a product pattern.
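
The steps above can be sketched as one orchestration function. Every name here (`PipelineDeps`, `RecordingRow`, `processRecording`) is hypothetical and mirrors the described flow, not the template's actual code. Storage, Replicate, PostgreSQL, and OpenAI are passed in as functions so each step can be swapped or faked.

```typescript
// Hypothetical types; they mirror the steps above, not the real schema.
interface PipelineSummary {
  title: string;
  summary: string;
  actionItems: string[];
}

interface RecordingRow {
  userId: string;
  audioUrl: string;
  transcript: string;
  summary: PipelineSummary | null;
}

// External services are injected so the flow stays testable.
interface PipelineDeps {
  storeAudio(userId: string, file: Uint8Array): Promise<string>; // resolves to a URL
  transcribe(audioUrl: string): Promise<string>;
  saveRecording(row: RecordingRow): Promise<void>; // assumed to behave like an upsert
  summarize(transcript: string): Promise<PipelineSummary>;
}

async function processRecording(
  userId: string,
  file: Uint8Array,
  deps: PipelineDeps
): Promise<RecordingRow> {
  if (file.length === 0) throw new Error("Audio file is empty");

  const audioUrl = await deps.storeAudio(userId, file); // step 2: storage
  const transcript = await deps.transcribe(audioUrl); // step 3: transcription

  // Step 4: save the transcript first, so it survives a failed summary.
  const row: RecordingRow = { userId, audioUrl, transcript, summary: null };
  await deps.saveRecording(row);

  // Step 5: summarize, then persist the summary alongside the transcript.
  const summary = await deps.summarize(transcript);
  const complete: RecordingRow = { ...row, summary };
  await deps.saveRecording(complete);
  return complete;
}
```

Saving the transcript before summarizing is a design choice in this sketch: if the summary call fails, the user still keeps the transcript and can retry the summary later.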

Great Starting Point For…

  • Meeting summarizers
  • Founder voice memo tools
  • Podcast notes
  • Sales call summaries
  • Interview analysis
  • Private internal knowledge capture

Files to Customize

If you want to make this your own, start here:
  • app/(apps)/audio/toolConfig.ts
  • app/(apps)/audio/api/transcribe/route.ts
  • app/(apps)/audio/api/summarize/route.ts
  • app/(apps)/audio/components/*

Verification

Your setup is working if:
  • You can upload or record audio
  • Transcription completes successfully
  • A summary is generated with a title and action items
  • Recordings remain available after refresh
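
If you prefer to script these checks, the sketch below turns the checklist into a function you could run against a recording re-fetched after a page refresh. The `SavedRecording` shape and the `verifyRecording` name are hypothetical; adapt them to whatever your database rows actually look like.

```typescript
// Hypothetical view of a recording as loaded back from the database.
interface SavedRecording {
  audioUrl: string | null;
  transcript: string | null;
  summary: { title: string; actionItems: string[] } | null;
}

// Returns human-readable problems; an empty array means every check passed.
// Pass null when the recording could not be found after a refresh.
function verifyRecording(rec: SavedRecording | null): string[] {
  if (rec === null) return ["Recording is not available after refresh"];

  const problems: string[] = [];
  if (!rec.audioUrl) problems.push("Audio was not uploaded or recorded");
  if (!rec.transcript || rec.transcript.trim() === "") {
    problems.push("Transcription did not complete");
  }
  if (rec.summary === null) {
    problems.push("Summary was not generated");
  } else {
    if (rec.summary.title.trim() === "") problems.push("Summary has no title");
    if (rec.summary.actionItems.length === 0) problems.push("Summary has no action items");
  }
  return problems;
}
```

Use a test recording that clearly contains tasks, since audio with nothing actionable may legitimately produce an empty action item list.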