← Back to Overview

πŸŽ™οΈ Speech Transcription

Real-time speech-to-text transcription with advanced audio processing, meeting minutes generation, and comprehensive export options.

Overview

The Speech Transcription feature provides two powerful tools for converting speech to text:

πŸ“

Speech Transcription

General-purpose real-time transcription with sensitive data detection. Perfect for notes, dictation, and quick recordings.

πŸ‘₯

Meeting Transcription

Specialised for meetings with automatic minutes generation, action items extraction, and participant tracking.

πŸ’‘ Privacy Reminder: This feature demonstrates what's possible when an app has microphone access. It's a practical tool, but also a reminder to be mindful of which apps you grant this permission to. All processing happens on-device.

1. Speech Transcription

Services: SpeechTranscriptionService, SensitiveInfoDetector, TranscriptStorageService

Features

Sensitive Data Detection

Automatically detects and flags potentially sensitive information:

Live Statistics

Audio Controls

2. Meeting Transcription

Services: MeetingMinutesService, TranscriptExportService

Meeting Setup

Recording Modes

Mode Best For Description
Standard General use Default transcription settings
Meeting Group discussions Optimised for multiple speakers
Interview Two-person conversations Balanced for dialogue
Lecture Presentations Single speaker, long duration

Automatic Meeting Minutes

The app analyses your transcript and generates structured meeting minutes:

Action Item Detection

Automatically extracts action items based on keywords like:

Priority is determined by urgency keywords (urgent, critical, ASAP = High; when possible, eventually = Low).

3. Advanced Audio Processing

Service: AdaptiveLMSFilter, AudioEngineHandler

Two mutually exclusive audio enhancement modes to improve transcription quality:

πŸ”‡

Denoise (LMS Filter)

Adaptive Least Mean Squares filter that learns and removes background noise in real-time.

  • 32-tap FIR filter
  • Adapts to your environment
  • Removes constant noise (fans, AC, traffic)
  • Preserves speech frequencies
🎀

Clarity (Voice Enhancement)

5-band EQ filter that emphasises voice frequencies for clearer speech recognition.

  • High-pass at 200Hz (removes rumble)
  • Boost at 400Hz (voice fundamentals)
  • Boost at 1.5kHz (clarity)
  • Boost at 3kHz (articulation)
  • Low-pass at 6kHz (removes hiss)
Note: Denoise and Clarity are mutually exclusive β€” only one can be active at a time. This saves CPU resources and prevents conflicting audio processing.

Signal-to-Noise Ratio (SNR) Indicator

Live SNR display shows audio quality during recording:

SNR Level Quality Colour
< 10 dB Poor (noisy environment) Red
10-20 dB Acceptable Orange
20-30 dB Good Yellow
> 30 dB Excellent Green

4. Export & Sharing

Service: TranscriptExportService

Export Formats

Format Extension Best For
Plain Text .txt Simple sharing, compatibility
Markdown .md Documentation, formatted notes
PDF .pdf Professional documents, email attachments
JSON .json Data processing, backups

Sharing Options

Meeting Minutes Export

Meeting minutes can be exported with full formatting:

Corporate PDF Export

Professional PDF design for business use:

5. Search & Analysis

Search Functionality

Statistics Dashboard

Document Behaviour

Growing Document: The transcript is a continuous document that grows over time. Text never disappears β€” it only accumulates. When recognition restarts (due to Apple's 1-minute limit), all previous text is preserved and new text is appended.

6. Technical Implementation

Core Technologies

Swift 6 Concurrency

Recognition Handling