Skip to content

AudioReportsV1

This workflow compresses large audio or video files to under 25MB while preserving speech clarity, then transcribes them using OpenAI's Whisper model. It's designed to handle media files that are too large for direct transcription by first optimizing them for speech recognition.

Purpose

No business context provided yet — add a context.md to enrich this documentation.

This workflow solves the common problem of transcribing large audio or video files that exceed the size limits of transcription services. By compressing files while maintaining speech quality, it enables automated transcription of lengthy recordings, podcasts, or video content.

How It Works

  1. Manual Start: The workflow begins with a manual trigger
  2. Compression Job: Sends the audio/video file to CloudConvert with specific compression settings optimized for speech
  3. Wait Period: Waits 7 seconds to allow the compression job to start processing
  4. Status Check: Polls CloudConvert to check if the compression job is complete
  5. Loop Until Done: If not finished, waits and checks again until the job completes
  6. Download: Retrieves the compressed audio file from CloudConvert
  7. Transcription: Sends the compressed file to OpenAI Whisper for transcription

Workflow Diagram

graph TD
    A[start workflow] --> B[send compression job to cloudconvert]
    B --> C[Wait before checking status]
    C --> D[get compression job status]
    D --> E{is compression finished ?}
    E -->|Yes| F[Download compressed audio]
    E -->|No| C
    F --> G[Transcribe audio with whisper]

Trigger

  • Type: Manual Trigger
  • Activation: Requires manual execution to start the workflow

Nodes Used

Node Type Purpose
Manual Trigger Starts the workflow manually
HTTP Request Sends compression job to CloudConvert API
Wait Pauses execution for 7 seconds
HTTP Request Checks compression job status
If Determines if compression is complete
HTTP Request Downloads the compressed audio file
OpenAI Transcribes audio using Whisper model
Sticky Note Documentation and workflow description

External Services & Credentials Required

CloudConvert

  • Purpose: Audio/video compression service
  • Credentials: API Bearer token (currently hardcoded - should be moved to credentials)
  • Operations: File compression with speech-optimized settings

OpenAI

  • Purpose: Audio transcription using Whisper
  • Credentials: OpenAI API key (stored as "OpenAI Assistants API")
  • Operations: Audio transcription

Environment Variables

No environment variables are currently used. The CloudConvert API token is hardcoded in the workflow and should be moved to a secure credential store.

Data Flow

Input

  • Audio or video file URL (currently hardcoded as an example MP3 file)
  • File can be in various formats supported by CloudConvert

Processing

  • Compresses to MP3 format with:
    • 32 kbps bitrate
    • 22050 Hz frequency
    • Mono channel (1 channel)
  • Optimized settings preserve speech clarity while minimizing file size

Output

  • Text transcription of the audio content
  • Compressed audio file (temporarily available via CloudConvert)

Error Handling

The workflow includes basic error handling through the conditional check for compression completion. However, there are no explicit error handling paths for: - CloudConvert API failures - Network timeouts - Invalid file formats - OpenAI transcription errors

Known Limitations

  • CloudConvert API token is hardcoded and exposed in the workflow
  • Input file URL is hardcoded (needs to be parameterized)
  • No error handling for API failures
  • Workflow is archived and inactive
  • Limited to files that CloudConvert can process
  • No validation of input file format or size

This workflow is tagged with "VoiceNote Reports", "YouTube", "Content", and "Video", suggesting it's part of a larger content processing system, but no specific related workflows are referenced.

Setup Instructions

  1. Import Workflow: Import the JSON into your n8n instance

  2. Configure CloudConvert:

    • Sign up for CloudConvert API access
    • Create a new credential in n8n for CloudConvert
    • Replace the hardcoded Bearer token with the credential reference
  3. Configure OpenAI:

    • Ensure you have OpenAI API access
    • Verify the "OpenAI Assistants API" credential is properly configured
  4. Customize Input:

    • Replace the hardcoded URL in the first HTTP Request node
    • Consider adding a webhook or form trigger to accept dynamic file URLs
  5. Test:

    • Start with a small test file
    • Verify compression settings work for your use case
    • Check transcription quality
  6. Security:

    • Move all API tokens to n8n credentials
    • Review and update compression settings as needed
    • Consider adding error handling nodes
  7. Activate: Enable the workflow once configuration is complete