Documentation
HomeAPISign In
  • Getting Started
    • Overview
      • Core Concepts
      • Building your First Workflow
    • API Reference
  • Your Data
    • Brand Kits
    • Knowledge Bases
      • Add Data
        • Upload Files
        • Web Scrape
        • Import from Google Drive
        • Import from SQL Database
        • Import from Shopify
      • Knowledge Base Search
      • Knowledge Base Metadata
      • Knowledge Base API
  • Building Workflows
    • Workflow Concepts
      • Workflow Inputs
        • Input Types
      • Workflow Outputs
      • Variable Referencing
      • Liquid Templating
    • Workflow Steps
      • AI
        • Prompt LLM
          • Model Selection Guide
          • Prompting Guide
        • Transcribe Audio File
      • Web Research
        • Google Search
        • Web Page Scrape
      • Code
        • Run Code
        • Call API
        • Format JSON
        • Run SQL Query
        • Write Liquid Text
      • Flow
        • Condition
        • Iteration
        • Human Review
        • Content Comparison
        • Error
      • Data
        • Read from Grid
        • Write to Grid
        • Search Knowledge Base
        • Write to Knowledge Base
        • Get Knowledge Base File
      • AirOps
        • Workflow
        • Agent
      • Image & Video
        • Generate Image with API
        • Search Stock Images
        • Fetch Stock Image with ID
        • Resize Image
        • Screenshot from URL
        • Create OpenGraph Image
        • Create Video Avatar
      • SEO Research
        • Semrush
        • Data4SEO
      • Content Quality
        • Detect AI Content
        • Scan Content for Plagiarism
      • Content Processing
        • Convert Markdown to HTML
        • Convert PDF URL to Text
        • Group Keywords into Clusters
      • B2B Enrichment
        • Hunter.io
        • People Data Labs
      • CMS Integrations
        • Webflow
        • WordPress
        • Shopify
        • Contentful
        • Sanity
        • Strapi
      • Analytics Integrations
        • Google Search Console
      • Collaboration Integrations
        • Gmail
        • Google Docs
        • Google Sheets
        • Notion
        • Slack
    • Testing and Iteration
    • Publishing and Versioning
  • Running Workflows
    • Run Once
    • Run in Bulk (Grid)
    • Run via API
    • Run via Trigger
      • Incoming Webhook Trigger
      • Zapier
    • Run on a Schedule
    • Error Handling
  • Grids
    • Create a Grid
      • Import from Webflow
      • Import from Wordpress
      • Import from Semrush
      • Import from Google Search Console
    • Add Columns in the Grid
    • Run Workflows in the Grid
      • Add Workflow Column
      • Run Workflow Column
      • Map Workflow Outputs
      • Review Workflow Run Metadata
    • Review Content in the Grid
      • Review Markdown Content
      • Review HTML Content
      • Compare Content Difference
    • Publish to CMS from Grid
    • Pull Analytics in the Grid
    • Export as CSV
  • Copilot
    • Chat with Copilot
    • Edit Workflows with Copilot
    • Fix Errors with Copilot
  • Monitoring
    • Task Usage
    • Analytics
    • Alerts
    • Execution History
  • Your Workspace
    • Create a Workspace
    • Folders
    • Settings
    • Billing
    • Use your own LLM API Keys
    • Secrets
    • Team and Permissions
  • Chat Agents (Legacy)
    • Agent Quick Start
    • Chat Agents
    • Integrate Agents
      • Widget
      • Client Web SDK
  • About
    • Ethical AI and IP Production
    • Principles
    • Security and Compliance
Powered by GitBook
On this page
  • How to Configure the Transcription Step
  • Transcription Model
  • Adding Your File into AirOps
  • Multiple Speakers?
  • Detect Language?
  • Language

Was this helpful?

  1. Building Workflows
  2. Workflow Steps
  3. AI

Transcribe Audio File

Transcribe audio or video files into text

Last updated 2 months ago

Was this helpful?

How to Configure the Transcription Step

When configuring a Transcription Step, there are two main pieces to consider:

  1. Selecting the best transcription model for your use-case

  2. How to pass your audio file to AirOps

Once you're ready to get started, you can click the "Configure" button of your Transcription Step to set these values.

Transcription Model

AirOps offers 5 transcription models at this time:

  • : fast, reliable transcription that includes built-in diarization (speaker identification). With the ability to auto detect language or set the language

  • : the fastest model to-date

  • : provides the best overall value

  • : higher accuracy and better word recognition. With the ability to auto detect language or set the language

  • : With the ability to select the number of speakers expected in a transcript, AssemblyAI is an excellent choice for diarization

Keep in mind: Deepgram has a 2GB file size limit and AssemblyAI has a 5GB file size limit

Adding Your File into AirOps

There are currently two methods for passing your audio or video files into AirOps.

Option #1: Upload via the AirOps UI

  • In the Start Step of your Workflow, define your Workflow Input as "File Media"

  • Add the input as the File to transcribe

Option #2: Upload via Google Drive

  • Within Google Drive, configure your audio or video file so that "Anyone with the link" can view:

  • Add an input with the variable name google_drive_link

  • Add a code step with the following Javascript to convert the shareable URL from Google Drive into a downloadable URL:

  • const fileID = google_drive_link.match(/[-\w]{25,}/);
    
    return `https://drive.google.com/uc?export=download&id=${fileID[0]}&confirm=t`
  • Add the output of the code step as the File to transcribe

Multiple Speakers?

If selected, the model will automatically detect multiple speakers. This will result in the following outputs from the model.

Speaker 0:

Speaker 1:

Speaker 0:

Speaker A:

Speaker B:

Speaker A:

Only AssemblyAI allows you to select the # of expected speakers. Without selecting # of speakers, the transcription may detect more (or fewer) speakers than expected

Detect Language?

Check to automatically detect the language of the file

Language

If Detect Language? is unchecked, you can specify the language you want to detect.

Not all models support multiple languages. Check out the documentation of each model below to determine which languages are supported

Deepgram Whisper Large
code
Deepgram Nova
Deepgram Nova 2
Deepgram Enhanced
code
AssemblyAI
Language OverviewDeepgram Docs
Supported languages | AssemblyAI Docs
Logo
Logo