Wav file to text converter

Wav file to text converter how to#

It’s pretty well done but requires a specific installation of FFmpeg.Īfter copying and tweaking bits from similar projects, I came up with the following code: 'use strict' const ffmpeg = require('fluent-ffmpeg') const mime = require('mime') const fs = require('fs') module. With our audio to text converter you can transcribe audio and video twice as fast while keeping costs under 0.10 per minute of audio. I found a node module that wraps FFmpeg (fluent-ffmpeg). So I had to code a node.js version of this command. Though I’m still not sure if the last line really means something or was auto-generated by a Twitter bot, I downloaded the FFmpeg binary for my platform, tried the command and it worked!

Wav file to text converter how to#

The Google Speech API documentation specifies that the expected input format is LINEAR16, but never really explains concretely how to convert an existing file to this format. Like instructed, save the JSON file generated with your credentials locally, and create a copy at the root of your project (as credentials.json for example). Export and share your MP3 file in a range of formats such Word, PDF, Avid DS, SubRip, WebVTT. Make adjustments to the transcription where needed. We can transcribe an hour long file in less than 15 minutes. You will have to create a service account for your application, so follow the instructions at Creating a Service Account. We use industry leading artificial intelligence to transcribe your MP3 file. I won’t really go into details for this, just go to and follow the steps for signing up.

Display the transcription into the console.

Send the uploaded file to the Speech API.Upload the converted file to Google Storage.Let’s split the problem into simple tasks:

Audio files that last more than 1 minute must be uploaded to Google Storage, you can’t send them to the Google Speech API directly.The expected input format for the Google Speech API is LINEAR16 PCM (.wav), not m4a.This created the following issues for the project: