Automatic audio to title

keruxjeff wrote on 1/14/2024, 4:07 PM

I am looking for a simple solution to take spoken audio and convert it to titles (not captions). Is this a possibility with any of the Magix products?

Intel i7-3610QM @ 2.30GHz; Microsoft Windows NT 6.2.9200.0 64-bit; 1 TB Samsung SSD 860 EVO Drive; Intel(R) HD Graphics 4000 1600x900; Windows 10, Version 1909 (OS Build 18363.476); VPX 11 upgraded from MAGIX Movie Edit Pro MX Premium Download Version.


johnebaker wrote on 1/15/2024, 2:02 AM



None of the Magix video products have Speech to Text features.

For good speech-to-text conversion you will need third-party software, which can get expensive, e.g. Nuance's Dragon Professional (formerly known as Dragon NaturallySpeaking).

For one-off conversions there are several online products available; however, accuracy and text quantity can be an issue with the 'free' and low-cost options.

All will provide a text file which you then have to copy/paste into a title object in the program.
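
If the text file is long, it may help to break it into title-sized pieces before pasting. The following is only a rough sketch in Python, not a Magix feature: the function name `split_into_titles` and the limits of 40 characters per line and 2 lines per title are assumptions chosen for illustration, so adjust them to suit your title template.

```python
import textwrap


def split_into_titles(transcript: str, max_chars: int = 40, max_lines: int = 2) -> list[str]:
    """Split plain transcript text into short blocks, one block per title object.

    max_chars and max_lines are illustrative defaults, not program limits.
    """
    # Collapse all whitespace so line breaks from the transcription tool do not matter.
    text = " ".join(transcript.split())
    # Wrap on word boundaries so that no line exceeds max_chars.
    lines = textwrap.wrap(text, width=max_chars)
    # Group the wrapped lines into blocks of at most max_lines lines each.
    return ["\n".join(lines[i:i + max_lines]) for i in range(0, len(lines), max_lines)]


if __name__ == "__main__":
    sample = "This is a short piece of spoken audio that has been converted to text."
    for number, block in enumerate(split_into_titles(sample), start=1):
        print(f"--- title {number} ---")
        print(block)
```

Each block it prints can then be pasted into its own title object by hand.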


John EB
Forum Moderator

VPX 15, Movie Studio 2023, and earlier versions 2015 and 2016, Music Maker Premium 2024.

PC - running Windows 11 22H2 Professional 64bit on Intel i7-8700K 3.2 GHz, 16GB RAM, RTX 2060 6GB 192-bit GDDR6, 1 x 1Tb Sabrent NVME SSD (OS and programs), 2 x 4TB HDD (Data) internal HDD + 1TB internal SSD (Work disc), + 6 ext backup HDDs.

Laptop - Lenovo Legion 5i Phantom - running Windows 11 22H2 on Intel Core i7-10750H, 16GB DDR4-SDRAM, 512GB SSD, 43.9 cm screen Full HD 1920 x 1080, Intel UHD 630 iGPU and NVIDIA GeForce RTX 2060 (6GB GDDR6)

Sony FDR-AX53e Video camera, DJI Osmo Action 3 and Sony HDR-AS30V Sports cams.

Gid wrote on 1/15/2024, 4:12 AM

@keruxjeff You could upload your video to YouTube, where you can download a .vtt, .srt, or .sbv file, or you can copy the transcript (the button is in the description). SRT files do load into Vegas, but as mentioned, these files don't import into Magix (I don't think they do, anyway).
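
If you go the .srt route, the file is plain text, so the wording and timings can be pulled out for pasting into titles by hand. Below is a minimal sketch in Python, assuming the usual SRT layout (a cue number, a timing line such as `00:00:01,000 --> 00:00:04,000`, then the text, with a blank line between cues). The function name `parse_srt` is made up for this example and nothing here talks to Magix or YouTube.

```python
import re

# An SRT timing line looks like: 00:00:01,000 --> 00:00:04,000
TIMING = re.compile(
    r"(\d{2}:\d{2}:\d{2}[,.]\d{3})\s*-->\s*(\d{2}:\d{2}:\d{2}[,.]\d{3})"
)


def parse_srt(srt_text: str) -> list[tuple[str, str, str]]:
    """Return a list of (start, end, text) tuples, one per subtitle cue."""
    cues = []
    # Normalise Windows line endings, then split on the blank lines between cues.
    normalised = srt_text.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalised):
        lines = block.split("\n")
        for i, line in enumerate(lines):
            match = TIMING.search(line)
            if match:
                # Everything after the timing line is the spoken text of the cue.
                text = " ".join(part.strip() for part in lines[i + 1:] if part.strip())
                cues.append((match.group(1), match.group(2), text))
                break
    return cues


if __name__ == "__main__":
    sample = (
        "1\n00:00:01,000 --> 00:00:03,500\nHello there\n\n"
        "2\n00:00:04,000 --> 00:00:06,000\nrow the boat\ngently\n"
    )
    for start, end, text in parse_srt(sample):
        print(f"{start} to {end}: {text}")
```

The start and end times tell you where on the timeline each title should sit; the text still has to be checked for transcription mistakes before use.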

Vegas Pro 21 
Magix Movie Studio Platinum 2024 - from 2004-ish - Latest version,
Magix VPX14
Boris Continuum & Sapphire, 
Silhouette Standalone + Plugin, 
Mocha Pro Standalone + Plugin, 
Boris Optics,
Desktop PC Microsoft Windows 10 Pro - 64-Bit
AMD Ryzen Threadripper PRO 3975WX 3.5GHz 32 Core
Corsair iCUE H150i RGB PRO XT 360mm All-in-One Liquid CPU Cooler
RAM 256GB ( 8x Micron 32GB (1x 32GB) 2666MHz DDR4 RAM )
2x Western Digital Black SN850 2TB M.2-2280 SSD, 7000MB/s Read, 5100MB/s Write
(programs on one, project files on the other)
Graphics MSI GeForce RTX 3090 SUPRIM X 24GB OC GPU
ASUS ROG Thor 1200W Semi-Modular 80+ Platinum PSU 
Fractal Design Define 7 XL Dark TG Case with 3 Fans
Dell SE3223Q 31.5 Inch 4K UHD (3840x2160) Monitor, 60Hz, 

I'm 55, I've been a UK joiner for 39yrs, apprentice trained & time served, I make videos about my work & post them on Youtube so that I can share what I've learnt over the yrs,

At the moment my filming is done with a Samsung Galaxy S23 Ultra 

JeanCollier wrote on 1/31/2024, 6:34 AM

Transcribe is an application that can assist you with this. Also, if you want to know more about the application, ask filmy4wap web experts. They have done this well.

CubeAce wrote on 1/31/2024, 6:58 AM

@JeanCollier @johnebaker @Gid @keruxjeff

Personally, I hate any form of automatic transcription, as it can often throw up anomalies whichever way it is employed.

Not knowing the differences between row, row, and roe is just one example of many. Most people who use such services on YouTube never seem to check their work afterwards, or are oblivious to the mistakes.

Then again, I may just be showing my age, as words such as 'rareerer' and 'mostest' seem to be encroaching on speech more in some cultures. I know language evolves over time, and when I was younger, and even now, some of the grammatical errors that I can accept and that do not bother me are grating to people older than myself, but it is annoying to start to find myself becoming a part of that group of discontents.



Windows 10 Enterprise. Version 22H2 OS build 19045.3086. Direct X 12.1 latest hardware updates for Western Digital hard drives.

Asus ROG STRIX Z390-F Gaming motherboard Rev 1.xx with Supreme FX inboard audio using the S1220A codec. Driver No 6.0.8960.1 Bios version 1401

Intel i9900K Coffee Lake 3.6 to 5.1GHz CPU with Intel UHD 630 Graphics .Driver version with 64GB of 3200MHz Corsair DDR4 ram.

1000 watt EVGA modular power supply.

1 x 250GB Evo 970 NVMe: drive for C: drive backup 1 x 1TB Sabrent NVMe drive for Operating System / Programs only. + x2 WD BLACK 2TB internal SATA 7,200rpm hard drives.1 for internal projects, 1 for Library clips/sounds/music/stills./backup of working projects. 1x500GB SSD current project only drive, 1x WD RED 2TB drive for latest footage storage. Total 16TB of five external WD drives for backup.

ASUS NVIDIA GeForce RTX 3060 12GB. nVidia Studio driver version 551.23. - 3584xCUDA cores Direct X 12.1. Memory interface 192bit Memory bandwidth 360.05GB/s 12GB of dedicated GDDR6 video memory, shared system memory 16307MB PCi Express x8 Gen3. Two Samsung 27" LED SA350 monitors with 5000000:1 contrast ratios at 60Hz.

Running MEP Premium and VPX (UDP3)

M Audio Axiom AIR Mini MIDI keyboard Ver

VPX 14, MEP 2022, Vegas Studio 16, Vegas Pro 18, Cubase 4. CS6, NX Studio, Mixcraft 9 Pro.

Audio System 5 x matched bi-wired 150 watt Tannoy Reveal speakers plus one Tannoy 15" 250 watt sub with 5.1 class A amplifier. Tuned to room with Tannoy audio application.

Ram Acoustic Studio speakers amplified by NAD amplifier.

Rogers LS7 speakers run from Cambridge Audio P50 amplifier

Schrodinger's Backup. "The condition of any backup is unknown until a restore is attempted."

emmrecs wrote on 1/31/2024, 7:56 AM


Can you please explain more about your post? What is it that needs to be turned off?

Without a lot more context from you, it is likely that your post could be hidden by a moderator since, at the least, it appears to have nothing to do with the original question asked.

Forum Moderator

Win 10 Pro 64 bit, Intel i7 Quad Core 6700K @ 4GHz, 32 GB RAM, NVidia GTX 1660TI and Intel HD530 Graphics, MOTU 8-Pre f/w audio interface, VPX, MEP, Music Maker, PhotoStory Deluxe, Photo Manager Deluxe, Xara 3D Maker 7, Reaper, Adobe Audition 3, CS6 and CC, 2 x Canon HG10 cameras, 1 x Canon EOS 600D, Akaso EK7000 Pro Action Cam