Wishlist 0 ¥0.00

How to Turn Audio into Text: Best Free & Cheap Converters

Last time when I interviewed our guest Chris Pirillo, I needed an app that could convert an audio file with his speech into a text document. Frankly speaking, I wanted to save my time instead of boring typing each word that he had pronounced. So I surfed the Internet carefully and came across several good apps which could convert audio files (in MP3, WMA or M4A formats) into text docs automatically. Now I’m happy to share them with you.

1. VoiceBase

UPD: Voicebase used to be the best voice to text solution for many years. Unfortunately, since 2019 it’s no longer a free audio to text conveter. Now it provides API for audio transcription and speech analytics on the paid basis. So you’d better skip the part about Voicebase and try the tools below.

VoiceBase is an online voice to text transcription service for companies and individuals. Though, it mainly focuses on business clients, an ordinary user, like you and me, can convert a voice recording into a text file for free at VoiceBase. As for January 2016, each new user is granted a free account with $60 credit and up to 50 hours of audio storage. It costs about $0.01 to transcribe 10 second speech. VoiceBase uses smart voice recognition technology, so the quality of its machine audio transcript is high.

Obviously, the final text quality depends on original sound track and the speaker’s accent. VoiceBase understands US English pronunciation seamlessly. If a person speaks clearly, then the text is close to manually written. If an interviewer mumbles or lisps, then you’ll have to review the transcript or hire someone for text checkup. Fortunately, you can order human transcript right in your VoiceBase account. Moreover, you can turn video into text!

SEE ALSO: 200+ Useful Resources & Tools for Teachers & Students

This audio to text converter understands English, Dutch, French, German, Italian, Spanish (including Latin American version). In fact, VoiceBase is remarkable for quick and easy speech to text conversion. The website interface is clear and you smoothly go step by step:

    1. Go to www.voicebase.com and click the green Upload a file button in the middle of the screen.
    2. Create a free VoiceBase account. Provide your name, email address and click the Sign Up button. You have to confirm your account via email to get access to VoiceBase.
    3. Click the green Upload button at the top right corner.
    4. Add an audio or a video file of a supported format. If needed, to join video or audio parts together. Name your file, add a description, select the Machine Transcription, and a file sharing type (Private or Public).
      Tip: use Freemake Free Audio Converter to make a supported audio file for VoiceBase.
    5. Your file will be processed and you’ll be notified by email when it’s ready. Later, you can find the file at the My Content tab. For example, I’ve added a 10 minute audio interview in M4A format and it took about 15 minutes to convert it into a text file.
    6. When the text file is done, go to My Content tab in your VoiceBase account and click on the name of your file.Text version of audio file
    7. Check the Machine Transcript box right under your audio file.
    8. Copy the transcript and save it as text document.

Summary: VoiceBase is a fast online audio to text converter. Needless to say, it is suitable for everyone no matter what you need: an automatic or human speech to document conversion.

2. Dragon Dictation

Definitely, you may try another voice-to-text converter: Dragon Dictation. We dedicated a special article to it. In a few words, Dragon Dictation is completely different from VoiceBase. It pretends to be a universal speech recognition tool for Windows, Mac, iOS, Android and other platforms. Please note that the desktop version is paid ($75-150 for home users, $300 for enterprises), while the mobile apps are free for US & Canada.

Like Apple’s Siri, Dragon Dictation is capable of understanding what you say to it. However, the main focus of the app is to memorize your speech notes as a piece of text. It is easy to create documents of any length and edit, format and share them directly from your mobile device. Dragon can handle specialized industry vocabulary, and it comes with excellent features, such as the ability to transcribe text from an audio file you upload.

To do this, follow the steps:

  1. Open the software. From the DragonBar, select Tools>Transcribe Audio>Transcribe Recording.
    dragon dictation upload
  2. Click Select the speaker and select who the voice in the recording belongs to – Me or Someone else.
  3. In the Input audio file field, enter the file name of the recording and the directory path where it’s located, or click Browse to navigate to it.
    In the Output text file field, enter a file name for the transcribed output file and enter the directory path where you want to save it.
  4. Optionally deselect Automatically add commas and periods if you do not want Dragon to add this punctuation to the transcription, as the accuracy may degrade when this option is selected.
  5. Then follow the transcription wizard, it will prompt you to choose what you want to do next. Select the needed options and click Done.

Summary: Dragon Dictaion is much more than a simple audio to text converter. You should invest into it only if you’re sure to use dictation options on the regular basis. For occasional uses, it’s advisable to try a free program from the ones listed below.

3. Sonix.ai

Sonix.ai is an online app to trascribe audio. The free trial includes 30 minutes of free audio to text conversion. I think it’s enough for an occasional use. The developers provide a complete access to all the features with no credit card required. The only thing you need is to sign up, you may do this with your Google account just in one click. The premium account isn’t expensive (from $11.25 per month).

To convert a speech file into Word document, follow the steps:

  1. Drag and drop the audio (or video!) file into the browser window from your PC or choose the required file from your Dropbox or Google Drive.
    sonix
  2. While the file is being uploaded, choose the language spoken. Click the big blue button below.
  3.  Reply a few questions about the quality of the audio file (about background noise, etc.). Press Continue trascribing.
  4. Wait a bit while the text file is being prepared. After that, you may review and edit the text.
  5. Download the Word file to your PC, share online or save to your Google Drive.

Summary: Sonix.ai is brilliant for rare audio transcriptions. It provides a decent text quality and is not overloaded with feature. Definitely, a must have for picky users.

4. Inqscribe

Inqscribe is a transcription software for Windows, Mac OS. You can use it free with no license (with limited features) or instantly unlock all the features by purchasing a paid license ($99) or by requesting a 14-day trial.

Apart from audio files, you can also transcribe long video files including full-length movies, there is no time limit in all version. However, with a free one you won’t be able to save and download the resulted text file. Still you may copy the text to the clipboard.

Inscribe

The tool works in the same way as all the above mentioned. You need to add a multimedia file, choose a language and launch the audio to text conversion. InqScribe transcripts contain embedded timecodes that allow instant access to arbitrary times within the media file.

SEE ALSO: 5 Easiest Ways to Add Captions to Video Free and Fast

InqScribe also features a flexible editing environment, QuickTime and Windows Media support, customizable keyboard shortcuts for controlling media playback and inserting repetitive text, and a range of import and export options available in the paid version.

Summary: InqScribe is like a Swiss knife for creating captions and subtitles. You should try the evaluation version if you need to precisely transcribe a long video with further media export.

三款功能不逊于收费软件的开源备份工具,中小企业福音

虽然云计算给中小企业带来了很大便利,但是数据备份这件事情是无论是否上云都必须做的。前不久的腾讯云丢失客户数据事件便是很好的一个提醒。

备份,备份,一定要备份

但是大多数中小企业尤其是初创公司资金都是比较紧张的,那么开源备份软件是很好的选择。其实有的开源备份软件功能强大堪比收费的商业软件。下面就介绍三款比较成熟且知名的。

1、阿曼达AMANDA

Advanced Maryland Automatic Network Disk Archiver 的缩写。它允许系统管理员创建一个单独的备份服务器来将网络上的其他主机的数据备份到磁带驱动器、硬盘等介质上,典型的CS结构,支持对MySQL、Oracle数据库的在线备份。

操作系统:支持跨平台运行。备份级别:完全,差异,增量,合并。数据格式:开放(可以通过tar等工具恢复)。自动转换:支持。备份介质:支持磁带,磁盘和DVD。加密数据流:支持。数据库:支持MSSQL, Oracle。跨卷备份:支持。VSS(卷影复制):支持。许可:GPL, LGPL, Apache, Amanda License。

下载链接:amanda.org

2.Bacula

一个适用于异构网络的CS架构备份工具。

操作系统:支持跨平台运行。备份级别:完全,差异,增量,合并。数据格式:支持自定义且完全开放。自动转换:支持。备份介质:支持磁带,磁盘和DVD。加密数据流:支持。数据库:支持MSSQL、PostgreSQL、Oracle 。跨卷备份:支持VSS(卷影复制):支持。许可:Affero General Public License v3.0。

下载链接:bacula.org

3.Backuppc

可以用来备份基于Linux 和Windows 系统的主服务器硬盘。它配备了一个巧妙的池计划来最大限度的减少磁盘储存、磁盘 I/O 和网络I/O。

操作系统:支持Linux,Unix 和Windows。备份级别:支持完全和增量备份(rsync +hard 链接和pooling 计划)数据格式:开放。自动转换:N/A。备份介质:磁盘和磁盘阵列。加密数据流:支持。数据库:支持(通过Shell 脚本)跨卷备份:未知VSS(卷影复制):未知许可:GPL。

下载链接:backuppc.sourceforge.net

开源绿色又免费的录屏软件

1.ScreenToGif

国外的一款Gif动画录制工具。它是免费的开源软件,体积非常小巧,只有几百KB。用户可以使用它,录制电脑屏幕的各种画面,然后将其保存为gif、视频、png、psd等文件。值得一提的是,所有的录制内容,我们都可以使用软件自带的编辑器进行编辑,做删除帧、增加帧、重复播放、裁剪等操作。另外,我们还可以给不同帧增加字幕文本、添加水印、绘制图形等。可以说是,自定义性非常强。小编特别喜欢ScreenToGif的编辑器功能。除了录制的视频编辑,它还能直接导入视频进行编辑,然后另存为成gif格式。如此一来,我们以后不管需要什么gif图片,都能自己制作了。最重要的是,它不管是录制,还是编辑,最终的成品清晰度都不错,需要的朋友,可以去Github免费下载使用。

ScreenToGif

2.Captura (强烈推荐)

录制游戏视频解说、游戏精彩画面、课程教学、视频直播,就需要使用到视频录制工具,现在有很多的视频录制工具,很多都需要收费激活才能无限制录制,普通用户录制视频在时间、功能上会受到限制,不能很好地录制一个完美的视频,因此小编特意带了Captura这款屏幕录像工具,软件绿色小巧,简单易用,最重要的是完全免费使用,无任何限制,通过这款工具可以帮助你轻松录制各种视频。

支持全屏录制、区域录制两种方式,全屏录制可以录制全部的电脑屏幕,将电脑屏幕上所有的动态都录制下来,适用于课程教学视频录制,区域录制可以任意设置你需要录制的区域,可以录制屏幕上的任意区域。并且在可以设置录制视频的帧率和质量,以及音频大小,根据你的需要进行设置。除了视频录制功能之外,该软件还拥有视频编解码器,可以对视频进行解码,支持mp4、avi、GIF、webm等格式,可以满足一般的解码需要。还支持屏幕截图、剪贴板,简单的图像编辑等功能,是一款非常好用的屏幕录像工具,需要的朋友,可以去Github免费下载使用。

Captura

3.OBS Studio

一款非常知名,使用用户庞大的一款OBS直播软件,非常强大的免费开源无广告国外开发的软件(录屏只是其功能之一, 并且对于某些高端玩家既要录制屏幕又要录制摄像头选择这款),该软件的直播架构模式是采用开元的方式进行录制的,开源意味着可以让直播的用户随意选择自己的喜欢的直播模式,比如用户可以让观众看到指定的视频展现模式,从而使得直播的方式变得丰富,这充分考虑到了所有类型的直播,操作起来也是比较方便的,让用户可以随时进行直播的方式切换,操作非常方便,支持实时视频、音频的混合以及捕捉,可以轻松创建由图像,文本,浏览器窗口等多个来源组成的场景,还内置了丰富的插件,包括窗口捕获,文本,图像等直播常用的插件,让你的直播更方便,比传统OBS软件功能更加丰富,好用!

不管你是想要录制下软件的操作教程还是游戏的玩法视频,都可以通过obs视频录制来完成。它可以让你快速清晰地录制下每一个步骤哦!需要的朋友,可以去Github免费下载使用。

OBS Studio

硬盘结构,主引导记录MBR,硬盘分区表DPT,主分区、扩展分区和逻辑分区,电脑启动过程

硬盘结构
硬盘有很多盘片组成,每个盘片的每个面都有一个读写磁头。如果有N个盘片。就有2N个面,对应2N个磁头(Heads),从0、1、2开始编号。每个盘片的半径均为固定值R的同心圆再逻辑上形成了一个以电机主轴为轴的柱面(Cylinders),从外至里编号为0、1、2……。每个盘片上的每个磁道又被划分为几十个扇区(Sector),通常的容量是512byte,并按照一定规则编号为1、2、3……形成Cylinders×Heads×Sector个扇区。
                                     


主引导扇区
主引导扇区位于整个硬盘的0柱面0磁头1扇区{(柱面,磁头,扇区)|(0,0,1)},bios在执行自己固有的程序以后就会jump到MBR中的第一条指令。将系统的控制权交由mbr来执行。主引导扇区主要由三部分组成:主引导记录 MBR(Master Boot Record或者Main Boot Record)、硬盘分区表 DPT(Disk Partition Table)和结束标志字三大部分组成。


对于硬盘而言,一个扇区可能的字节数为128×2n (n=0,1,2,3)。大多情况下,取n=2,即一个扇区(sector)的大小为512字节。在总共512byte的主引导记录中,MBR的引导程序占了其中的前446个字节(偏移0H~偏移1BDH),随后的64个字节(偏移1BEH~偏移1FDH)为DPT(Disk PartitionTable,硬盘分区表),最后的两个字节“55 AA”(偏移1FEH~偏移1FFH)是分区有效结束标志。

主引导记录MBRmaster boot record
主引导记录中包含了硬盘的一系列参数和一段引导程序。其中的硬盘引导程序的主要作用是检查分区表是否正确并且在系统硬件完成自检以后引导具有激活标志的分区上的操作系统,并将控制权交给启动程序。MBR是由分区程序(如Fdisk)所产生的,它不依赖任何操作系统,而且硬盘引导程序也是可以改变的,从而能够实现多系统引导。

硬盘分区表DPTDisk Partition Table
硬盘分区表占据MBR扇区的64个字节(偏移01BEH--偏移01FDH),可以对四个分区的信息进行描述,其中每个分区的信息占据16个字节。具体每个字节的定义可以参见硬盘分区结构信息。



结束标志字
结束标志字55,AA(偏移1FEH- 偏移1FFH)是MBR扇区的最后两个字节,是检验主引导记录是否有效的标志。


电脑启动过程

  • 系统开机或者重启。
  • BIOS 加电自检 ( Power On Self Test -- POST )。BIOS执行内存地址为 FFFF:0000H 处的跳转指令,跳转到固化在ROM中的自检程序处,对系统硬件(包括内存)进行检查。
  • 读取主引导记录(MBR)扇区。当BIOS检查到硬件正常并与 CMOS 中的设置相符后,按照 CMOS 中对启动设备的设置顺序检测可用的启动设备。BIOS将相应启动设备的第一个扇区(也就是MBR扇区)读入内存地址为0000:7C00H 处。
  • 检查0000:7DFEH-0000:7DFFH(MBR的结束标志位)是否等于 AA55H,若不等于则转去尝试其他启动设备,如果没有启动设备满足要求则显示"NO ROM BASIC"然后死机。
  • 当检测到有启动设备满足要求后,BIOS将控制权交给相应启动设备。启动设备的MBR将自己复制到0000:0600H处, 然后继续执行。
  • 在主分区表中搜索标志为活动的分区,也就是检验磁盘分区表DPT的首字节是不是80H。如果检测到80H,则表示该分区为活动分区,将该活动分区的第一个扇区(操作系统引导记录区,Dos Boot Recorder,DBR)读入内存地址 0000:7C00H 处。
  • 检查0000:7DFEH-0000:7DFFH(DBR的结束标志位)是否等于 AA55H, 若不等于则显示 : "Missing Operating System" 然后停止。
  • 当检测到有分区满足要求后,MBR将控制权交给相应的活动分区。

for short:
BIOS -> 硬盘MBR -> 活动分区DBR -> 操作系统

主引导扇区与硬盘分区

从主引导扇区的结构可以知道,它仅仅包含一个64个字节的硬盘分区表。由于每个分区信息需要16个字节,所以对于采用MBR型分区结构的硬盘(其磁盘卷标类型为MS-DOS),最多只能识别4个主要分区。所以对于一个采用此种分区结构的硬盘来说,想要得到4个以上的主要分区是不可能的。这里就需要引出扩展分区了。扩展分区也是Primary partition的一种,但它与主分区的不同在于可以划分为无数个逻辑分区。

扩展分区中逻辑驱动器的引导记录是链式的。每一个逻辑分区都有一个和MBR的分区表结构类似的扩展引导记录(EBR),其分区表的第一项指向该逻辑分区本身的引导扇区,第二项指向下一个逻辑驱动器的EBR。对于Windows系统而言,一般都是只划分一个主分区给系统,剩余的部分全部划为扩展分区。

蓝色是主分区;绿、红、紫是逻辑分区;灰色包含着逻辑分区是扩展分区;

 

扩展分区表项的内容

扩展分区表项 分区表项的内容 第一个项 包括数据的开始地址在内的与扩展分区中当前逻辑驱动器有关的信息 第二个项 有关扩展分区中的下一个逻辑驱动器的信息,包括包含下一个逻辑驱动器的EBR的扇区的地址。如果不存在进一步的逻辑驱动器的话,该字段不会被使用 第三个项 未用 第四个项 未用


reference:
http://zh.wikipedia.org/zh-cn/%E4%B8%BB%E5%BC%95%E5%AF%BC%E6%89%87%E5%8C%BA
www.raid-recovery.org/Article/sjhfdoc/200404/1.htmlwww.pcguide.com/ref/hdd/file/structPartitions-c.html
www.msexchange.org/tutorials/Disk-Geometry.html

About Us

Since 1996, our company has been focusing on domain name registration, web hosting, server hosting, website construction, e-commerce and other Internet services, and constantly practicing the concept of "providing enterprise-level solutions and providing personalized service support". As a Dell Authorized Solution Provider, we also provide hardware product solutions associated with the company's services.
 

Contact Us

Address: No. 2, Jingwu Road, Zhengzhou City, Henan Province

Phone: 0086-371-63520088 

QQ:76257322

Website: 800188.com

E-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.