Speech recognition of text in Ubuntu using Google Docs
There are not many speech recognition software available on Linux systems, especially native desktop applications. There are some applications available to convert speech to text using IBM Watson and other APIs, but they are not user-friendly and require some complex user interaction, such as some programming or scripting in the corresponding language.
However, not many users know that Google Docs uses its own AI technology to provide advanced speech recognition, which can be used by accessing Google Docs through Chrome.
Any user can use this feature to convert speech to text without advanced computer knowledge. The best thing about this feature of Google Docs is that you can use it on any Ubuntu derivative or any Linux distribution that supports Chrome.
Let’s see how to enable it in Ubuntu.
How to Convert Speech to Text
The prerequisite is that you should have Chrome installed in your system and have a Google account. If you do not have Chrome installed, you can visit this link and download and install Chrome.
Also, if you don’t have a Google account, you can create one for free using This link.
Step 1
Open https://www.php.cn/link/de535e267c10a7c88f2ed4283e8484da from Chrome and create a blank document.
Create an empty document
Step 2
After loading the blank document, click Tools > Speech from the menu enter".
Enable voice input
Step 3
On the left side, you can see a microphone icon. Click the microphone icon and Chrome will ask for permission to access the microphone through the browser for the first time. Click Allow.
Click the microphone
Allow documents to access the microphone
Default , it uses your system language as the detection language for speech while converting it to text; however, you can change it to any language you want based on the list of available languages. So far, Google Docs supports and recognizes more than 60+ languages while converting them into text.
Step 4
After clicking Allow, the microphone icon will turn orange and it is now ready to accept or recognize your voice. Start saying whatever you want and voila! You will see your speech converted to text and written to the document.
Voice to text in progress
Complete. You have successfully converted speech to text in Ubuntu via Google Chrome and Google Docs.
All Linux users can take advantage of this great feature for free. If you know of other apps that can convert speech to text in Linux, please leave a comment in the comment section below. Also, let me know if you found this article useful.
Troubleshooting
If the above features are not working in your browser, be sure to check out the following.
- Open the settings window (in the GNOME desktop of Ubuntu or other distributions).
- Go to Privacy > Microphone.
- And make sure it is enabled.
Checking Microphone Settings in Ubuntu
Summary
Although, there is a cloud-based solution recently Available on Amazon Polly, etc. But they come at a high price. In addition, some useful knowledge is required.
And Google Chrome’s built-in speech recognition feature is simple and easy to use. Although it's a bit slow, it gets the job done for the average user.
That said, I hope this guide helps you convert speech to text.
The above is the detailed content of Speech recognition of text in Ubuntu using Google Docs. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

VS Code system requirements: Operating system: Windows 10 and above, macOS 10.12 and above, Linux distribution processor: minimum 1.6 GHz, recommended 2.0 GHz and above memory: minimum 512 MB, recommended 4 GB and above storage space: minimum 250 MB, recommended 1 GB and above other requirements: stable network connection, Xorg/Wayland (Linux)

The five basic components of the Linux system are: 1. Kernel, 2. System library, 3. System utilities, 4. Graphical user interface, 5. Applications. The kernel manages hardware resources, the system library provides precompiled functions, system utilities are used for system management, the GUI provides visual interaction, and applications use these components to implement functions.

vscode built-in terminal is a development tool that allows running commands and scripts within the editor to simplify the development process. How to use vscode terminal: Open the terminal with the shortcut key (Ctrl/Cmd). Enter a command or run the script. Use hotkeys (such as Ctrl L to clear the terminal). Change the working directory (such as the cd command). Advanced features include debug mode, automatic code snippet completion, and interactive command history.

To view the Git repository address, perform the following steps: 1. Open the command line and navigate to the repository directory; 2. Run the "git remote -v" command; 3. View the repository name in the output and its corresponding address.

Although Notepad cannot run Java code directly, it can be achieved by using other tools: using the command line compiler (javac) to generate a bytecode file (filename.class). Use the Java interpreter (java) to interpret bytecode, execute the code, and output the result.

Writing code in Visual Studio Code (VSCode) is simple and easy to use. Just install VSCode, create a project, select a language, create a file, write code, save and run it. The advantages of VSCode include cross-platform, free and open source, powerful features, rich extensions, and lightweight and fast.

The main uses of Linux include: 1. Server operating system, 2. Embedded system, 3. Desktop operating system, 4. Development and testing environment. Linux excels in these areas, providing stability, security and efficient development tools.

Causes and solutions for the VS Code terminal commands not available: The necessary tools are not installed (Windows: WSL; macOS: Xcode command line tools) Path configuration is wrong (add executable files to PATH environment variables) Permission issues (run VS Code as administrator) Firewall or proxy restrictions (check settings, unrestrictions) Terminal settings are incorrect (enable use of external terminals) VS Code installation is corrupt (reinstall or update) Terminal configuration is incompatible (try different terminal types or commands) Specific environment variables are missing (set necessary environment variables)
