Google Cloud Platform

Cloud Speech

Google cloud API service enables the features of converting audio to text format. Google says that this API can support over 110 languages. This technology is already developed and also used in many Google products like Google search engines, Google Indic keyboard, and many more. The process of conversion of audio to text is based on deep neural networks and algorithms. It can also recognize the format of the audio file. If file is not in a proper format then we can also convert into the specific format by help of tools like 'FFMpeg'. This API also enables the voice commands for apps and services. By the help of this speech API one user can use voice command and can make a call or open an app like the compass, map, message, camera etc. To use this API we just need to enable microphone and control it through our voice or audio files. This API can return recognized text from audio stored in a file. This API also handles noisy audio from various environments as it doesn't need signal processing. This API is supported by almost all devices including phone, tablet, desktop, PC's, car, TV speaker etc.


In real world based scenario the Google speech API help people in many ways like, while driving we can command Google speech engine to open the navigation or music system. It helps user to reach destination safely.


This deployment manager uses three types of files.

  • A configuration file that was written in YAML.
  • Template file that is written in python or Ninja2.
  • Schema files that define rules for the templet.

Features

Automatic speech recognition

This is a feature where the voice will be automatically transcribed into text format.

Real-time audio support

In the real-time, we can capture our voice from a microphone or we can also use the pre recorded audio file. Multiple audio formats are supported.

Automatic speech recognition

This is a feature which the automatically transcribed voice input into text format.

Language

It can supports over 110 language vocabulary.

Noise

It can handle audio in noisy environments.

Streaming recognition

It can detect language and return results while user still speaking.

Inappropriate content and word hint

It can filter inappropriate text from the results and give suggestions for limited language. It also gives us hint, when we should record speech for text enhancement.

Integrated API

We can integrate this API to services like cloud storage, Google search API, translate API etc.


Must Read Article

How to launch a VM instances in Google Cloud

How to launch a VM instances in Google Cloud

In this tutorial we are going to launch a Compute Egine instance.

How to install wordpress in VM instances in Google Cloud

How to install WordPress on VM instance in Google Cloud

Before you start installing WordPress on VM instance on Google Cloud. You must log in into the machine. You can launch a VM Instance on Google Cloud using this tutorial. After launch of machine log in into the machine using SSH. If you do not know how to SSH in VM instance you can follow this tutorial.

How to create a VM instance in Azure

How to create a VM instance in Azure

In this Tutorial we are going to launch a Virtual Machine in azure.

How to install WordPress on VM instance in Azure

How to install WordPress on VM instance in Azure

Before you start installing WordPress on VM instance on Azure. You must log in into the machine. You can launch a VM Instance in Azure using this tutorial. After launch of machine log in into the machine using SSH. If you do not know how to SSH in VM instance you can follow this tutorial.

How To Launch EC2 Machine

How To Launch EC2 Machine

In this article we are going to learn how to launch a EC2 instance. For this you need to have AWS account.

How to install WordPress on ec2 machine

How to install WordPress on ec2 machine

Before you start installing WordPress on ec2 machine. You must log in into the machine.