
Speech analytics based on Yandex SpeechKit ML technologies with Kazakh language support

Yandex SpeechKit supports 16 languages, including Russian, Kazakh, and Uzbek. To train recognition models and improve speech recognition quality, Yandex Cloud developers use thousands of hours of audio for each language.

SpeechKit speech ML models can be deployed on your own infrastructure. We offer both hybrid options and processing of 100% of traffic within your environment.

The Yandex SpeechKit service provides high-quality recognition of Kazakh speech, as well as mixed Kazakh-Russian speech.

Since 2023, Sanatel Consulting has been a Yandex partner in speech technologies.

A demo version of the solution is available via this link.

Applications of Speech Analytics

Analytics is useful for all customer-focused businesses, but especially effective in sectors with a high volume of calls: banks, insurance companies, online stores, medical centers, delivery services, and various call centers.

Script compliance monitoring – ensuring adherence to sales scripts, regulations, and phone communication standards.
Dissatisfaction analysis – identifying cases of customer dissatisfaction.
Needs identification – detecting and organizing customer needs.
Data accumulation – collecting historical communication data for future use, such as analytics by new criteria or training an AI bot.

Speech analytics presentation slides based on Yandex SpeechKit:

Why Speech Analytics is Needed

Speech analytics enables monitoring of 100% of phone calls and automatic evaluation of employee performance, script compliance, and call standards. It also helps identify dissatisfied customers and reasons for customer churn.

Using speech analytics helps to:

Reduce labor and operational costs;
Shorten onboarding time for new employees;
Increase sales volume;
Proactively respond to dissatisfied clients;
Improve customer communication and loyalty.

How Speech Analytics Works

The “Speech Analytics” solution is deployed on the client’s server or in the cloud. It integrates with the client’s telephony system, retrieves voice recordings from the sales department or contact center, transcribes speech into text, and analyzes the text for script adherence, negative sentiment, and prohibited or filler words. It then generates reports, calculates employee ratings, and provides individual and departmental analytics.

Telephony integration – retrieves recordings and transcribes them into text.
Text analysis – searches by text, using dictionaries and script fragments.
Reporting – calculates operator ratings, generates reports and charts for individuals or teams.
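
The three stages above can be sketched as a small pipeline. This is only an illustration under stated assumptions, not the product's implementation: the `Recording` fields, the function names, and the rating rule (the share of script fragments found in a call, averaged per operator) are hypothetical, and `transcribe` is a stand-in callable for whichever speech-to-text step is actually used.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


@dataclass
class Recording:
    operator: str    # hypothetical field: operator who handled the call
    audio_path: str  # hypothetical field: where telephony stored the audio


def score_transcript(text: str, script_fragments: List[str], prohibited: List[str]) -> dict:
    """Check one transcript against script fragments and a prohibited-word dictionary."""
    lowered = text.lower()
    found = [f for f in script_fragments if f.lower() in lowered]
    violations = [w for w in prohibited if w.lower() in lowered]
    # Assumed metric: share of required script fragments present in the call.
    adherence = len(found) / len(script_fragments) if script_fragments else 0.0
    return {"adherence": adherence, "violations": violations}


def operator_ratings(
    recordings: Iterable[Recording],
    transcribe: Callable[[str], str],
    script_fragments: List[str],
    prohibited: List[str],
) -> Dict[str, float]:
    """Average script adherence per operator (an assumed rating rule)."""
    scores = defaultdict(list)
    for rec in recordings:
        text = transcribe(rec.audio_path)  # stand-in for the speech-to-text step
        result = score_transcript(text, script_fragments, prohibited)
        scores[rec.operator].append(result["adherence"])
    return {op: sum(vals) / len(vals) for op, vals in scores.items()}


# Usage with a fake transcriber in place of a real speech-to-text call.
fake_transcripts = {
    "a.wav": "Good afternoon, my name is Anna. How can I help you?",
    "b.wav": "Good afternoon. Whatever, call back later.",
}
calls = [Recording("anna", "a.wav"), Recording("anna", "b.wav")]
ratings = operator_ratings(
    calls, fake_transcripts.get, ["good afternoon", "my name is"], ["whatever"]
)
print(ratings)  # {'anna': 0.75}
```

Passing the transcriber in as a callable keeps the analysis and reporting logic independent of the recognition service, so it can be tested without audio.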

Video overview of speech analytics based on Yandex SpeechKit:

Quantitative Analysis

Quantitative text analysis focuses on numerical data and statistics to evaluate text by measurable characteristics. The main goal is to objectively measure certain elements of text, such as words and phrases, without interpreting their meaning.

The following symbols can be used to search for words and phrases:

"*" – matches any ending of a word in the phrase.
"<>" – allows the words on either side to appear in either order.
"/" – separates alternative words for the same position in the phrase.

For example, the pattern "добрый<>день/вечер" (Russian for "good afternoon" / "good evening") matches four phrase variations:

добрый день
добрый вечер
день добрый
вечер добрый
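
The expansion above can be reproduced with a short sketch. It assumes one plausible reading of the symbols, not the product's actual parser: `<>` separates groups whose order may be swapped, `/` separates alternatives within a group, and a trailing `*` on a word matches any ending. The function names are hypothetical.

```python
import itertools
import re
from typing import List


def expand_pattern(pattern: str) -> List[str]:
    """Expand '<>' (swappable order) and '/' (alternatives) into concrete phrases."""
    slots = [slot.split("/") for slot in pattern.split("<>")]
    phrases: List[str] = []
    for combo in itertools.product(*slots):          # pick one alternative per slot
        for order in itertools.permutations(combo):  # try every slot order
            phrase = " ".join(order)
            if phrase not in phrases:
                phrases.append(phrase)
    return phrases


def phrase_to_regex(phrase: str) -> "re.Pattern[str]":
    """Turn a phrase into a regex; a trailing '*' on a word matches any ending."""
    parts = []
    for word in phrase.split():
        if word.endswith("*"):
            parts.append(re.escape(word[:-1]) + r"\w*")
        else:
            parts.append(re.escape(word))
    return re.compile(r"\b" + r"\s+".join(parts) + r"\b", re.IGNORECASE)


def matches(pattern: str, text: str) -> bool:
    """True if any expansion of the pattern occurs in the text."""
    return any(phrase_to_regex(p).search(text) for p in expand_pattern(pattern))


variants = expand_pattern("добрый<>день/вечер")
print(variants)
print(matches("добрый<>день/вечер", "Вечер добрый, чем могу помочь?"))
print(matches("спасиб*", "Большое спасибо за звонок"))
```

Expanding the pattern into plain phrases first keeps the matching step simple: each variant is checked independently, and the list of variants can be shown to the user for review.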

Semantic Analysis

Semantic analysis using GPT-based classifiers enables a deeper understanding and structuring of textual data by identifying topics, patterns, and categories. The GPT model, trained on vast datasets, classifies text into specified categories such as “Product novelty,” “Creating urgency,” and “Summary.”

To train a GPT classifier, a dataset of “correct” phrases is created with at least 100 examples, preferably up to 10,000. For instance, to train a GPT classifier on “Creating urgency,” the following phrases are selected from collected dialogues:

[
	"This product is in high demand. Are you sure you can't come soon? It might be out of stock tomorrow.",
	"Loan conditions change every month. You might miss a good deal if you delay the purchase.",
	"If we don't place the order for you soon, you'll have to wait for the next shipment, which takes a long time."
]

The trained GPT classifier then evaluates new dialogues and highlights those that are semantically similar to the reference phrases.
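
Preparing the reference dataset described above can be sketched as follows. This is a minimal illustration, not the product's training procedure: the `build_dataset` function, the JSON Lines output, and the `label` and `text` field names are assumptions. Only the lower bound of 100 examples comes from the text.

```python
import json
from typing import Iterable, List

MIN_EXAMPLES = 100  # lower bound on dataset size stated in the text


def build_dataset(label: str, phrases: Iterable[str], min_examples: int = MIN_EXAMPLES) -> List[str]:
    """Deduplicate reference phrases and return them as JSON Lines records."""
    unique: List[str] = []
    seen = set()
    for phrase in phrases:
        cleaned = " ".join(phrase.split())  # collapse stray whitespace
        key = cleaned.lower()
        if cleaned and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    if len(unique) < min_examples:
        raise ValueError(
            f"{label!r}: {len(unique)} unique phrases, need at least {min_examples}"
        )
    # Hypothetical record layout: one labeled phrase per line.
    return [json.dumps({"label": label, "text": p}, ensure_ascii=False) for p in unique]


# Usage: the threshold is lowered here only to keep the demonstration short.
sample = [
    "This product is in high demand. It might be out of stock tomorrow.",
    "Loan conditions change every month.",
    "loan conditions  change every month.",  # duplicate after normalization
]
for line in build_dataset("Creating urgency", sample, min_examples=2):
    print(line)
```

Checking the size after deduplication matters because repeated phrases collected from many dialogues would otherwise inflate the example count.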

Video – GPT classifiers in the sales department:

What Sections the Analytics Includes

Speech Transcription Section

Listen to voice recordings from the telephony system alongside the dialogue transcript. Clicking a dialogue line jumps to the corresponding audio timestamp.

Matches found using dictionaries and script validation results are highlighted in the dialogue text.

Call statistics: speech rate, pauses, whether the manager interrupted the client.
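
As an illustration of how such statistics might be derived, the sketch below computes them from timestamped transcript segments. The `Segment` structure, the speaker labels, the 2-second pause threshold, and the definition of an interruption (the manager starts speaking before the client's segment ends) are all assumptions, not the product's documented rules.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Segment:
    speaker: str  # assumed labels: "manager" or "client"
    start: float  # seconds from the beginning of the call
    end: float
    text: str


def call_statistics(segments: List[Segment], pause_threshold: float = 2.0) -> dict:
    """Compute speech rate per speaker, long pauses, and manager interruptions."""
    ordered = sorted(segments, key=lambda s: s.start)

    words: Dict[str, int] = {}
    seconds: Dict[str, float] = {}
    for seg in ordered:
        words[seg.speaker] = words.get(seg.speaker, 0) + len(seg.text.split())
        seconds[seg.speaker] = seconds.get(seg.speaker, 0.0) + (seg.end - seg.start)
    rate = {sp: words[sp] * 60 / seconds[sp] for sp in words if seconds[sp] > 0}

    pauses: List[float] = []
    interruptions = 0
    for prev, cur in zip(ordered, ordered[1:]):
        gap = cur.start - prev.end
        if gap >= pause_threshold:  # silence between consecutive segments
            pauses.append(gap)
        if prev.speaker == "client" and cur.speaker == "manager" and cur.start < prev.end:
            interruptions += 1      # manager started before the client finished
    return {"words_per_minute": rate, "pauses": pauses, "manager_interruptions": interruptions}


stats = call_statistics([
    Segment("client", 0.0, 4.0, "Hello I want to order a phone"),
    Segment("manager", 3.0, 6.0, "Yes of course"),
    Segment("client", 9.0, 10.0, "ok"),
])
print(stats)
```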

The solution can be integrated with the customer's CRM system so that users can navigate directly from a call to the corresponding lead or deal.

Dictionaries Section

Dictionary management: add or remove words. Upload word lists to dictionaries from a text file.

Scripts Section

Fragment management: edit, add, or delete words in fragments.

Script management: edit scripts, add or remove fragments. Configure script parameters and assign scripts to operator groups or departments.

Reports Section

“Summary Metrics” report – dialogue analysis over a period, general statistics.

“Average Score by Dictionaries” report – evaluates managers over a period, comparing dictionary scores.

“Script Execution by Employees” report – analyzes script adherence over time, compares manager ratings.

“Fragment Statistics” report – analyzes detected script fragments in dialogues. Ranks most common matches.

“STT Billing” report – analyzes speech-to-text recognition costs.

Other Section

Telegram Notifications – logs of Telegram alerts about matches from critical dictionaries, e.g. “Complaint” dictionary.
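
A notification of this kind could be assembled roughly as shown below. The sketch uses the public Telegram Bot API `sendMessage` method with its `chat_id` and `text` parameters. The function names, the message wording, the call identifier, and the substring-based dictionary check are hypothetical. The request is only built here, not sent.

```python
import urllib.parse
import urllib.request
from typing import List


def find_critical_matches(transcript: str, dictionary: List[str]) -> List[str]:
    """Return dictionary entries that occur in the transcript (simple substring check)."""
    lowered = transcript.lower()
    return [w for w in dictionary if w.lower() in lowered]


def build_alert_request(
    bot_token: str, chat_id: str, call_id: str, dictionary_name: str, matches: List[str]
) -> urllib.request.Request:
    """Build a Telegram Bot API sendMessage request for a critical-dictionary match."""
    joined = ", ".join(matches)
    text = f'Call {call_id}: dictionary "{dictionary_name}" matched: {joined}'
    payload = urllib.parse.urlencode({"chat_id": chat_id, "text": text}).encode("utf-8")
    url = f"https://api.telegram.org/bot{bot_token}/sendMessage"
    return urllib.request.Request(url, data=payload)  # data present, so this is a POST


# Placeholder token, chat, and call identifiers for illustration only.
found = find_critical_matches("I want to file a complaint", ["complaint", "refund"])
if found:
    request = build_alert_request("TOKEN", "42", "c-1", "Complaint", found)
    print(request.full_url)
    # urllib.request.urlopen(request) would send the alert; omitted in this sketch.
```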

Users and roles in the system, access rights to sections, editing permissions.

System service log: speech recognition logs and technical error messages.

Future Prospects After Implementing Speech Analytics

Speech analytics accumulates data from company-client communications. In the future, the large volume of collected data can be used to train the company’s voice bot.

The Yandex SpeechKit module can be used not only for speech recognition but also for speech synthesis when implementing an intelligent bot within the company.