Нутқ товушларини таниш алгоритмини ишлаб чиқиш
- № 3 (51) 2019
Страницы:
19
–
29
Язык: узбекский
Аннотация
В данной статье описан один из способов распознавания слов узбекского языка. На первом этапе была создана спектрограмма речевого сигнала на основе обработки в частотной области. На втором этапе была выполнена параметризация на спектрограмме и проведен корреляционный анализ с использованием сравнения речевых звуков.
Ушбу мақолада ўзбек тили нутқ товушларини таниш усулларидан бири келтирилган. Биринчи босқичда нутқ сигналларини частота соҳасида қайта ишлаш асосида спектрограмма тасвирлари ҳосил қилинган. Иккинчи босқичда ҳосил қилинган спектрограмма тасвирларини дастлабки қайта ишлаш асосида параметрлаш ва улар асосида нутқ товушларини бир-бири билан солиштиришнинг корреляцион таҳлил амалга оширилган.
This article outlines the algorithm for speech recognition. The article outlines the stages of the speech spectrogram image. The speech spectrum image is the richest parameter that characterizes speech. High accuracy can be achieved by recognizing the speech in other ways by processing the resulting spectrogram image. Today, many speech recognition algorithms use the speech spectrogram. An important aspect of familiarity with the speech spectrogram is the transition from one-dimensional signal to two-dimensional. The main parameters characterizing the speech during the transition are separated using spectral transformation methods. As the basic parameters, the main tone frequency serves as a form. There are many effective ways to get to knowing speech, and most commonly, this speech is familiar to them through the processing of the spectrogram image. We have already mentioned the familiar ways of using point-to-speech sound, speeches of the speech spectrogram, and the steps to reproduce the image. The article discusses the topics of modern speech recognition algorithms, initial processing methods for speech signals, algorithm of speech spectrogram image generation, stages of spectrum image processing (filtration, site allocation,correlation comparison). Any speech (word or phrase) consists of small phonemes (letters or combination of letters). Given this fact, this article considers the most commonly used phonemes in the Uzbek language, namely, the development of six vowel letters.
Ушбу мақолада ўзбек тили нутқ товушларини таниш усулларидан бири келтирилган. Биринчи босқичда нутқ сигналларини частота соҳасида қайта ишлаш асосида спектрограмма тасвирлари ҳосил қилинган. Иккинчи босқичда ҳосил қилинган спектрограмма тасвирларини дастлабки қайта ишлаш асосида параметрлаш ва улар асосида нутқ товушларини бир-бири билан солиштиришнинг корреляцион таҳлил амалга оширилган.
This article outlines the algorithm for speech recognition. The article outlines the stages of the speech spectrogram image. The speech spectrum image is the richest parameter that characterizes speech. High accuracy can be achieved by recognizing the speech in other ways by processing the resulting spectrogram image. Today, many speech recognition algorithms use the speech spectrogram. An important aspect of familiarity with the speech spectrogram is the transition from one-dimensional signal to two-dimensional. The main parameters characterizing the speech during the transition are separated using spectral transformation methods. As the basic parameters, the main tone frequency serves as a form. There are many effective ways to get to knowing speech, and most commonly, this speech is familiar to them through the processing of the spectrogram image. We have already mentioned the familiar ways of using point-to-speech sound, speeches of the speech spectrogram, and the steps to reproduce the image. The article discusses the topics of modern speech recognition algorithms, initial processing methods for speech signals, algorithm of speech spectrogram image generation, stages of spectrum image processing (filtration, site allocation,correlation comparison). Any speech (word or phrase) consists of small phonemes (letters or combination of letters). Given this fact, this article considers the most commonly used phonemes in the Uzbek language, namely, the development of six vowel letters.