Determination of input parameters of the neural network model, intended for phoneme recognition of a voice signal in the systems of distance learning

Berik Akhmetov; Igor Tereykovsky; Aliya Doszhanova; Lyudmila Tereykovskaya

Determination of input parameters of the neural network model, intended for phoneme recognition of a voice signal in the systems of distance learning

Authors

Berik Akhmetov Caspian State University of Technologies and Engineering named after Sh. Yessenov
Igor Tereykovsky National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
Aliya Doszhanova Almaty University of Power Engineering and Telecommunications
Lyudmila Tereykovskaya Kyiv National University of Construction Architecture

Abstract

The article is devoted to the problem of voice
signals recognition means introduction in the system of distance
learning. The results of the conducted research determine the
prospects of neural network means of phoneme recognition.
It is also shown that the main diculties of creation of the
neural network model, intended for recognition of phonemes
in the system of distance learning, are connected with the
uncertain duration of a phoneme-like element. Due to this
reason for recognition of phonemes, it is impossible to use
the most eective type of neural network model on the basis
of a multilayered perceptron, at which the number of input
parameters is a xed value. To mitigate this shortcoming, the
procedure, allowing to transform the non-stationary digitized
voice signal to the xed quantity of mel-cepstral coecients,
which are the basis for calculation of input parameters of
the neural network model, is developed. In contrast to the
known ones, the possibility of linear scaling of phoneme-
like elements is available in the procedure. The number of
computer experiments conrmed expediency of the fact that
the use of the oered coding procedure of input parameters
provides the acceptable accuracy of neural network recognition
of phonemes under near-natural conditions of the distance
learning system. Moreover, the prospects of further research in
the eld of development of neural network means of phoneme
recognition of a voice signal in the system of distance learning
is connected with an increase in admissible noise level. Besides,
the adaptation of the oered procedure to various natural
languages, as well as to other applied tasks, for instance, a
problem of biometric authentication in the banking sector, is
also of great interest.

Author Biographies

Berik Akhmetov, Caspian State University of Technologies and Engineering named after Sh. Yessenov

Rector

Igor Tereykovsky, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

professor

Aliya Doszhanova, Almaty University of Power Engineering and Telecommunications

assosiated professor of the Department IT-engineering

References

V. Mikhaylenko Neural network models and methods of recognition

of phonemes in a voice signal in the system of distance learning:

[Monograph] / V. M. Mikhailenko, L. O. Tereykovskaya, I.

A. Tereykovsky., B. B. Akhmetov. - K .: CP "Komprint", 2017.-

A Najib, A Basari, A Pee, M Daimon, A Rahman, L Tahir

ONLINE PERFORMANCE DIALOGUE SYSTEM MODEL

(e-DP): A REQUIREMENT ANALYSIS STUDY AT BATU

PAHAT DISTRICT EDUCATION OFFICE Journal of Theoretical

and Applied Information Technology. 31st December

Vol.95. No 24 P. 6699 6706.

A. Mosa, M. Mahrin, R. Yuso A SYSTEMATIC LITERATURE

REVIEW OF TECHNOLOGICAL FACTORS FOR ELEARNING

READINESS IN HIGHER EDUCATION. Journal

of Theoretical and Applied Information Technology. 30th

November 2016. Vol.93. No.2. P. 500 521.

I. Veritawati, I. Wasito, T. Basaruddin TEXT INTERPRETATION

USING A MODIFIED PROCESS OF THE ONTOLOGY

AND SPARSE CLUSTERING. Journal of Theoretical

and Applied Information Technology 15th March 2017. Vol.95.

No 5. P. 1019-1028.

A.Kadir, A. Yauri AUTOMATED SEMANTIC QUERY FORMULATION

USING MACHINE LEARNING APPROACH.

Journal of Theoretical and Applied Information Technology.

th June 2017. Vol.95. No 12. P. 2761-2775.

J. Park, J. Yoon, Y. Seo, G. Jang SPECTRAL ENERGY

BASED VOICE ACTIVITY DETECTION FOR REAL-TIME

VOICE INTERFACE. Journal of Theoretical and Applied

Information Technology. 15th September 2017. Vol. 95 No17.

P. 4304-4312.

A Agranovsky, D. Lednov Theoretical aspects of algorithms

for processing and classifying speech signals. - M .: Radio and

Communication, 2004. - Ch. 1. 164 c.

L. Babenko, D. Subbotin, V. Fedorov, P. Yurkov DEFINITION

OF THE BORDERS BETWEEN THE FONEMAS BY A

NEUROET NETWORK METHOD. Izvestiya Southern Federal

University. Technical science. 2003 4 Òîì33. Pp. 321-323.

T. Kartbayev, B. Akhmetov, A. Doszhanova, K. Mukapil,

A. Kalizhanova, G. Nabiyeva, L. Balgabayeva, F. Malikova

DEVELOPMENT OF A COMPUTER SYSTEM FOR IDENTITY

AUTHENTICATION USING ARTIFICIAL NEURAL

NETWORKS. Image Analysis & Stereology, 10.5566/ias.1612.

V.36, 1, 2017.

O. Fedyaev, I. Bondarenko Neural network algorithm for

speaker-independent recognition of speech phonemes. USIM,

, No. 4 C. 41- 50.

B. Meyer , T. Jurgens, T. Wesker, T. Brand, B. Kollmeier

Human phoneme recognition depending on speech-intrinsic

variability. J Acoust Soc Am. 2010 Nov;128(5):3126-41.

Y. Qian, M. Bi, T. Tan, K. Yu, "Very deep convolutional neural

networks for noise robust speech recognition," in IEEE/ACM

Trans. Audio Speech Language Process. , vol. 24, no. 12, pp.

-2276, 2016.

V. Lila, E. Puchkov Methodology of training a recurrent arti-

cial neural network with dynamic stack memory. International

magazine "Software products and systems", Tver, 4, 2014 p.

[on pages 132-135].

Understanding LSTM Networks Posted on August 27,

(http://colah.github.io/posts/2015-08-Understanding-

LSTMs/) .

A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, K. Lang

¾Phoneme Recognition Using Time - Delay Neural Networks¿,

IEEE Transactions On Accoustics, Speech And Signal Processing,

Vol. 37, 1989.

M. Gusev Methods and models of recognition of Russian speech

in information systems: dis. ... doctors of techn. Sciences:

13.01 / MN Gusev - St. Petersburg, 2014. - 378 p.

I. Tereykovskii Optimization of the structure of a two-chirped

perceptron, possible distribution of fertility of anomalous inuences

of experimental parameters of computer technology / IA

Tereykovskii // Scientic and technical journal "Management

of branching of folded systems" Kiev. National University of

Architecture. - 2011. - Vol. 5. - S. 128-131.

I. Boykov, A. Ivanov, D. Kalashnikov ALGORITHM OF

THE CONSTRUCTION OF THE STATISTIC DISCRETECONTINUOUS

DESCRIPTION OF THE DURATION OF

THE AUDIO SOURCES OF THE INCREASED SPEECH OF

THE DICTOR. News of higher educational institutions. The

Volga region. 4 (36), 2015 p.64-76

Downloads

Published

2018-10-28

Issue

Vol. 64 No. 4 (2018)

Section

ARTICLES / PAPERS / General

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

1. License

The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on https://creativecommons.org/licenses/by/4.0/.

2. Author’s Warranties

The author warrants that the article is original, written by stated author/s, has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author/s. The undersigned also warrants that the manuscript (or its essential substance) has not been published other than as an abstract or doctorate thesis and has not been submitted for consideration elsewhere, for print, electronic or digital publication.

3. User Rights

Under the Creative Commons Attribution license, the author(s) and users are free to share (copy, distribute and transmit the contribution) under the following conditions: 1. they must attribute the contribution in the manner specified by the author or licensor, 2. they may alter, transform, or build upon this work, 3. they may use this contribution for commercial purposes.

4. Rights of Authors

Authors retain the following rights:

- copyright, and other proprietary rights relating to the article, such as patent rights,

- the right to use the substance of the article in own future works, including lectures and books,

- the right to reproduce the article for own purposes, provided the copies are not offered for sale,

- the right to self-archive the article

- the right to supervision over the integrity of the content of the work and its fair use.

5. Co-Authorship

If the article was prepared jointly with other authors, the signatory of this form warrants that he/she has been authorized by all co-authors to sign this agreement on their behalf, and agrees to inform his/her co-authors of the terms of this agreement.

6. Termination

This agreement can be terminated by the author or the Journal Owner upon two months’ notice where the other party has materially breached this agreement and failed to remedy such breach within a month of being given the terminating party’s notice requesting such breach to be remedied. No breach or violation of this agreement will cause this agreement or any license granted in it to terminate automatically or affect the definition of the Journal Owner. The author and the Journal Owner may agree to terminate this agreement at any time. This agreement or any license granted in it cannot be terminated otherwise than in accordance with this section 6. This License shall remain in effect throughout the term of copyright in the Work and may not be revoked without the express written consent of both parties.

7. Royalties

This agreement entitles the author to no royalties or other fees. To such extent as legally permissible, the author waives his or her right to collect royalties relative to the article in respect of any use of the article by the Journal Owner or its sublicensee.

8. Miscellaneous

The Journal Owner will publish the article (or have it published) in the Journal if the article’s editorial process is successfully completed and the Journal Owner or its sublicensee has become obligated to have the article published. Where such obligation depends on the payment of a fee, it shall not be deemed to exist until such time as that fee is paid. The Journal Owner may conform the article to a style of punctuation, spelling, capitalization and usage that it deems appropriate. The Journal Owner will be allowed to sublicense the rights that are licensed to it under this agreement. This agreement will be governed by the laws of Poland.

By signing this License, Author(s) warrant(s) that they have the full power to enter into this agreement. This License shall remain in effect throughout the term of copyright in the Work and may not be revoked without the express written consent of both parties.

Determination of input parameters of the neural network model, intended for phoneme recognition of a voice signal in the systems of distance learning

Authors

Abstract

Author Biographies

Berik Akhmetov, Caspian State University of Technologies and Engineering named after Sh. Yessenov

Igor Tereykovsky, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Aliya Doszhanova, Almaty University of Power Engineering and Telecommunications

References

Downloads

Published

Issue

Section

License

Information

Current Issue