RESEARCH OF THE PROBLEM OF SPEECH RECOGNITION FOR SOLUTION OF SPECIAL TASKS

Authors

  • O. Pomortseva, O.M. Beketov National University of Urban Economy in Kharkiv
  • S. Kobzan, O.M. Beketov National University of Urban Economy in Kharkiv

DOI:

https://doi.org/10.33042/2522-1809-2022-6-173-91-95

Keywords:

transcription, time code, language decoding, geolocation, database, geographic information system

Abstract

In this article, the authors study a pressing problem: the automatic conversion of speech from audio or video files into text (transcription). Transcription is needed by people with physical disabilities or illnesses, and by anyone who has to process information as a text file. It is especially relevant now, under wartime conditions. In Ukraine today, transcription is needed to solve complex special tasks, namely searching for and identifying particular content in conversations that are transmitted over various means of communication and recorded as audio files. Such tasks are currently highly relevant and quite time-consuming.

To address this problem, the authors examined the programs most often used for transcription and identified their strengths and weaknesses. The types of transcription and the software currently in use are presented in separate tables together with their features. Existing automatic speech transcription algorithms still make significant errors, but their main advantage is speed (near-synchronous output). When solving special tasks, time is the decisive factor.

Terabytes of clearly annotated data are needed to increase the accuracy of the text produced by a transcription program. Programs with artificial intelligence, in addition to extracting entities to understand the meaning of speech, make it possible to recognize and understand its form: the combinations of sounds, letters, and syllables that make up words and sentences. Only in this way can a machine decode human speech correctly. An extremely important task is determining the speaker's location (geolocation), down to the specific real estate object. This can be used for data collection and subsequent analysis of public sentiment, and for rapid response followed by localization of illegal activities. The authors conclude that, for decoding audio files and automatically converting them into text, a promising direction is to use not just ready-made services but services with built-in artificial intelligence, so-called self-learning systems.

Author Biographies

О. Pomortseva, O.M. Beketov National University of Urban Economy in Kharkiv

Ph.D., Associate Professor, Associate Professor at the Department of Land Administration and Geographic Information Systems

S. Kobzan, O.M. Beketov National University of Urban Economy in Kharkiv

Ph.D., Associate Professor, Associate Professor at the Department of Land Administration and Geographic Information Systems

References

Kobzan, S., Pomortseva, O. (2021). Real estate market research using GIS. Trends and prospects for development. Collection of scientific works ΛΌГOΣ 2021, vol. 3, pp. 151–156. Oxford, United Kingdom.

Kobzan, S. M. (2019). Real estate market formation: practical aspects and features of evaluation. Monograph. Kyiv: Yurinkom Inter. 267 p.

Petrushina, V., Boitsova, M., Kobzan, S. (2008). Real estate transactions. Factor Publishing House. 678 p.

Transcription software to help transcribe speech into text. Retrieved from: https://sendpulse.com/ru/blog/transcription-software.

Zaluzhnyi, V., Zabrodskyi, M. Prospects for ensuring the military campaign of 2023: the Ukrainian view. Retrieved from: https://www.ukrinform.ua/rubric-ato/3566162-ak-zabezpeciti-voennu-kampaniu-u-2023-roci-ukrainskij-poglad.html.

Tolstokhatko, V. A., Pomortseva, O. E., Patrakeev, I. M. (2014). Databases: design and use for real estate accounting: textbook. Kharkiv: O.M. Beketov National University of Urban Economy in Kharkiv. 176 p.

Pomortseva, O., Kobzan, S., Voronkov, O., Yevdokimov, A. (2021). Geospatial Modeling of the Infrastructure Facility Optimal Location. Second International Conference on Sustainable Futures: Environmental, Technological, Social, and Economic Matters, held at Kryvyi Rih National University, Kryvyi Rih, Ukraine, May 19-21. EasyChair, No. 5537.

Published

2022-12-16

How to Cite

Pomortseva, O., & Kobzan, S. (2022). RESEARCH OF THE PROBLEM OF SPEECH RECOGNITION FOR SOLUTION OF SPECIAL TASKS. Municipal Economy of Cities, 6(173), 91–95. https://doi.org/10.33042/2522-1809-2022-6-173-91-95
