Fundamentally, as mentioned before, this study set out to show that a sufficiently comprehensive phonetically annotated corpus can reveal patterns that experts might have missed or overstated. This project showed that to be the case: some errors were confirmed to be as prevalent as the experts describe, others were found to be less frequent, and new consistent mispronunciations were uncovered. This research focused on Russian speakers of English because this area of study is underdeveloped: few, if any, phonetically annotated corpora contain enough data to analyze segmental errors in detail. This is the main reason why the study was conducted, the corpus collected and annotated, and the data analyzed. The study can support further developments in this area, and the data on combined frequency, functional load and error priority should also prove useful in different settings. Ultimately, these findings should still be confirmed and adjusted with the help of L1 speakers of English, who would provide judgements of comprehensibility, intelligibility and accentedness. The errors identified in this analysis should provide a good starting point for collecting such judgements.

Appendix I. List of ARPAbet symbols and corresponding IPA symbols and