METHODOLOGICAL ASPECTS OF TEACHING STUDENTS TO CREATE DIGITAL FREQUENCY DICTIONARIES
PDF (Ukrainian)

Keywords

frequency dictionary
Python programming language
program code
teaching methods

How to Cite

[1]
O. V. Riezina, “METHODOLOGICAL ASPECTS OF TEACHING STUDENTS TO CREATE DIGITAL FREQUENCY DICTIONARIES”, ITLT, vol. 72, no. 4, pp. 214–225, Sep. 2019, doi: 10.33407/itlt.v72i4.2498.

Abstract

Frequency dictionaries are created to identify the most frequently used words in a natural language, a writer’s language, a particular literary work, etc. These dictionaries are used while learning foreign languages, creating other kinds of dictionaries, conducting language experiments, carrying out research in the fields of lexical semantics, psycholinguistics, morphology, etc. and designing applications focused on natural language processing. Information and communication technology and access to large digital linguistic corpora and national word corpora based on the subtitles of movies and TV shows have accelerated research in the field of statistical text processing. Due to frequency dictionaries being widely used in various fields of activity and their creation offering the solution to a wide range of linguistic issues, it seems reasonable to analyze the technology of creating such dictionaries in the process of training future specialists in Applied Linguistics and Computer Science as well as teachers of Computer Science. The paper runs about the methods of teaching students to create frequency dictionaries. An algorithm of creating a frequency dictionary is offered. The peculiarities of realizing each stage of the algorithm with the help of the popular Python programming language, which has an open code and extensive libraries, are analyzed. The author provides program codes and justifies the use of corresponding modules, string methods, functions, constants, data structures, and regular expressions. The proposed methodology is aimed at 1) raising the students’ motivation to study and 2) revealing the practical significance of the acquired programming methods and techniques. The author argues that the approach presented in this paper can be of advantage to teachers of computer-related courses.

PDF (Ukrainian)

References

M. Brysbaert, and B. New, " Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English," Behavior Research Methods, vol. 41, no. 4, pp. 977-990, Nov. 2009. doi: https://doi.org/10.3758/BRM.41.4.977.

M. Davies, and D. Gardner, A Frequency Dictionary of American English: Word Sketches, Collocates and Thematic Lists. London, UK: Routledge, 2010.

C. Lehmann, "Frequency dictionary", Christianlehmann.eu, 2018. [Електронний ресурс]. Доступно: https://www.christianlehmann.eu/ling/ling_meth/ling_description/lexicography/frequency_dict.html. Дата звернення: Липень 17, 2018.

Word frequency data, Wordfrequency.info, 2018. [Електронний ресурс]. Доступно: https://www.wordfrequency.info/uses.asp. Дата звернення: Липень 17, 2018.

J. DeRocher, M. Miron, S. Patten and C. Pratt, The Counting of Words: A Review of the History, Techniques and Theory of Word Counts with Annotated Bibliography. New York, NY, USA: Syracuse University Research Corporation, 1973.

E. Tseng, S. A. Gandhi, A. D. Kramer, and L. S. Clair, "Blending customized user dictionaries based on frequency of usage - Google Patents", Patents.google.com, 2018. [Електронний ресурс]. Доступно: https://patents.google.com/patent/US9977774B2/en. Дата звернення: Липень 17, 2018.

R. Meier, J. Hausmann, H. Urbschat, and T. Wanschura, "Hierarchical Dictionary with Statistical Filtering Based on Word Frequency - Google Patents", Patents.google.com, 2018. [Електронний ресурс]. Доступно: https://patents.google.com/patent/US20170220679A1/en. Дата звернення: Липень 17, 2018.

Н. Дарчук, "Можливості семантичної розмітки корпусу української мови (КУМ)", Науковий часопис НПУ імені М. П. Драгоманова. Серія 9: Сучасні тенденції розвитку мов, випуск №15, с. 18-28, 2017.

Н. Угольнікова, та М. Чухненко, "Особливості класифікації творів масової літератури", Вісник НТУ «ХПІ». Актуальні проблеми розвитку українського суспільства, № 29 (1251), с. 91-97, 2017.

W. Turkel, and A. Crymble, "Counting Word Frequencies with Python", Programming historian.org, 2018. [Електронний ресурс]. Доступно: https://programminghistorian.org/en/lessons/counting-frequencies. Дата звернення: Липень 17, 2018.

С. Помирча, та І. Пучков, "Електронні словники з української мови як засіб формування лексикографічної компетентності майбутніх учителів початкової школи", Інформаційні технології і засоби навчання, т. 59, №3, с. 104-113, 2017.

D. Jurafsky, and J. Martin, Speech and language processing, 2nd ed. Upper Saddle River, NJ, USA: Prentice Hall, Inc., 2009.

Common string operations – Python 3.7.0 documentation, Docs.python.org, 2018. [Електронний ресурс]. Доступно:https://docs.python.org/3/library/string.html. Дата звернення: Липень 18, 2018.

Data Structures – Python 3.7.0 documentation, Docs.python.org, 2018. [Електронний ресурс]. Доступно: https://docs.python.org/3/tutorial/datastructures.html. Дата звернення: Липень 18, 2018.

S. Bird, E. Klein, and E. Loper,"Accessing Text Corpora and Lexical Resources", in Natural language processing with Python – Analyzing Text with the Natural Language Toolkit, Nltk.org, 2018. [Електронний ресурс]. Доступно:https://www.nltk.org/book/ch02.html. Дата звернення: Липень 18, 2018.

О. Рєзіна, "Технології статистичного опрацювання текстів та методика їх навчання", Науковий часопис НПУ імені М. П. Драгоманова, Серія 2: Комп’ютерно-орієнтовані системи навчання, випуск №19(26), с. 98-103, 2017.


REFERENCES (TRANSLATED AND TRANSLITERATED)

M. Brysbaert, and B. New, " Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English," Behavior Research Methods, vol. 41, no. 4, pp. 977-990, Nov. 2009. doi: https://doi.org/10.3758/BRM.41.4.977. (in English)

M. Davies, and D. Gardner, A Frequency Dictionary of American English: Word Sketches, Collocates and Thematic Lists. London, UK: Routledge, 2010. (in English)

C. Lehmann, "Frequency dictionary", Christianlehmann.eu, 2018. [Online]. Available: https://www.christianlehmann.eu/ling/ling_meth/ling_description/lexicography/frequency_dict.html. Accessed on: Jul. 17, 2018. (in English)

Word frequency data, Wordfrequency.info, 2018. [Online]. Available: https://www.wordfrequency.info/uses.asp. Accessed on: Jul. 17, 2018. (in English)

J. DeRocher, M. Miron, S. Patten and C. Pratt, The Counting of Words: A Review of the History, Techniques and Theory of Word Counts with Annotated Bibliography. New York, NY, USA: Syracuse University Research Corporation, 1973. (in English)

E. Tseng, S. A. Gandhi, A. D. Kramer, and L. S. Clair, "Blending customized user dictionaries based on frequency of usage - Google Patents", Patents.google.com, 2018. [Online]. Available: https://patents.google.com/patent/US9977774B2/en. Accessed on: Jul. 17, 2018. (in English)

R. Meier, J. Hausmann, H. Urbschat, and T. Wanschura, "Hierarchical Dictionary with Statistical Filtering Based on Word Frequency - Google Patents", Patents.google.com, 2018. [Online]. Available: https://patents.google.com/patent/US20170220679A1/en. Accessed on: Jul. 17, 2018. (in English)

N. Darchuk, " Capabilities of Semantic Tagging Within the Ukrainian Corpus", Scientific journal of National Drahomanov Pedagogical University. Series 9: Modern Trends in the Development of Languages, No. 15, pp. 18-28, 2017. (in Ukrainian)

N. Uholnikova, and M. Chukhnenko, "Features of the classification of works of mass literature", Bulletin of NTU "KhPI". Series: Actual problems of Ukrainian society development, No. 29 (1251), pp. 91-97, 2017. (in Ukrainian)

W. Turkel, and A. Crymble, "Counting Word Frequencies with Python", Programminghistorian.org, 2018. [Online]. Available: https://programminghistorian.org/en/lessons/counting-frequencies. Accessed on: Jul. 17, 2018. (in English)

S. Pomyrcha, and I. Puchkov, "Electronic dictionaries in ukrainian as a mean of forming lexicographical competence of future primary school teachers", Information Technologies and Learning Tools, vol. 59, no. 3, pp. 104-113, 2017. (in Ukrainian)

D. Jurafsky, and J. Martin, Speech and language processing, 2nd ed. Upper Saddle River, NJ, USA: Prentice Hall, Inc., 2009. (in English)

Common string operations – Python 3.7.0 documentation, Docs.python.org, 2018. [Online]. Available: https://docs.python.org/3/library/string.html. Accessed on: Jul. 18, 2018. (in English)

Data Structures – Python 3.7.0 documentation, Docs.python.org, 2018. [Online]. Available: https://docs.python.org/3/tutorial/datastructures.html. Accessed on: Jul. 18, 2018. (in English)

S. Bird, E. Klein, and E. Loper,"Accessing Text Corpora and Lexical Resources", in Natural language processing with Python – Analyzing Text with the Natural Language Toolkit, Nltk.org, 2018. [Online]. Available: https://www.nltk.org/book/ch02.html. Accessed on: Jul. 18, 2018. (in English)

O. Riezina, "Statistical text processing techniques and their teaching methods", Scientific journal of National Drahomanov Pedagogical University. Series 2: Computer-oriented learning systems, No. 19(26), pp. 98-103, 2017. (in Ukrainian)

Authors who publish in this journal agree to the following terms:

  1. Authors hold copyright immediately after publication of their works and retain publishing rights without any restrictions.
  2. The copyright commencement date complies the publication date of the issue, where the article is included in.

Content Licensing

  1. Authors grant the journal a right of the first publication of the work under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0) that allows others freely to read, download, copy and print submissions, search content and link to published articles, disseminate their full text and use them for any legitimate non-commercial purposes (i.e. educational or scientific) with the mandatory reference to the article’s authors and initial publication in this journal.
  2. Original published articles cannot be used by users (exept authors) for commercial purposes or distributed by third-party intermediary organizations for a fee.

Deposit Policy

  1. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) during the editorial process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (see this journal’s registered deposit policy at Sherpa/Romeo directory).
  2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
  3. Post-print (post-refereeing manuscript version) and publisher's PDF-version self-archiving is allowed.
  4. Archiving the pre-print (pre-refereeing manuscript version) not allowed.

Downloads

Download data is not yet available.