Bc. Dan Makalouš
Master's thesis
Age-of-Acquisition Ratings for Czech Words
Abstract:
This thesis presents age-of-acquisition ratings for 32,954 Czech words. The work consists of gathering sources, assessing their quality, and combining them to create the resulting data set. The sources include word properties (length, frequency), foreign age-of-acquisition studies, word embedding vectors (used to find and exploit similarity between words), and a self-conducted experiment in which we collected 5,778 responses (subjective …
Language used: English
Date on which the thesis was submitted / produced: 17. 5. 2022
Identifier:
https://is.muni.cz/th/huzae/
Thesis defence
- Date of defence: 24. 6. 2022
- Supervisor: doc. Mgr. Radek Pelánek, Ph.D.
- Reader: RNDr. Pavel Šmerk, Ph.D.
Full text of thesis
Published in Theses: accessible to the world
Institution archiving the thesis and making it accessible: Masaryk University, Faculty of Informatics
Master programme / field: Artificial intelligence and data processing / Machine learning and artificial intelligence
Theses on a related topic
- Self-encrypting drives: collection and visualization of features (Matyáš Szabó)
- Systematic collection of TPM 2.0 chips attributes on Linux (Daniel Zaťovič)
- Automated collection of network traffic (Eduard Ruisl)
- Ansible Collection for Perun (Šimon Brauner)
- Technologies for collection and automated processing of security information (Šimon Hajský)
- Automated Collection of Open Source Intelligence (Ondřej Zoder)
- Provisioning of Monitoring Infrastructure for Traffic Flow Collection (Ondřej Molík)