Fast Similarity Searching of Text Documents using Learned Metric Index – Bc. Jakub Žovák
Bc. Jakub Žovák
Bachelor's thesis
Fast Similarity Searching of Text Documents using Learned Metric Index
Fast Similarity Searching of Text Documents using Learned Metric Index
Abstract:
Textové dokumenty, ako sú blogy, statusy na sociálnych sieťach, spravodajské články, eseje a textové správy, predstavujú jeden z hlavných zdrojov informácií na internete. Preto je mimoriadne dôležité takéto dáta efektívne indexovať a vyhľadávať. Keďže sú však textové objekty rozsiahle a komplexné, hľadanie presnej zhody je prakticky nemožné. Preto sa tieto objekty musia vyhľadávať na základe podobnosti …moreAbstract:
Text documents such as blog posts, tweets, news articles, essays, and text messages, represent one of the primary sources of information on the internet. Therefore, it is paramount to index and search such data efficiently. However, since these objects are large and complex, searching for an exact match is practically impossible. Therefore, text objects must be searched based on the notion of similarity …more
Language used: English
Date on which the thesis was submitted / produced: 19. 5. 2022
Identifier:
https://is.muni.cz/th/wmtet/
Thesis defence
- Date of defence: 29. 6. 2022
- Supervisor: RNDr. Matej Antol, Ph.D.
- Reader: Mgr. Miriama Jánošová
Citation record
Full text of thesis
Contents of on-line thesis archive
Published in Theses:- světu
Other ways of accessing the text
Institution archiving the thesis and making it accessible: Masarykova univerzita, Fakulta informatikyMasaryk University
Faculty of InformaticsBachelor programme / field:
Informatics / Informatics
Theses on a related topic
-
Similarity searching of proteins using machine learning techniques
Martin Gendiar -
Implementation of Unsupervised Learned Metric Index
Vojtěch Kaňa -
Implementace Learned Metric Index
Terézia Slanináková -
Learned Indexing in Vector Database Management Systems
Jakub Žovák