![]() ![]() The previous version of Collatinus (11 bêta) added a lot of new features: It is possible to get the number of occurrences of known words. lemmatisation and translation of words that are supposed to be known (during the colorization of a text) are not given anymore.the english version is now fully available.Ability to take into account characters of the same rank, for instance ph=f et y=i (in Djvu). Collatinus now takes into account non-Latin alphabets in a different alphabetical order, including those with digrams, for the consultation of dictionaries in Djvu format (for example if you want to add a dictionary Czech-Latin).documentation of the server-side application of Collatinus (in French only).recovery of 13 000 english translations.ability to export the results of a lemmatisation or tagging process as CSV. ![]() The current version of Collatinus, 11.2, comes with some new features, improvements and bug fixes: When scanning a text, Collatinus applies the usual rules of elision and hiatus. Starting from a lemma and its associated flexional endings, Collatinus is also capable of displaying the corresponding inflection tables, which Latin learners may find useful.įinally, when syllable quantities are known for a given lemma, Collatinus can scan the word and even the entire text. Adding lemmas with spelling variants (such as medieval spellings, for example) would make it possible to recognise all of their inflected forms as well. The advantage to this approach is that Collatinus, with its 11,000 lemmas, is capable of recognising over half a million forms. Unlike the majority of lemmatisers, which use lists of inflected forms, Collatinus uses a lexicon containing the lemmas and all the necessary information for their inflection. to provide students with exercises based on Latin texts.as a lexical and morphological reference when reading a Latin text disposer,.Further improvements and adjustments were made when it became apparent that many people were using it for other purposes: Originally, Collatinus was meant to produce printed documents, and it is still used for this purpose. software source code refactored in a modular way in order to facilitate the development of more specific applications.ability to query Collatinus from a client in console mode.ability to use Collatinus from another program (e.g.access to dictionaries in text or image mode (djvu).probabilistic tagger based on statistics from LASLA texts (~1,800,000 words).scansion and accentuation of a word or text,.ability to colourize a text according to a list of known words,.translations of about 11,000 lemmas in 6 different languages, other than French and English (to be improved).classification of the lemmatizations and morphosyntactic analysis according to the number of occurrences observed in LASLA texts,.basic glossary of 24,155 lemmas (Classic texts from LASLA) and extension of 58,384 lemmas, with some additional variants and the assimilation of the prefix,.Collation with those from Georges 1913 and Jeanneau 2017 (with the author's permission), lemmas from Lewis & Short 1879 and Gaffiot 2016 (with the author's permission) with translations into English and French.depends of course on the machine on which the program is running), efficiency of the lemmatization process (~1000 words/s.display syllable quantities (long and short syllables) and inflection (declension and conjugation). ![]()
0 Comments
Leave a Reply. |