Aleš Horák | Masaryk University

Papers by Aleš Horák

Sustainable long-term WordNet development and maintenance: Case study of the Czech WordNet

Cognitive Studies, Dec 20, 2018

Technological Approaches to Detecting Online Disinformation and Manipulation

Challenging Online Propaganda and Disinformation in the 21st Century, 2021

Bilingual Logical Analysis of Natural Language Sentences

One of the main aims of logical analysis of natural language expressions is to capture meaning structures independently of the selected “means of transport,” i.e. of the particular natural language used. Logical analysis should just offer a “bridge between language expressions.” In this paper, we show the preliminary results of automated bilingual logical analysis, namely the analysis of English and Czech sentences. The underlying logical formalism, Transparent Intensional Logic (TIL), is a representative of higher-order temporal logic designed to express the full meaning relations of natural language expressions. We present the details of the current development and preparation of the supporting lexicons for the AST (automated semantic analysis) tool when working with a new language, i.e. English. The AST provides an implementation of the Normal Translation Algorithm for TIL, aiming to offer a normative logical analysis of the input sentences. We show the simila...

Contract Metadata Identification in Czech Scanned Documents

Proceedings of the 13th International Conference on Agents and Artificial Intelligence, 2021

Increasing Coverage of Translation Memories with Linguistically Motivated Segment Combination Methods

Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts

Proceedings - Natural Language Processing in a Deep Learning World, 2019

USING WORDNETS AND ONTOLOGIES FOR TEXT-MEANING ASSIGNMENT - Implementation Details of the KYOTO Project First Phase

Proceedings of the 4th International Conference on Software and Data Technologies, 2009

DEBVisDic: Instant Wordnet Building

The semantic network editor DEBVisDic has been used by different development teams to create more than 20 national wordnets. The editor was recently redeveloped as a multi-platform web-based application for general semantic network editing. One of its main advantages over the previous implementation is that no client-side installation is needed. Following the successful first phase in building the Open Dutch Wordnet, DEBVisDic was extended with features that allow users to easily create, edit, and share a new (usually national) wordnet without the need for any complicated configuration or advanced technical skills. The DEBVisDic editor provides advanced features for wordnet browsing, editing, and visualization. Apart from the user-friendly web-based application, DEBVisDic also provides an API for integrating the semantic network data into external applications.

Wordnet Consistency Checking via Crowdsourcing

Large ontologies and semantic networks are complex multi-level structures that cannot be easily verified by common checking methods. Automatic consistency checks can reveal systematic errors, e.g. missing links, but finding a missing word sense is difficult. Common solutions rely on gradually consulting many information sources in a step-by-step review process. The paper describes a new approach to verifying and extending wordnet data by involving users. This approach ensures an early release of the full dataset for use by the target group, followed by continuous modifications based on suggestions from public users and the checking of these suggestions by experts. The team of experts has the suggested corrections available in a clear aggregated form, along with support for revision and editing.

Improving RNN-based Answer Selection for Morphologically Rich Languages

Proceedings of the 12th International Conference on Agents and Artificial Intelligence, 2020

TOWARDS AN INTELLIGENT QUESTION-ANSWERING SYSTEM - State-of-the-art in the Artificial Mind

Proceedings of the 4th International Conference on Agents and Artificial Intelligence, 2012

Question and Answer Classification in Czech Question Answering Benchmark Dataset

Proceedings of the 11th International Conference on Agents and Artificial Intelligence, 2019

A New Approach for Semi-Automatic Building and Extending a Multilingual Terminology Thesaurus

International Journal on Artificial Intelligence Tools, 2019

This paper describes a new system for semi-automatically building, extending and managing a terminological thesaurus: a multilingual terminology dictionary enriched with relationships between the terms themselves to form a thesaurus. The system can radically enhance the workflow of current terminology expert groups, where most of the editing decisions still come from introspection. The presented system supplements the lexicographic process with natural language processing techniques, which are seamlessly integrated into the thesaurus editing environment. The system’s methodology and the resulting thesaurus are closely connected to new domain corpora in the six languages involved. They are used for term usage examples as well as for the automatic extraction of new candidate terms. The terminological thesaurus is now accessible via a web-based application, which (a) presents rich detailed information on each term, (b) visualizes term relations, and (c) displays real-life usage exam...

Lexicographic Tools to Build New Encyclopaedia of the Czech Language

The Prague Bulletin of Mathematical Linguistics, 2016

The first edition of the Encyclopaedia of the Czech Language was published in 2002, and since then it has established itself as one of the basic reference books for the study of the Czech language and related linguistic disciplines. However, many new concepts and even new research areas have emerged since that publication. That is why preparation of a completely new edition of the encyclopaedia started in 2011, rather than just reprinting the previous version with supplements. The new edition covers the current state of research in all concepts connected with the linguistic study of (predominantly, but not solely) the Czech language. The project ran for five years and finished at the end of 2015; the printed edition is currently in preparation. An important innovation of the new encyclopaedia lies in the decision that the new edition will be published both as a printed book and as an electronic online encyclopaedia, utilizing the many advantages of electronic dictionaries. In t...

Preparing VerbaLex Printed Edition

Semiautomatic Building and Extension of Terminological Thesaurus for Land Surveying Domain

Improving Coverage of Translation Memories with Language Modelling

Automatic classification of patterns from the Pattern Dictionary of English Verbs

Knowledge Base for Transparent Intensional Logic and Its Use in Automated Daily News Retrieval and Answering Machine

International Journal of Machine Learning and Computing, 2012

Linguistic Logical Analysis of Direct Speech

RASLAN 2012 Recent Advances in Slavonic Natural Language Processing

Logical analysis of natural language makes it possible to extract semantic relations that are not revealed by standard full-text search methods. Intensional logic systems, such as Transparent Intensional Logic (TIL), can rigorously describe even the higher-order relations between the speaker and the content or meaning of the discourse. In this paper, we concentrate on the mechanism of logical analysis of direct and indirect discourse by means of TIL. We explicate the procedure within the Normal Translation Algorithm (NTA) for TIL, which covers ...
