0000000000178653

AUTHOR

Normunds Gruzitis

0000-0003-0511-1829

showing 9 related works from this author

Pini Language and PiniTree Ontology Editor: Annotation and Verbalisation for Atomised Journalism

2020

We present a new ontology language Pini and the PiniTree ontology editor supporting it. Despite Pini language bearing lot of similarities with RDF, UML class diagrams, Property Graphs and their frontends like Google Knowledge Graph and Protege, it is a more expressive language enabling FrameNet-style natural language annotation for Atomised journalism use case.

Computer science05 social sciences050801 communication & media studies02 engineering and technologycomputer.file_formatOntology languageProtégéLinguisticsAnnotation0508 media and communicationsUnified Modeling Language0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingJournalismClass diagramRDFcomputerNatural languagecomputer.programming_language
researchProduct

FrameNet Resource Grammar Library for GF

2012

In this paper we present an ongoing research investigating the possibility and potential of integrating frame semantics, particularly FrameNet, in the Grammatical Framework (GF) application grammar development. An important component of GF is its Resource Grammar Library (RGL) that encapsulates the low-level linguistic knowledge about morphology and syntax of currently more than 20 languages facilitating rapid development of multilingual applications. In the ideal case, porting a GF application grammar to a new language would only require introducing the domain lexicon - translation equivalents that are interlinked via common abstract terms. While it is possible for a highly restricted CNL,…

Morphology (linguistics)GrammarComputer sciencebusiness.industrymedia_common.quotation_subjectGrammatical Frameworkcomputer.software_genreLexiconSyntaxConstructed languageNounFrame semanticsArtificial intelligenceArgument (linguistics)FrameNetbusinesscomputerNatural language processingmedia_common
researchProduct

Polysemy in Controlled Natural Language Texts

2015

Computational semantics and logic-based controlled natural languages (CNL) do not address systematically the word sense disambiguation problem of content words, i.e., they tend to interpret only some functional words that are crucial for construction of discourse representation structures. We show that micro-ontologies and multi-word units allow integration of the rich and polysemous multi-domain background knowledge into CNL thus providing interpretation for the content words. The proposed approach is demonstrated by extending the Attempto Controlled English (ACE) with polysemous and procedural constructs resulting in a more natural CNL named PAO covering narrative multi-domain texts.

FOS: Computer and information sciencesComputer Science - Computation and LanguageInterpretation (logic)Computer sciencebusiness.industryRepresentation (arts)Content wordcomputer.software_genrelanguage.human_languageControlled natural languageComputational semanticslanguageAttempto Controlled EnglishArtificial intelligencePolysemybusinessComputation and Language (cs.CL)computerNatural languageNatural language processing
researchProduct

Extracting Formal Models from Normative Texts

2016

Normative texts are documents based on the deontic notions of obligation, permission, and prohibition. Our goal is model such texts using the C-O Diagram formalism, making them amenable to formal analysis, in particular verifying that a text satisfies properties concerning causality of actions and timing constraints. We present an experimental, semi-automatic aid to bridge the gap between a normative text and its formal representation. Our approach uses dependency trees combined with our own rules and heuristics for extracting the relevant components. The resulting tabular data can then be converted into a C-O Diagram.

Dependency (UML)business.industryComputer scienceFormalism (philosophy)Deontic logic020207 software engineering0102 computer and information sciences02 engineering and technologyPermissioncomputer.software_genre01 natural sciencesCausality (physics)010201 computation theory & mathematicsComputingMethodologies_DOCUMENTANDTEXTPROCESSING0202 electrical engineering electronic engineering information engineeringNormativeObligationArtificial intelligenceHeuristicsbusinesscomputerNatural language processing
researchProduct

Embedded controlled language to facilitate information extraction from eGov policies

2015

The goal of this paper is to propose a system that can extract formal semantic knowledge representation from natural language eGov policies. We present an architecture that allows for extracting Controlled Natural Language (CNL) statements from heterogeneous natural language texts with the ability to support multilinguality. The approach is based on the concept of embedded CNLs.

Language identificationNatural language user interfacebusiness.industryComputer scienceNatural language programmingcomputer.software_genrelanguage.human_languageInformation extractionUniversal Networking LanguageControlled natural languageQuestion answeringlanguageArtificial intelligencebusinesscomputerNatural language processingNatural languageProceedings of the 17th International Conference on Information Integration and Web-based Applications & Services
researchProduct

Domain Expert Platform for Goal-Oriented Dialog Collection

2021

Today, most dialogue systems are fully or partly built using neural network architectures. A crucial prerequisite for the creation of a goal-oriented neural network dialogue system is a dataset that represents typical dialogue scenarios and includes various semantic annotations, e.g. intents, slots and dialogue actions, that are necessary for training a particular neural network architecture. In this demonstration paper, we present an easy to use interface and its back-end which is oriented to domain experts for the collection of goal-oriented dialogue samples. The platform not only allows to collect or write sample dialogues in a structured way, but also provides a means for simple annotat…

Subject-matter expertService (systems architecture)AnnotationGoal orientationHuman–computer interactionComputer scienceInterface (Java)Sample (statistics)Dialog boxDomain (software engineering)Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
researchProduct

Rigotrio At Semeval-2017 Task 9: Combining Machine Learning And Grammar Engineering For Amr Parsing And Generation

2017

By addressing both text-to-AMR parsing and AMR-to-text generation, SemEval-2017 Task 9 established AMR as a powerful semantic interlingua. We strengthen the interlingual aspect of AMR by applying the multilingual Grammatical Framework (GF) for AMR-to-text generation. Our current rule-based GF approach completely covered only 12.3% of the test AMRs, therefore we combined it with state-of-the-art JAMR Generator to see if the combination increases or decreases the overall performance. The combined system achieved the automatic BLEU score of 18.82 and the human Trueskill score of 107.2, to be compared to the plain JAMR Generator results. As for AMR parsing, we added NER extensions to our SemEva…

InterlinguaGenerator (computer programming)Parsingbusiness.industryComputer scienceSpeech recognitionGrammatical Framework02 engineering and technologycomputer.software_genreComputer scienceSemEvallanguage.human_languageTask (project management)020204 information systems0202 electrical engineering electronic engineering information engineeringlanguage020201 artificial intelligence & image processingGrammar engineeringArtificial intelligencebusinesscomputerNatural language processingBLEU
researchProduct

Extracting Semantic Knowledge from Unstructured Text Using Embedded Controlled Language

2016

Nowadays, most of the data on the Web is still in the form of unstructured text. Knowledge extraction from unstructured text is highly desirable but extremely challenging due to the inherent ambiguity of natural language. In this article, we present an architecture of an information extraction system based on the concept of Embedded Controlled Language that allows for extracting formal semantic knowledge from an unstructured text corpus. Moreover, the presented approach has a potential to support multilingual input and output.

Information retrievalConcept searchNoisy text analyticsbusiness.industryComputer scienceText simplification010401 analytical chemistryText graph02 engineering and technologycomputer.software_genre01 natural scienceslanguage.human_language0104 chemical sciencesInformation extractionControlled natural languageKnowledge extractionExplicit semantic analysis0202 electrical engineering electronic engineering information engineeringlanguage020201 artificial intelligence & image processingArtificial intelligencebusinesscomputerNatural language processing2016 IEEE Tenth International Conference on Semantic Computing (ICSC)
researchProduct

Abstract Syntax as Interlingua: Scaling Up the Grammatical Framework from Controlled Languages to Robust Pipelines

2020

Abstract syntax is an interlingual representation used in compilers. Grammatical Framework (GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and Web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many oth…

Linguistics and LanguageInterlinguaComputer scienceGrammatical Framework0102 computer and information sciences02 engineering and technologycomputer.software_genre01 natural sciencesLanguage and LinguisticsDevelopment (topology)Artificial IntelligenceAbstract syntax0202 electrical engineering electronic engineering information engineeringScalingProgramming languageRepresentation (systemics)lcsh:P98-98.5language.human_languageComputer Science Applications010201 computation theory & mathematicslanguage020201 artificial intelligence & image processingCompilerlcsh:Computational linguistics. Natural language processingcomputerNatural languageComputational Linguistics
researchProduct