Search results for "Parsing"
showing 10 items of 46 documents
Testing Grammars for Parsability
1990
In the preceding chapters we have studied in detail the major methods of deterministic context-free parsing: strong LL(k) parsing (Chapter 5), simple precedence parsing (Chapter 5), canonical LR(k) parsing, LALR(k) parsing, and SLR(k) parsing (Chapters 6 and 7), and canonical LL(k) parsing (Chapter 8). Each of these methods induces a class of grammars that are “parsable” using that method, that is, a class of grammars for which a deterministic parser employing that method can be constructed. For example, the LL(k) grammars constitute the class of grammars parsable by the LL(k) parsing method. By definition, a context-free grammar is an LL(k) grammar if and only if its canonical LL(k) parser…
On the use of relational expressions in the design of efficient algorithms
2005
Relational expressions have finite binary relations as arguments and the operations are composition (·), closure (*), inverse (⁻¹), and union (∪). The efficient computation of the relation denoted by a relational expression is considered, and a tight bound is established on the complexity of the algorithm suggested by Hunt, Szymanski and Ullman. The result implies a unified method for deriving efficient algorithms for many problems in parsing. For example, optimal algorithms are derived for strong LL(1) and strong LL(2) parser construction and an efficient polynomial-time algorithm is derived for determining the inessential error entries in an LR(1) parsing table.
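The four operations named in this abstract can be illustrated with a minimal Python sketch. It is a naive evaluation over sets of pairs, not the Hunt, Szymanski and Ullman algorithm and not the efficient method the paper analyses. The function names are hypothetical, and treating closure (*) as the reflexive-transitive closure over an explicitly supplied domain is an assumption.

```python
def compose(r, s):
    """Composition r·s: all (a, c) such that (a, b) is in r and (b, c) is in s."""
    successors = {}
    for b, c in s:
        successors.setdefault(b, set()).add(c)
    return {(a, c) for a, b in r for c in successors.get(b, ())}


def inverse(r):
    """Inverse: swap the two components of every pair."""
    return {(b, a) for a, b in r}


def union(r, s):
    """Union of two relations."""
    return set(r) | set(s)


def closure(r, domain):
    """Reflexive-transitive closure over `domain` (assumed meaning of *).

    Naive fixed-point iteration: keep composing until nothing new appears.
    """
    result = {(x, x) for x in domain} | set(r)
    while True:
        extended = result | compose(result, result)
        if extended == result:
            return result
        result = extended


# Illustrative relation on the hypothetical domain {1, 2, 3}.
domain = {1, 2, 3}
r = {(1, 2), (2, 3)}
r_star = closure(r, domain)
```

For this small relation, `compose(r, r)` contains only the pair `(1, 3)`, and `r_star` adds that pair and the three reflexive pairs to `r`.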
WCxf: An exchange format for Wilson coefficients beyond the Standard Model
2018
We define a data exchange format for numerical values of Wilson coefficients of local operators parameterising low-energy effects of physics beyond the Standard Model. The format facilitates interfacing model-specific Wilson coefficient calculators, renormalisation group (RG) runners, and observable calculators. It is designed to be unambiguous (defining a non-redundant set of operators with fixed normalisation in each basis), extensible (allowing the addition of new EFTs or bases by the user), and robust (being based on industry standard file formats with parsers implemented in many programming languages). We have implemented the format for the Standard Model EFT (SMEFT) and for the weak e…
New evidence for chunk-based models in word segmentation.
2014
There is ample evidence that infants are able to exploit statistical cues to discover the words of their language. However, how they do so is the subject of enduring debate. The prevalent position is that words are extracted from the prior computation of statistics, in particular the transitional probabilities between syllables. As an alternative, chunk-based models posit that the sensitivity to statistics results from other processes, whereby many potential chunks are considered as candidate words, then selected as a function of their relevance. These two classes of models have proven to be difficult to dissociate. We propose here a procedure, which lea…
RIGA at SemEval-2016 Task 8: Impact of Smatch Extensions and Character-Level Neural Translation on AMR Parsing Accuracy
2016
Two extensions to the AMR smatch scoring script are presented. The first extension combines the smatch scoring script with the C6.0 rule-based classifier to produce a human-readable report on the frequency of error patterns observed in the scored AMR graphs. This first extension results in a 4% gain over the state-of-the-art CAMR baseline parser by adding to it a manually crafted wrapper fixing the identified CAMR parser errors. The second extension combines a per-sentence smatch with an ensemble method for selecting the best AMR graph among the set of AMR graphs for the same sentence. This second modification automatically yields a further 0.4% gain when applied to outputs of two nondeterministic…
How Do Viewers Spontaneously Segment Animated Diagrams of Mechanical and Biological Subject Matter?
2012
A challenge for learning from animated diagrams is to first parse the continuous flow of information into discrete event units. Inadequacies in this parsing process can prejudice the quality of the mental model constructed from the depiction. One approach that has been proposed for ameliorating such problems is for the designer to pre-segment the animation. However, the pre-segmentation techniques used tend to be either intuitive or based on an expert's understanding of the subject matter. Neither of these approaches takes proper account of the psychological processing that must occur for an external animation to be properly internalized. This poster reports a study of the processes that l…
An Extended Data Object-driven Approach to Data Quality Evaluation: Contextual Data Quality Analysis
2019
This research extends a data object-driven approach to data quality evaluation, making it possible to analyse the quality of a data object in the context of multiple data objects. The previously presented approach was used to analyse one particular data object, focusing mainly on syntactic analysis. With the extension, the quality of the primary data object can be analysed against an unlimited number of secondary data objects. This makes a more comprehensive, in-depth contextual analysis of data objects possible. The analysis was applied to open data sets, comparing previously obtained results with the results of the extended approach and underlining the importance and benefits of the given exten…
FrameNet CNL: A Knowledge Representation and Information Extraction Language
2014
The paper presents a FrameNet-based information extraction and knowledge representation framework, called FrameNet-CNL. The framework is used on natural language documents and represents the extracted knowledge in a tailor-made Frame-ontology from which unambiguous FrameNet-CNL paraphrase text can be generated automatically in multiple languages. This approach brings together the fields of information extraction and CNL, because a source text can be considered as belonging to FrameNet-CNL if an information extraction parser produces the correct knowledge representation as a result. We describe a state-of-the-art information extraction parser used by a national news agency and speculate that Fram…
Rigotrio At Semeval-2017 Task 9: Combining Machine Learning And Grammar Engineering For Amr Parsing And Generation
2017
By addressing both text-to-AMR parsing and AMR-to-text generation, SemEval-2017 Task 9 established AMR as a powerful semantic interlingua. We strengthen the interlingual aspect of AMR by applying the multilingual Grammatical Framework (GF) for AMR-to-text generation. Our current rule-based GF approach completely covered only 12.3% of the test AMRs, so we combined it with the state-of-the-art JAMR Generator to see whether the combination increases or decreases overall performance. The combined system achieved an automatic BLEU score of 18.82 and a human Trueskill score of 107.2, to be compared to the plain JAMR Generator results. As for AMR parsing, we added NER extensions to our SemEva…
Development of a Tool for Automated Refactoring of SAP UI5 Application Source Code
2018
In this qualification paper, a tool was developed for the automated transformation of SAP UI5 application source code. The tool analyses SAP UI5 JavaScript code and replaces outdated, less efficient synchronous module loading methods with current asynchronous module loading methods that are more efficient in a web context, and it also makes the code changes that follow from this replacement. The aim of the tool is to make this refactoring process automatic, fast, and complete, without human error, for those applications and projects whose development began long enough ago that their code needs to be updated in this way. The tool was developed using the SAP UI5 framework on the user interface side, Node…