Search results for "corpus linguistic"

showing 6 items of 86 documents

Statystyka, programowanie i co jeszcze? : o wyzwaniach współczesnego językoznawstwa

2020

This chapter offers a refl ection on the state-of-the-art and the future of linguistics. In the light of rapid development and growing popularity of quantitative, statistical research methods, the author refl ects upon their impact on contemporary linguistics (notably corpus linguistics), notably as regards methodology and linguistic data analysis. An attempt is also made in this paper to ruminate over the skills required from linguists regarding current publication standards in top-notch journals and research quality requirements. The discussion focuses on statistics and programming skills as more and more studies make ample use of various statistical metrics and computational tools. The a…

statisticscorpus linguisticslinguisticsresearch methodologyquantitative methodsprogramming
researchProduct

Inter-annotator agreement in spoken language annotation: Applying uα-family coefficients to discourse segmentation

2021

As databases make Corpus Linguistics a common tool for most linguists, corpus annotation becomes an increasingly important process. Corpus users do not need only raw data, but also annotated data, submitted to tagging or parsing processes through annotation protocols. One problem with corpus annotation lies in its reliability, that is, in the probability that its results can be replicable by independent researchers. Inter-annotation agreement (IAA) is the process which evaluates the probability that, applying the same protocol, different annotators reach similar results. To measure agreement, different statistical metrics are used. This study applies IAA for the first time to the Valencia E…

subactsLinguistics and LanguageComputer scienceProcess (engineering)media_common.quotation_subjectP1-1091computer.software_genreLanguage and LinguisticsSet (abstract data type)Annotationkripendorff’s α-coefficientsCorpus linguisticsConversationSegmentationval.es.co. modelPhilology. Linguisticsmedia_commoninter-annotator agreementParsingbusiness.industrycorpus annotationdiscourse segmentationArtificial intelligencebusinesscomputerNatural language processingSpoken languageRussian Journal of Linguistics
researchProduct

Using Corpus Linguistics as a research and training tool for Public Service Interpreting (PSI) in the legal sector

2017

Public Service Translation has for long been the ‘forgotten voice’ in PSI studies but it is arguably a valuable linguistic support for legal institutions and for training interpreters in the legal sector. Given that interpreters in the legal system in Italy often tend to ‘double-up’ as legal translators (to make a living) the line between the two is often hazy. Hybrid modalities like sight translation of legal and administrative documents is also a ‘borderline’ feature of these intertwined professions. The main aim of this paper is to describe how parallel and monolingual corpora can be used to train public service interpreters in double roles (translators, interpreters), namely by using co…

trainingCorpus data; Public service; intepreters; legal;intepreterscorpus linguisticsCorpus dataPublic serviceSettore L-LIN/12 - Lingua e Traduzione - Lingua Inglesediscourse markerslegalPublic Service Interpreting
researchProduct

Digital Community Storytelling as a Sociopolitical Critical Device

2016

Este artículo analiza 40 historias digitales comunitarias basadas en los principios de estudios de discurso crítico basados ​​en corpus (Baker et al., 2008). Las historias se analizan con la intención de probar si pueden clasificarse como ejemplos de historias sociopolíticas digitales. Las historias digitales socio-políticas se describen aquí como una herramienta que los individuos pueden usar para hacer llegar al público opiniones sobre temas que puedan afectar a la democracia (Couldry, 2008), el bienestar social y la estabilidad; y puede servir para facilitar la comunicación, interacción e intercambio de información sobre prácticas sociales conflictivas. Las historias analizadas pueden co…

ucrel lexical choicesconcordancesdigital storytelling; corpus linguistics; discourse analysis; keywords; ucrel lexical choices;storytellingcorpus linguisticsconcordances; storytelling;Anàlisi del discursdiscourse analysiskeywordsdigital storytelling
researchProduct

Pastāvīgo vārdu biežums laikrakstu tiešsaistes publikācijas un to diskusiju ziņās

2016

Šis darbs iepazīstina ar laikraksta The Guardian tiešsiastes publikāciju analīzi. Šī darba mērķis ir izanalizēt pastāvīgo vārdu (lietvārdu, darbības vārdu un īpašības vārdu) biežumu laikrakstu publikācijas un to diskusiju ziņās. Galvenā pētījuma metode bija korpusa-lingvistiskā metode, kas balstīta uz Makenerija (2006) teoriju. Šī darba autors izveidoja mini-korpusu, kas sastāvēja no divām teksta grupām: laikrakstu publikācijām un diskusiju ziņām. Vārdšķiru biežuma sadalījums korpusā, norāda uz formalitātes līmeņu atšķirību starp laikrakstu publikācijām un to diskusiju ziņām. Iegūtie rezultāti norāda uz reģistra variāciju izanalizētājos korpusa žanra tekstos. Lai identificētu līdzības un at…

word frequencydiscussion messagesValodniecībacorpus linguisticsnewspaper articlesparts of speech
researchProduct

Formulaicity in constrained communication : an intermodal approach

2021

DeutschIn dieser an Korpuslinguistik, Formelsprache und Studien uber eingeschrankte Kommunikation grenzende Forschungsstudie, die sich hier auf Ubersetzung, Dolmetschen und L2 konzentriert, wollen wir uberprufen, ob die in der polnisch-englischen Komponente eines intermodalen EPTIC-Korpus gefundenen eingeschrankten Texte sich von den einheimischen Texten unterscheiden in Bezug auf die Verwendung benachbarter Wortkombinationen, oft Bigrams genannt, und ob ahnliche Muster in gesprochenen und geschriebenen Registern gefunden werden konnen. Dazu erarbeiteten wir das Poisson-Regressionsmodell mit festen und zufalligen Effekten. Die Ergebnisse zeigen, dass die ubersetzte Sprache zur hoheren Anzah…

übersetzungTranslationConstrained languageCorpus linguisticsBigramTraducción e InterpretaciónInterpretingFormulaic languageeingeschränkte sprachedolmetschenformelspracheCorpus linguisticsPsychologyHumanitieskorpuslinguistikMonografias de Traduccion e Interpretacion
researchProduct