Search results for "Crawling"
showing 5 items of 5 documents
Beyond the “ivory tower”. Comparing academic and non-academic knowledge on social entrepreneurship
2021
The increasing relevance of societal challenges has recently brought social entrepreneurship to the fore due to its capacity to leverage entrepreneurial processes to achieve social value while ensuring profits. In this study, we apply an experimental research method to analyse the concept of social entrepreneurship comprehensively. More specifically, we develop bibliometric analysis and web crawling techniques to gather information related to social entrepreneurship from Scopus and Wikipedia. We conduct a comparative network analysis of social entrepreneurship’s conceptual structure at academic and non-academic levels. This analysis has been performed considering scientific articles’ keywor…
On Utilizing Stochastic Non-linear Fractional Bin Packing to Resolve Distributed Web Crawling
2014
This paper deals with the extremely pertinent problem of web crawling, which is far from trivial considering the magnitude and all-pervasive nature of the World-Wide Web. While numerous AI tools can be used to deal with this task, in this paper we map the problem onto the combinatorially-hard stochastic non-linear fractional knapsack problem, which, in turn, is then solved using Learning Automata (LA). Such LA-based solutions have recently been shown to outperform previous state-of-the-art approaches to resource allocation in Web monitoring. However, the ever-growing deployment of distributed systems raises the need for solutions that cope with a distributed setting. In this paper, we prese…
Towards automatic assessment of government web sites
2013
This paper presents an approach for automatic assessment of web sites in large-scale e-Government surveys. The approach aims at supplementing and to some extent replacing human evaluation, which is typically the core part of these surveys. The heart of the solution is a colony-inspired algorithm, called the lost sheep, which automatically locates targeted governmental material online. The algorithm centers around classifying link texts to determine if a web page should be downloaded for further analysis. The proposed algorithm is designed to work with minimal human interaction and to utilize the available resources as well as possible. Using the lost sheep, the people carrying out a survey will only…
Web crawling for linguistic purposes. Selected aspects of collecting and analysing textual data, using Russian-language online news as an example
2021
The author of this article has collected approximately 2.7 million Russian-language online news items. The main aims of this text are to discuss the concept of web crawling in relation to acquiring textual data from the internet, to discuss the structuring of such data in unannotated text corpora, and to present selected aspects of analysing data structured in this way. The author treats online news items as a combination of the main text and the metadata that identify and characterise it (distinguished during their automatic extraction from web pages). Dividing news items into main text and metadata makes it possible to analyse them…
Newborn crawling and rooting in response to maternal breast odor
2021
A growing literature shows that perception and action are already tightly coupled in the newborn. The current study aimed to examine the nature of the coupling between olfactory stimuli from the mother and the newborn's crawling and rooting (exploratory movements of the head). To examine the coupling, the crawling and rooting behavior of 28 two-day-old newborns was studied while they were supported prone on a mobility device shaped like a mini skateboard, the Crawliskate®, with their heads positioned directly on top of a pad infused with either their mother's breast odor (Maternal) or the odor of water (Control). Video and 3D kinematic analyses of the number and types of li…