RESEARCH PRODUCT

Separating compound figures in journal articles to allow for subfigure classification

Fabrice Meriaudeau, Antonio Foncubierta-Rodríguez, Dimitrios Markonis, Ajad Chhatkuli, Henning Müller

subject

Information retrieval; 020205 medical informatics; Computer science; Process (computing); 020207 software engineering; 02 engineering and technology; Filter (signal processing); Image (mathematics); [INFO.INFO-TI] Computer Science [cs]/Image Processing [eess.IV]; 0202 electrical engineering electronic engineering information engineering; Benchmark (computing); Noise (video); Focus (optics); ComputingMilieux_MISCELLANEOUS

description

Journal images represent an important part of the knowledge stored in the medical literature. Figure classification has received much attention because information about image types can be used in a variety of contexts to focus image search and to filter out unwanted information or "noise", for example non-clinical images. A major problem in figure classification is that many figures in the biomedical literature are compound figures that often contain more than one figure type. Some journals separate compound figures into several parts, but many do not, so separation currently has to be done manually. In this work, a technique for compound figure separation is proposed and implemented, based on the systematic detection and analysis of uniform space gaps. The method is evaluated on a dataset of journal figures from the open access literature that was created for the ImageCLEF 2012 benchmark and contains about 3000 compound figures. Automatic tools can easily reach a relatively high accuracy in separating compound figures. To further increase accuracy, efforts are needed to improve the detection process and to avoid over-separation by means of powerful analysis strategies. The tools described in this article have also been tested on a database of approximately 150,000 compound figures from the biomedical literature, making these images available as separate figures for further image analysis and making it possible to filter important information from them.
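
To make the idea of detecting uniform space gaps concrete, the following is a minimal sketch in Python. It is not the authors' implementation: the function names, the grayscale list-of-lists image representation, and the `min_gap` and `tol` thresholds are all assumptions chosen for illustration. A row or column counts as "uniform" when its pixel values differ by at most `tol`, and a run of at least `min_gap` uniform lines inside the figure is treated as a separator between subfigures.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: a grayscale image as rows of pixel values.
Image = Sequence[Sequence[int]]


def uniform_lines(lines: Image, tol: int = 0) -> List[bool]:
    """Flag each line (row) whose pixel values differ by at most `tol`."""
    return [max(line) - min(line) <= tol for line in lines]


def gap_runs(flags: List[bool], min_gap: int) -> List[Tuple[int, int]]:
    """Return (start, end) of interior runs of uniform lines, end exclusive.

    Runs touching the first or last line are treated as outer margins,
    not as separators, and are skipped.
    """
    runs, start = [], None
    for i, flag in enumerate(flags + [False]):  # sentinel closes a trailing run
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            if start > 0 and i < len(flags) and i - start >= min_gap:
                runs.append((start, i))
            start = None
    return runs


def split_ranges(lines: Image, min_gap: int, tol: int) -> List[Tuple[int, int]]:
    """Cut the line index range at every detected gap run."""
    pieces, first = [], 0
    for start, end in gap_runs(uniform_lines(lines, tol), min_gap):
        pieces.append((first, start))
        first = end
    pieces.append((first, len(lines)))
    return pieces


def separate(img: Image, min_gap: int = 3, tol: int = 0) -> List[Tuple[int, int, int, int]]:
    """Return (top, bottom, left, right) boxes: split by rows, then by columns."""
    boxes = []
    for top, bottom in split_ranges(img, min_gap, tol):
        band = img[top:bottom]
        columns = [list(col) for col in zip(*band)]  # transpose the band
        for left, right in split_ranges(columns, min_gap, tol):
            boxes.append((top, bottom, left, right))
    return boxes
```

In this sketch, `min_gap` is a crude guard against the over-separation mentioned above: narrow uniform stripes inside a subfigure (for example between bars of a chart) are ignored unless they are at least `min_gap` lines wide. A single pass of rows followed by columns is used here for brevity; nested layouts would need the split to be applied recursively.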

https://hal.archives-ouvertes.fr/hal-00831527