Catálogo de publicaciones - libros

Compartir en
redes sociales

Document Analysis Systems VII: 7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006, Proceedings

Horst Bunke ; A. Lawrence Spitz (eds.)

En conferencia: 7º International Workshop on Document Analysis Systems (DAS) . Nelson, New Zealand . February 13, 2006 - February 15, 2006

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Database Management; Pattern Recognition; Information Storage and Retrieval; Image Processing and Computer Vision; Simulation and Modeling; Computer Appl. in Administrative Data Processing

Disponibilidad

Institución detectada	Año de publicación	Navegá	Descargá	Solicitá
No detectada	2006	SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-32140-8

ISBN electrónico

978-3-540-32157-6

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

2006

Información sobre derechos de publicación

Cobertura temática

Ciencias de la computación e información

Tabla de contenidos

Verificá que desde tu institución tengas acceso para descargar o solicitar el libro completo o alguno de sus capítulos.

doi: 10.1007/11669487_51

Performance Evaluation of Text Detection and Tracking in Video

Vasant Manohar; Padmanabhan Soundararajan; Matthew Boonstra; Harish Raju; Dmitry Goldgof; Rangachar Kasturi; John Garofolo

Text detection and tracking is an important step in a video content analysis system as it brings important semantic clues which is a vital supplemental source of index information. While there has been a significant amount of research done on video text detection and tracking, there are very few works on performance evaluation of such systems. Evaluations of this nature have not been attempted because of the extensive effort required to establish a reliable ground truth even for a moderate video dataset. However, such ventures are gaining importance now. In this paper, we propose a generic method for evaluation of object detection and tracking systems in video domains where ground truth objects can be bounded by simple geometric shapes (polygons, ellipses). Two comprehensive measures, one each for detection and tracking, are proposed and substantiated to capture different aspects of the task in a single score. We choose text detection and tracking tasks to show the effectiveness of our evaluation framework. Results are presented from evaluations of existing algorithms using real world data and the metrics are shown to be effective in measuring the total accuracy of these detection and tracking algorithms.

Palabras clave: False Alarm; Ground Truth; Object Detection; Tracking Task; Text Region.

- Posters | Pp. 576-587

doi: 10.1007/11669487_52

Document Analysis System for Automating Workflows

Steven J. Simske; Jordi Arnabat

When a user places a document in a capture device—copier, multi-functional printer [MFP], or scanner—the user expects good output to be produced regardless of the document type. There are a variety of means to achieve improved output, in which the settings on the copying device are tuned to the content characteristics of the document. These settings can be automated across the range of scanned context extremes from photo (blurring, no snapping) to fully-text (sharpening, aggressive snapping) documents. This procedure is “document auto typing”, and relies on a fast and accurate assessment of the content of the captured image. We herein describe the development of seven distinct systems for document analysis, and through the comparison of these systems arrive at an efficient and accurate document analysis system for automating the copying settings. We discuss the applicability of this method to other automated workflows in document capture.

Palabras clave: Optical Character Recognition; Black Pixel; Solid Region; Projection Profile; Document Analysis System.

- Posters | Pp. 588-592

doi: 10.1007/11669487_53

Automatic Assembling of Cadastral Maps Based on Generalized Hough Transformation

Fei Liu; Wataru Ohyama; Tetsushi Wakabayashi; Fumitaka Kimura

There are numerous cadastral maps generated by past land surveying. The raster digitization of these paper maps is in progress. For effective and efficient use of these maps, we have to assemble the set of maps to make them superimposable on other geographic information in a Geographic Information System. The problem can be seen as a complex jigsaw puzzle where the pieces are the cadastre sections extracted from the map. We present an automatic solution to this geographic jigsaw puzzle, based on the generalized Hough transformation that detects the longest common boundary between every piece and its neighbors. The experiments have been conducted using the map of Mie Prefecture, Japan and the French cadastral map. The results of the experiment with the French cadastral maps show that the proposed method, which consists of extracting an external area and extracting and regularizing the north arrow, is suitable for assembling the cadastral map. The final goal of the process is to integrate every piece of the puzzle into a national geographic reference frame and database.

Palabras clave: Common Boundary; Hough Transformation; Jigsaw Puzzle; Internal Area; Automatic Assemble.

- Posters | Pp. 593-603

doi: 10.1007/11669487_54

A Few Steps Towards On-the-Fly Symbol Recognition with Relevance Feedback

Jan Rendek; Bart Lamiroy; Karl Tombre

This paper presents some first steps in building an interactive system which allows a user to efficiently browse a large set of scanned documents, without prior knowledge on the content of these documents, and retrieving symbols of interest to him personally, through a relevance feedback mechanism.

- Posters | Pp. 604-615

doi: 10.1007/11669487_55

The Fuzzy-Spatial Descriptor for the Online Graphic Recognition: Overlapping Matrix Algorithm

Noorazrin Zakaria; Jean-Marc Ogier; Josep Llados

In this paper we present the algorithm of our fuzzy-spatial descriptor for symbol recognition in on-line sketches. Fuzzy sets are used to enhance this approach to cope with the inherent distortion of freehand drawings. It allows handling some primitive irregularities due to different ways of drawing in complex composite forms. The descriptor provides complete local information of the primitives that consist of only straight lines. This descriptor used the opening angle between primitives and its overlapping properties in 1D projection to describe its local spatial properties. Segmentation by curve and pen-speed properties of each drawing has been used in obtaining our basic primitives. The experiment results show that a part from its rotational-invariant, this descriptor is also robust to multi-scale of segmentation of its primitives.

Palabras clave: Membership Function; Linguistic Term; Fuzzy Membership Function; Speed Profile; Spatial Descriptor.

- Posters | Pp. 616-627