Catálogo de publicaciones - libros

Compartir en
redes sociales


Parallel Computing Technologies: 9th International Conference, PaCT 2007, Pereslavl-Zalessky, Russia, September 3-7, 2007. Proceedings

Victor Malyshkin (eds.)

En conferencia: 9º International Conference on Parallel Computing Technologies (PaCT) . Pereslavl-Zalessky, Russia . September 3, 2007 - September 7, 2007

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Programming Techniques; Computer System Implementation; Software Engineering/Programming and Operating Systems; Computer Systems Organization and Communication Networks; Computation by Abstract Devices; Algorithm Analysis and Problem Complexity

Disponibilidad
Institución detectada Año de publicación Navegá Descargá Solicitá
No detectada 2007 SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-73939-5

ISBN electrónico

978-3-540-73940-1

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

Información sobre derechos de publicación

© Springer-Verlag Berlin Heidelberg 2007

Tabla de contenidos

Toward a Distributed Implementation of OpenMP Using CAPE

Éric Renault

Traditionally, checkpointing techniques have been used to secure the execution of sequential and parallel programs. This article shows that checkpointing techniques can also be used to automatically generate a parallel program from a sequential program, this program being executed on any kind of distributed parallel system. The article also presents how this new technique have been included inside the usual compilation chain to provide a distributed implementation of OpenMP. Finally, some performance measurements are discussed.

- Techniques for Parallel Programming Supporting | Pp. 303-312

Multicriteria Scheduling Strategies in Scalable Computing Systems

Victor Toporkov

An approach to generation and optimization of scheduling and resource allocation strategies in scalable computing systems is proposed. The approach allows the decomposition of the problem of multicriteria strategy synthesis for the totality of parameterized models of programs with the use of partial and vector quality criteria including, for instance, a cost function and load balancing factors.

- Techniques for Parallel Programming Supporting | Pp. 313-317

Latencies of Conflicting Writes on Contemporary Multicore Architectures

Josef Weidendorfer; Michael Ott; Tobias Klug; Carsten Trinitis

This paper provides a detailed investigation of latency penalties caused by repeated memory writes to nearby memory cells from different threads in parallel programs. When such writes map to the same corresponding cache lines in multiple processors, one can observe the so called effect. This effect can unnecessarily hamper parallel code due to the line granularity based cache hierarchy, which is common on contemporary processor architectures. In this contribution, a benchmark allowing for quantitative estimates about the consequences of the false sharing effect, is presented. Results show that multicore architectures with shared cache can reduce unwanted effects of false sharing.

- Techniques for Parallel Programming Supporting | Pp. 318-327

A Novel Self-Similar () Traffic Filter to Enhance E-Business Success by Improving Internet Communication Channel Fault Tolerance

Allan K. Y. Wong; Wilfred W. K. Lin; Tharam S. Dillon; Jackei H. K. Wong

Internet traffic patterns can cause serious buffer overflow in electronic business (e-business) systems. This leads to widespread retransmission that prolongs the service roundtrip time (RTT). As a result customers are unhappy and avoid returning to do more business. The previous Real-Time Traffic Pattern Detector (RTPD) was proposed to improve Internet channel fault tolerance. With RTPD support time-critical applications can identify the traffic patterns and invoke the corresponding measures to neutralize their ill effects in a dynamic manner. The extant RTPD, however, cannot detect self-similar traffic. This inspired the development of the novel self-similarity () filter proposed in this paper, which makes the RTPD capability more complete. The “RTPD + ” package is the RTPD or ERTPD package. The test results indicate that the addition of the mechanism can indeed contribute to improved e-business communication channels over the Internet.

- Techniques for Parallel Programming Supporting | Pp. 328-339

Accelerating the Singular Value Decomposition of Rectangular Matrices with the CSX600 and the Integrable SVD

Yusaku Yamamoto; Takeshi Fukaya; Takashi Uneyama; Masami Takata; Kinji Kimura; Masashi Iwasaki; Yoshimasa Nakamura

We propose an approach to speed up the singular value decomposition (SVD) of very large rectangular matrices using the CSX600 floating point coprocessor. The CSX600-based acceleration board we use offers 50GFLOPS of sustained performance, which is many times greater than that provided by standard microprocessors. However, this performance can be achieved only when a vendor-supplied matrix-matrix multiplication routine is used and the matrix size is sufficiently large. In this paper, we optimize two of the major components of rectangular SVD, namely, QR decomposition of the input matrix and back-transformation of the left singular vectors by matrix , so that large-size matrix multiplications can be used efficiently. In addition, we use the Integrable SVD algorithm to compute the SVD of an intermediate bidiagonal matrix. This helps to further speed up the computation and reduce the memory requirements. As a result, we achieved up to 3.5 times speedup over the Intel Math Kernel Library running on an 3.2GHz Xeon processor when computing the SVD of a 100,000 × 4000 matrix.

- Techniques for Parallel Programming Supporting | Pp. 340-345

Parallel Dynamic SPT Update Algorithm in OSPF

Yuanbo Zhu; Mingwei Xu; Qian Wu

Shortest-Path-Tree (SPT) computation, as the main load in OSPF protocol, contributes to the slow convergence time in intra-domain routing. With the increasing interest for upcoming routers of multi-core based processing board, efficient parallel routing algorithms are required to take this advantage to speedup SPT computation in order to meet the needs for fast failure recovery applications such as VoIP. However, currently available parallel SPT algorithms are all based on static method, which re-computes the entire tree for each link change. In this paper, we explore parallel algorithms for dynamic SPT update, a more efficient method, which only updates the affected nodes by making use of the previous SPT We first analyze characters of dynamic method to show how they affect parallel design; then we give our parallel dynamic SPT algorithm framework, which uses: (1) parallel distance-updating mode, to get a near liner speedup (assuming perfect load balance) and (2) group-removal schema, to reduce communication cost. Further, to provide load balance, we give a task distribution algorithm called RR_DFS, which makes use of the topology information of the previous SPT. Complexity analysis and simulation result are also presented

- Techniques for Parallel Programming Supporting | Pp. 346-359

Pedestrian and Crowd Dynamics Simulation: Testing SCA on Paradigmatic Cases of Emerging Coordination in Negative Interaction Conditions

Stefania Bandini; Mizar Luca Federici; Sara Manzoni; Giuseppe Vizzari

The paper presents a set of theoretical experiments performed to evaluate Situated Cellular Agent (SCA) approach within pedestrian dynamics research context. SCA is a modeling and simulation approach based on Multi Agent Systems principles that derives from Cellular Automata. In particular, we focus on two emerging phenomena (freezing by heating and lane formation phenomena) that have been empirically observed and already modeled by analytical particle–based models and Cellular Automata–based models.

- Cellular Automata | Pp. 360-369

Coarse-Grained Parallelization of Cellular-Automata Simulation Algorithms

Olga Bandman

Simulating spatial dynamics in physics by Cellular Automata (CA) requires very large computation power, and, hence, CA simulation algorithms are to be implemented on multiprocessors. The preconceived opinion, that no much effort is required to obtain highly efficient coarse grained parallel CA algorithm, is not always true. In fact, a great variety of CA modifications coming into practical use need appropriate, sometimes sophisticated, methods of CA algorithms parallel implementation. Proceeding from the above a general approach to CA parallelization, based on domain decomposition correctness conditions, is formulated. Starting from the correctness conditions particular parallelization methods are developed for the main classes of CA simulation models: synchronous CA with multi-cell updating rules, asynchronous probabilistic CA, and CA compositions. Examples and experimental results are given for each case.

- Cellular Automata | Pp. 370-384

Cellular Automata Models for Complex Matter

Dominique Désérable; Pascal Dupont; Mustapha Hellou; Siham Kamali-Bernard

Complex matter may lie in various forms from granular matter, soft matter, fluid-fluid or solid-fluid mixtures to compact heterogeneous material. Cellular automata models make a suitable and powerful tool to catch the influence of the microscopic scale onto the macroscopic behaviour of these complex systems. Rather than a survey, this paper will attempt to bring out the main concepts underlying these models and to give an insight for future work.

- Cellular Automata | Pp. 385-400

Hysteresis in Oscillatory Behaviour in CO Oxidation Reaction over Pd(110) Revealed by Asynchronous Cellular Automata Simulation

Vladimir Elokhin; Andrey Matveev; Vladimir Gorodetskii; Evgenii Latkin

The dynamic behaviour of the CO oxidation reaction over Pd(110) has been studied by means of probabilistic asynchronous cellular automata (Dynamic Monte-Carlo). The influence of the internal parameters on the shapes of surface concentration waves obtained in simulations under the limited surface diffusion intensity conditions has been studied. The hysteresis in oscillatory behaviour has been found under step-by-step variation of oxygen partial pressure. Two different oscillatory regimes could exist at one and the same parameters of the reaction. The parameters of oscillations (amplitude, period and the shape of spatio-temporal patterns on the surface) depend on the kinetic prehistory of the system. The possibility for the appearance of the cellular and turbulent patterns, spiral, target and stripe oxygen waves on the surface in the cases under study has been shown.

- Cellular Automata | Pp. 401-409