Catálogo de publicaciones - libros
Parallel Computing Technologies: 9th International Conference, PaCT 2007, Pereslavl-Zalessky, Russia, September 3-7, 2007. Proceedings
Victor Malyshkin (eds.)
En conferencia: 9º International Conference on Parallel Computing Technologies (PaCT) . Pereslavl-Zalessky, Russia . September 3, 2007 - September 7, 2007
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Programming Techniques; Computer System Implementation; Software Engineering/Programming and Operating Systems; Computer Systems Organization and Communication Networks; Computation by Abstract Devices; Algorithm Analysis and Problem Complexity
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2007 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-73939-5
ISBN electrónico
978-3-540-73940-1
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2007
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2007
Cobertura temática
Tabla de contenidos
Address-Free All-to-All Routing in Sparse Torus
Risto Honkanen; Ville Leppänen; Martti Penttonen
In this work we present a simple network design for all-to-all routing and study deflection routing on it. We present a time-scheduled routing algorithm where packets are routed address-free. We show that a total exchange relation, where every processor has a packet to route to every other processor, can be routed with routing cost of 1/2 + (1) time units per packet.
The network consists of an -sided -dimensional torus, where the processor (or input/output) nodes are sparsely but regularly situated among − deflection routing nodes, having input and output links. The finite-state routing nodes change their states by a fixed, preprogrammed pattern.
- Techniques for Parallel Programming Supporting | Pp. 200-205
On the Parallel Technologies of Conjugate and Semi-conjugate Gradient Methods for Solving Very Large Sparse SLAEs
Valery P. Ilin; Dasha V. Knysh
The parallel technologies of iterative solving the symmetric and nonsymmetric systems of linear algebraic equations (SLAEs) with very large sparse matrices by means of conjugate and semi-conjugate gradient iterative methods are described. The performance computing for various matrix formats (diagonal, compressed sparse row/column), at the different degrees of freedom of SLAEs, are analysed. The results of experimental measurements under OPENMP, MPI and hybrid systems are presented and discussed.
- Techniques for Parallel Programming Supporting | Pp. 206-214
TRES-CORE: Content-Based Retrieval Based on the Balanced Tree in Peer to Peer Systems
Hai Jin; Jie Xu
Most existing Peer to Peer (P2P) systems support name-based retrieval and have provided very limited support for the full-text search of document contents. In this paper, we present a scheme (TRES-CORE) to support content-based retrieval. First, we propose a tree structure to organize data objects in vector-format in the P2P system, which is height-balanced so that the time complexity of search can be decreased. Second, we give a simple strategy for the placement of tree’s nodes, which can guarantee both load balancing and fault tolerance. Then an efficient policy for the query is given. Besides theoretical analysis that can prove the correctness of our scheme, a simulation-based study is carried out to evaluate its performance under various scenarios finally. In this study, it shows that using this content-based retrieval scheme (TRES-CORE) is more accurate and more efficient than some other schemes in the P2P system.
- Techniques for Parallel Programming Supporting | Pp. 215-229
Efficient Race Verification for Debugging Programs with OpenMP Directives
Young-Joo Kim; Mun-Hye Kang; Ok-Kyoon Ha; Yong-Kee Jun
Races must be detected for debugging parallel programs with OpenMP directives because they may cause unintended nondeterministic results of programs. The previous tool that detects races does not verify the existence of races in programs with no internal nondeterminism because the tool regards nested sibling threads as ordered threads and has the possibility of ignoring accesses involved in races in program models with synchronization such as critical section. This paper suggests an efficient tool that verifies the existence of races with optimal performance by applying race detection engines for labeling and detection protocol. The labeling scheme generates a unique identifier for each parallel thread created during a program execution, and the protocol scheme detects at least one race if any. This tool verifies the existence of races over 250 times faster in average than the previous tool even in the case that the maximum parallelism increases with the fixed number of total accesses using a set of synthetic programs without synchronization such as critical section.
- Techniques for Parallel Programming Supporting | Pp. 230-239
Adaptive Scheduling and Resource Assessment in GRID
Veniamin Krasnotcshekov; Alexander Vakhitov
The problems of scheduling computations in GRID and optimal usage of GRID resources from client side are considered. The general cost functional for GRID scheduling is defined. The cost function is then used to define some scheduling policy based on Simutaneous Perturbation Stochastic Optimization Algorithm, which is used because of it’s fast convergence in multidimensional noisy systems. The technique proposed is being implemented for brokering in GPE4GTK environment to compare it with other techniques.
- Techniques for Parallel Programming Supporting | Pp. 240-244
Dynamic Load Balancing of Applications with a Resource Selection Mechanism on Heterogeneous Resources of the Grid
Valeria V. Krzhizhanovskaya; Vladimir V. Korkhov
In this paper we address the critical issues of efficient resource management and high-performance parallel distributed computing on the Grid by introducing a new hierarchical approach that combines a user-level job scheduling with a dynamic load balancing technique that automatically adapts a distributed or parallel application to the heterogeneous resources. The algorithm developed dynamically selects the resources best suited for a particular task or parallel process of the executed application, and optimizes the load balance based on the dynamically measured resource parameters and estimated requirements of the application. We describe the proposed algorithm for automated load balancing, paying attention to the influence of resource heterogeneity metrics, demonstrate the speedup achieved with this technique for different types of applications and resources, and propose a way to extend the approach to a wider class of applications.
- Techniques for Parallel Programming Supporting | Pp. 245-260
A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors
Alexey Lastovetsky; Ravi Reddy
. In this paper, we present a novel algorithm of optimal matrix partitioning for parallel dense matrix factorization on heterogeneous processors based on their constant performance model. We prove the correctness of the algorithm and estimate its complexity. We demonstrate that this algorithm better suits extensions to more complicated, non-constant, performance models of heterogeneous processors than traditional algorithms.
- Techniques for Parallel Programming Supporting | Pp. 261-275
Parallel Pseudorandom Number Generator for Large-Scale Monte Carlo Simulations
Mikhail Marchenko
A parallel random number generator is given to perform large-scale distributed Monte Carlo simulations. The generator’s quality was verified using statistically rigorous tests. Also special problems with known solutions were used for the testing. The description of program system MONC for large-scale distributed Monte Carlo simulations is also given.
- Techniques for Parallel Programming Supporting | Pp. 276-282
Dynamic Job Scheduling on the Grid Environment Using the Great Deluge Algorithm
Paul McMullan; Barry McCollum
The utilization of the computational Grid processor network has become a common method for researchers and scientists without access to local processor clusters to avail of the benefits of parallel processing for compute-intensive applications. As a result, this demand requires effective and efficient dynamic allocation of available resources. Although static scheduling and allocation techniques have proved effective, the dynamic nature of the Grid requires innovative techniques for reacting to change and maintaining stability for users. The dynamic scheduling process requires quite powerful optimization techniques, which can themselves lack the performance required in reaction time for achieving an effective schedule solution. Often there is a trade-off between solution quality and speed in achieving a solution. This paper presents an extension of a technique used in optimization and scheduling which can provide the means of achieving this balance and improves on similar approaches currently published.
- Techniques for Parallel Programming Supporting | Pp. 283-292
Parallelism Granules Aggregation with the T-System
Alexander Moskovsky; Vladimir Roganov; Sergei Abramov
T-system is a tool for parallel computing developed at the PSI RAS. The most recent implementation is available on both Linux and Windows platforms. The paper is dedicated to one of important T-system aspects — ability to change parallelism granule size at runtime. The technique is available, primarily, for recursive programs, but it’s possible to extent it to non-recursive ones as well. In the latter case, we employ C++ template“traits”for program transformation. The technique is shown to reduce overhead incurred by runtime support library dramatically.
- Techniques for Parallel Programming Supporting | Pp. 293-302