Invited Talks.- Orchestrating Computations on the World-Wide Web.- Realistic Rendering in Real-Time.- Non-massive, Non-high Performance, Distributed Computing: Selected Issues.- The Forgotten Factor: Facts on Performance Evaluation and Its Dependence on Workloads.- Sensor Networks — Promise and Challenges.- Concepts and Technologies for a Worldwide Grid Infrastructure.- Topic 1.- Support Tools and Environments.- SCALEA: A Performance Analysis Tool for Distributed and Parallel Programs.- Deep Start: A Hybrid Strategy for Automated Performance Problem Searches.- On the Scalability of Tracing Mechanisms.- Component Based Problem Solving Environment.- Integrating Temporal Assertions into a Parallel Debugger.- Low-Cost Hybrid Internal Clock Synchronization Mechanism for COTS PC Cluster.- .NET as a Platform for Implementing Concurrent Objects.- Topic 2.- Performance Evaluation, Analysis and Optimization.- Performance of MP3D on the SB-PRAM Prototype.- Multi-periodic Process Networks: Prototyping and Verifying Stream-Processing Systems.- Symbolic Cost Estimation of Parallel Applications.- Performance Modeling and Interpretive Simulation of PIM Architectures and Applications.- Extended Overhead Analysis for OpenMP.- CATCH — A Call-Graph Based Automatic Tool for Capture of Hardware Performance Metrics for MPI and OpenMP Applications.- SIP: Performance Tuning through Source Code Interdependence.- Topic 3.- Scheduling and Load Balancing.- On Scheduling Task-Graphs to LogP-Machines with Disturbances.- Optimal Scheduling Algorithms for Communication Constrained Parallel Processing.- Job Scheduling for the BlueGene/L System.- An Automatic Scheduler for Parallel Machines.- Non-approximability Results for the Hierarchical Communication Problem with a Bounded Number of Clusters.- Non-approximability of the Bulk Synchronous Task Scheduling Problem.- Adjusting Time Slices to Apply Coscheduling Techniques in a Non-dedicated NOW.- A Semi-dynamic Multiprocessor Scheduling Algorithm with 
an Asymptotically Optimal Competitive Ratio.- AMEEDA: A General-Purpose Mapping Tool for Parallel Applications on Dedicated Clusters.- Topic 4.- Compilers for High Performance (Compilation and Parallelization Techniques).- Tiling and Memory Reuse for Sequences of Nested Loops.- Reuse Distance-Based Cache Hint Selection.- Improving Locality in the Parallelization of Doacross Loops.- Is Morton Layout Competitive for Large Two-Dimensional Arrays?.- Towards Detection of Coarse-Grain Loop-Level Parallelism in Irregular Computations.- On the Optimality of Feautrier’s Scheduling Algorithm.- On the Equivalence of Two Systems of Affine Recurrence Equations.- Towards High-Level Specification, Synthesis, and Virtualization of Programmable Logic Designs.- Topic 5.- Parallel and Distributed Databases, Data Mining and Knowledge Discovery.- Dynamic Query Scheduling in Parallel Data Warehouses.- Speeding Up Navigational Requests in a Parallel Object Database System.- Retrieval of Multispectral Satellite Imagery on Cluster Architectures.- Shared Memory Parallelization of Decision Tree Construction Using a General Data Mining Middleware.- Characterizing the Scalability of Decision-Support Workloads on Clusters and SMP Systems.- Parallel Fuzzy c-Means Clustering for Large Data Sets.- Scheduling High Performance Data Mining Tasks on a Data Grid Environment.- A Delayed-Initiation Risk-Free Multiversion Temporally Correct Algorithm.- Topic 6.- Complexity Theory and Algorithms.- Parallel Convex Hull Computation by Generalised Regular Sampling.- Parallel Algorithms for Fast Fourier Transformation Using PowerList, ParList and PList Theories.- A Branch and Bound Algorithm for Capacitated Minimum Spanning Tree Problem.- Topic 7.- Applications on High Performance Computers.- Perfect Load Balancing for Demand-Driven Parallel Ray Tracing.- Parallel Controlled Conspiracy Number Search.- A Parallel Solution in Texture Analysis Employing a Massively Parallel Processor.- Stochastic Simulation of 
a Marine Host-Parasite System Using a Hybrid MPI/OpenMP Programming.- Optimization of Fire Propagation Model Inputs: A Grand Challenge Application on Metacomputers.- Parallel Numerical Solution of the Boltzmann Equation for Atomic Layer Deposition.- Topic 8.- Parallel Computer Architecture and Instruction-Level Parallelism.- Independent Hashing as Confidence Mechanism for Value Predictors in Microprocessors.- Exploiting the Prefetching Effect Provided by Executing Mispredicted Load Instructions.- Increasing Instruction-Level Parallelism with Instruction Precomputation.- Runtime Association of Software Prefetch Control to Memory Access Instructions.- Realizing High IPC Using Time-Tagged Resource-Flow Computing.- A Register File Architecture and Compilation Scheme for Clustered ILP Processors.- A Comparative Study of Redundancy in Trace Caches.- Speeding Up Target Address Generation Using a Self-indexed FTB.- Real PRAM Programming.- In-memory Parallelism for Database Workloads.- Enforcing Cache Coherence at Data Sharing Boundaries without Global Control: A Hardware-Software Approach.- CODACS Project: A Demand-Data Driven Reconfigurable Architecture.- Topic 9.- Distributed Systems and Algorithms.- A Self-stabilizing Token-Based k-out-of-l Exclusion Algorithm.- An Algorithm for Ensuring Fairness and Liveness in Non-deterministic Systems Based on Multiparty Interactions.- On Obtaining Global Information in a Peer-to-Peer Fully Distributed Environment.- A Fault-Tolerant Sequencer for Timed Asynchronous Systems.- Dynamic Resource Management in a Cluster for High-Availability.- Progressive Introduction of Security in Remote-Write Communications with no Performance Sacrifice.- Parasite: Distributing Processing Using Java Applets.- Topic 10.- Parallel Programming: Models, Methods and Programming Languages.- Improving Reactivity to I/O Events in Multithreaded Environments Using a Uniform, Scheduler-Centric API.- An Overview of Systematic Development of Parallel Systems for 
Reconfigurable Hardware.- A Skeleton Library.- Optimising Shared Reduction Variables in MPI Programs.- Double-Scan: Introducing and Implementing a New Data-Parallel Skeleton.- Scheduling vs Communication in PELCR.- Exception Handling during Asynchronous Method Invocation.- Designing Scalable Object Oriented Parallel Applications.- Delayed Evaluation, Self-optimising Software Components as a Programming Model.- Topic 11.- Numerical Algorithms.- New Parallel (Rank-Revealing) QR Factorization Algorithms.- Solving Large Sparse Lyapunov Equations on Parallel Computers.- A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCs.- Sources of Parallel Inefficiency for Incompressible CFD Simulations.- Parallel Iterative Methods for Navier-Stokes Equations and Application to Stability Assessment.- A Modular Design for a Parallel Multifrontal Mesh Generator.- Pipelining for Locality Improvement in RK Methods.- Topic 12.- Routing and Communication in Interconnection Networks.- On Multicasting with Minimum Costs for the Internet Topology.- Stepwise Optimizations of UDP/IP on a Gigabit Network.- Stabilizing Inter-domain Routing in the Internet.- Performance Analysis of Code Coupling on Long Distance High Bandwidth Network.- Adaptive Path-Based Multicast on Wormhole-Routed Hypercubes.- A Mixed Deflection and Convergence Routing Algorithm: Design and Performance.- Evaluation of Routing Algorithms for InfiniBand Networks.- Congestion Control Based on Transmission Times.- A Dual-LAN Topology with the Dual-Path Ethernet Module.- A Fast Barrier Synchronization Protocol for Broadcast Networks Based on a Dynamic Access Control.- The Hierarchical Factor Algorithm for All-to-All Communication.- Topic 13.- Architectures and Algorithms for Multimedia Applications.- Deterministic Scheduling of CBR and VBR Media Flows on Parallel Media Servers.- Double P-Tree: A Distributed Architecture for Large-Scale Video-on-Demand.- Message Passing in XML-Based Language for Creating Multimedia 
Presenta