DATAFLOW BASED DISTRIBUTED COMPUTING METHODS. SYSTEM PROTOTYPE.

M.O. Bakhterev, P.A. Vasev

IMM UrB RAS, Yekaterinburg

The use of parallel and distributed computing systems entails significant costs associated with programming such systems. The problem is that the popular modern parallel programming tools, MPI and OpenMP, are quite complex to apply. The programmer has to take care of the distribution of computational tasks, synchronization, data exchange and so on.

Different methods are applied to simplify the programming and execution of parallel programs. On the one hand, universal tools for automatic program parallelization (both for shared memory and for multicomputer systems) are being developed. On the other hand, environments for solving particular classes of tasks (mostly applications with a high level of data parallelism) are being created. In addition, universal instruments that try to simplify the technical side of programming parallel and distributed systems are being developed.

Sometimes, when creating such tools, developers try to make use of nonstandard computation paradigms. One of them is Dataflow [1]. Variants of Dataflow are used in the design of processor architectures and of supercomputers as a whole, in the software organization of computational threads within a single process, and in interprocess communication in distributed environments.

In the present work the authors, relying on an analysis of different Dataflow models, including their own, are developing methods and tools for programming in parallel and distributed environments. The goal of this design is to simplify parallel program development without a significant loss in the execution efficiency of the program code.

The suggested methods of computation arose from long-term theoretical research on an operating system for distributed computations [2]. The methods are based on the concepts of storage, task and rule. Storage keeps named data, to which three operations can be applied: write (create), read and delete. The stored data are self-sufficient: they are not queues, but individual items of information with unique names. The term task denotes a program that, during its execution, reads data items with specific names from storage and, as a result of its own execution, forms new data, which are written to storage. The term rule designates a construction that defines the conditions and the parameters of a program launch. The rule contains:

1. The list of names of the data needed to execute the tasks.

2. The list of correspondences between the global names of data (held in storage) and the local names (used inside the program).

3. The list of tasks (programs) to be launched.

4. The actions performed if the tasks from list (3) execute successfully.

The rule is considered ready for execution when all data with names from list (1) are present in storage. After successful execution the rule is deleted from the list of rules being executed.
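The storage, rule and readiness condition described above can be illustrated with a minimal Python sketch. It is not the RIDE implementation: the names `Storage`, `Rule`, `needs`, `mapping`, `on_success` and `is_ready` are hypothetical, and the write-once check in `write` is an assumption derived from the statement that data items have unique names.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


class Storage:
    """Named data items supporting write (create), read and delete."""

    def __init__(self) -> None:
        self._items: Dict[str, Any] = {}

    def write(self, name: str, value: Any) -> None:
        # Assumption: a name can be created only once (items are not queues).
        if name in self._items:
            raise KeyError(f"data item {name!r} already exists")
        self._items[name] = value

    def read(self, name: str) -> Any:
        return self._items[name]

    def delete(self, name: str) -> None:
        del self._items[name]

    def has(self, name: str) -> bool:
        return name in self._items


@dataclass
class Rule:
    needs: List[str]                  # (1) names required to execute the tasks
    mapping: Dict[str, str]           # (2) global storage name -> local name
    tasks: List[Callable[..., Any]]   # (3) programs to launch
    on_success: List[Callable[[Storage], None]] = field(default_factory=list)  # (4)

    def is_ready(self, storage: Storage) -> bool:
        # Ready when every name from list (1) is present in storage.
        return all(storage.has(name) for name in self.needs)


storage = Storage()
storage.write("a", 2)
rule = Rule(needs=["a", "b"], mapping={"a": "x", "b": "y"}, tasks=[])
ready_before = rule.is_ready(storage)   # "b" is still missing
storage.write("b", 3)
ready_after = rule.is_ready(storage)    # both names are now present
```

In this sketch readiness depends only on the presence of names, not on the values stored under them, which matches the condition stated in the text.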

Programming and carrying out the calculations proceed as follows. The programmer prepares the programs of the tasks; in principle, any combination of programming languages and target hardware can be used within a single computational experiment. For instance, some tasks could be executed on graphics processors.

The programmer also prepares an initialization file, which describes the initial rules of the system. This rule set can be extended later, during task execution or when the closing actions of rules are performed. Besides the rules, the initialization file specifies initial data, which are placed in storage.
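The source does not specify the format of the initialization file, so the following Python sketch invents one purely for illustration: a JSON document with a `data` section (initial storage contents) and a `rules` section (initial rules). The field names, the task names `add` and `report`, and the helper `load_init` are all hypothetical.

```python
import json

# Hypothetical initialization description: initial data plus initial rules.
INIT_TEXT = """
{
  "data": {"a": 2, "b": 3},
  "rules": [
    {"needs": ["a", "b"], "mapping": {"a": "x", "b": "y"}, "tasks": ["add"]}
  ]
}
"""


def load_init(text):
    """Split an initialization description into initial storage and a rule list."""
    init = json.loads(text)
    storage = dict(init.get("data", {}))   # initial data are placed in storage
    rules = list(init.get("rules", []))    # initial rules; the list may grow later
    return storage, rules


storage, rules = load_init(INIT_TEXT)

# The rule set can be extended later, for example by a task or by a rule's
# closing action; here a second rule is simply appended by hand.
rules.append({"needs": ["sum"], "mapping": {"sum": "x"}, "tasks": ["report"]})
```

Keeping the rules in an ordinary mutable list is one simple way to model a rule set that grows during the computation.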

Next, the programmer issues the run command. At runtime the computing environment finds rules that are ready to execute and launches the tasks specified in them on suitable available resources. As a result, some rules complete, forming new data and releasing resources for the execution of other rules. The environment continues to search for and execute rules until the rule set is exhausted, the job is suspended from outside, or an error is detected.
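The search-and-execute cycle can be sketched as a small sequential loop in Python. This is a simplified illustration under stated assumptions, not the authors' runtime: storage is a plain dictionary, rules are dictionaries with hypothetical keys `needs`, `mapping` and `tasks`, tasks are ordinary functions that return new named data, and everything runs in one process without resource allocation or external suspension. An error in a task simply propagates as an exception.

```python
from typing import Any, Dict, List


def run(storage: Dict[str, Any], rules: List[dict]) -> str:
    """Execute ready rules until the rule set is exhausted or nothing can run."""
    pending = list(rules)
    while pending:
        # A rule is ready when all the names it needs are present in storage.
        ready = [r for r in pending if all(n in storage for n in r["needs"])]
        if not ready:
            return "stalled"   # remaining rules wait for data that never appears
        for rule in ready:
            # Translate global storage names into the task's local names.
            local_args = {local: storage[glob]
                          for glob, local in rule["mapping"].items()}
            for task in rule["tasks"]:
                storage.update(task(**local_args))   # new data go to storage
            pending.remove(rule)   # a successfully executed rule is deleted
    return "exhausted"


def add(x, y):
    return {"sum": x + y}


def square(v):
    return {"result": v * v}


storage = {"a": 2, "b": 3}
rules = [
    {"needs": ["sum"], "mapping": {"sum": "v"}, "tasks": [square]},   # waits for "sum"
    {"needs": ["a", "b"], "mapping": {"a": "x", "b": "y"}, "tasks": [add]},
]
status = run(storage, rules)
```

The order in which the rules are listed does not matter here: the second rule runs first because its data are available, and its output makes the first rule ready on the next pass.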

The described methods offer a number of valuable capabilities: carrying out a computational experiment on hybrid architectures, changing the number of computing nodes at runtime, supporting applications in globally distributed environments, creating checkpoints automatically, suspending and resuming a computation in a manner transparent to the programmer, using distributed data storage, and so on.

The authors are developing a prototype of a system based on the suggested methods: the RIDE project. Early sample programs show the feasibility of the system and the elegance of the code used to describe rules. The authors believe that further development of this research will achieve the main goal: to make the development of distributed parallel programs simpler and more effective.

References

1. Dennis J. Data Flow Supercomputers // Computer, Vol. 13, No. 11, pp. 48-56, 1980.

2. Bakhterev M.O. The description of parallel computations in the terms of closures // 10th International Workshop "Supercomputing and Mathematical Simulations", RFNC-VNIIEF, Sarov, pp. 31-32, 2008.


 
