Научная статья

Информатика, кибернетика и программирование

Nowadays the development of CPU with new instruction set architecture (ISA) implies that translators to this ISA assembler from the certain set of high level programming languages should be developed too. If the basic principles of the ISA are close enough to the long time used traditional CISC



24.5 KB

1 чел.


M.O. Bakhterev

The Krasovski Institute for Mathematics and Mechanics of UrB RAS, Yekaterinburg

Nowadays the development of CPU with new instruction set architecture (ISA) implies that translators to this ISA assembler from the certain set of high level programming languages should be developed too. If the basic principles of the ISA are close enough to the long time used traditional CISC, RISC and VLIW systems, then the translators development could be simplified by using existing mature software packages such as LLVM and GCC. But if the ISA diverges from traditional ones, the usage of these packages may become inconveniently hard.

The wide known translation systems suggest abstractions for target CPU ISA description which imply that after write of the value V to the register, referred in command with the name N, all the next reads from N till the next write to N will issue the same value V.

This mirror one of the basic ideas for CISC, RISC and VLIW CPUs -- an idea about strict sequential modification of the computer's memory (register set and system RAM) state, through which the information is communicated between computation steps described by instructions. Keeping such an order of computation significantly limit the efficiency of traditional architecture solutions, in the framework of which the high information processing speeds may be achieved either by significant amounts of additional hardware and increase in power consumption (CPUs with out of order execution), or by limiting the class of efficiently executed programs (GPGPUs which could be viewed as vector RISC-machines).

However, as the experience of the multicelluar processor developmen shows, it is possible to maintain high speed in program execution without large amount of additional hardware and without limiting the class of effectively executed programs. To achieve that it is needed to abandon the semantic of sequential instruction execution with mutable state of system RAM and registers threading. But such an abandonment makes utilization of traditional translation systems impossible and frequently leads to refusal from research works on new architectural ideas because of high costs of translation systems development. But the multicellular processor architecture, beside increase in code execution efficiency, possesses the set of another important practical features such as program execution continuation under some of the functional units hardware failures and runtime functional units configuration optimization for certain tasks with the ability to shutdown some of the units for power saving. These features are quite attractive to initiate new translation system development.

The first of the most difficult tasks that should be solved due this development is the task of the source code translation into the intermediate form convenient for further processing. Such convenient form for multicellular processors is program representation as control flow dag in witch the nodes are linear blocks described with data flow dags.  

The difficulty of building such program representation by abstract syntax tree emerges from that for every tree node there are many variants of linking dags build for child subtrees into larger structure. And that is complicated by dependency of linking variants in the node on some tree level on results of linking in the nodes on upper and lower tree levels.

To construct links correctly it is necessary to analyze all that variants and when programming in traditional languages to describe that analysis by lengthy multilevel pattern matching structures. And this is most labor-intensive part of work on translator into intermediate representation, demanding from highly skilled programmers tiring attention to details.

However, some time ago we've formalized one approach to distributed computational processes description in the form of dynamically constructed data flow graphs. The approach was intended to solve the problem of managing complex crossreferences in data processed by parallel code. One of the our approach features allowing to lower the complexity of parallel code development is an ability to progressively build computation structure in some namespace by the named events of new results generation emerging during that computation. This feature, as it was expected, had to simplify both tasks of the distributed computation decomposition into independent parts and the following composition of their results in order to get overall result. In its turn the simplification of these tasks had to lower labor-intensity of parallel code development. We have tested our approach in the coding of some parallel algorithms, and it has payed our expectations off.

The problems emerging when constructing the dag in some node of abstract syntax tree from child subtrees dags are similar to those emerging in the composition and decomposition of distributed computation processes. It may be that such similarity has deep strict mathematical reason, since the structures of the abstractions of communicating sequential processes and of recursive mu-types are quite similar.

Having been following that similarity we have developed the LiME translation system based on the program graph dynamic construction with the same principles as those in the basis of the RiDE system for distributed processes programming. The graph is built by the process of the forms application which are similar to the RiDE rules. These applications are activated by the events of inputs availability. During the form application new arcs leading from existing graph through the inputs to the new graph part described by the form being activated are built. After form activation new outputs pointing to the nodes of grown graph and new forms potentially ready for further activation appear. This process closely mirrors the process of data flow graph dynamic construction in the RiDE system. And we suggest for such method of program graph construction the name event-driven translation.   

We used our system in practice in the development of translator from C99 language to the assembler of multicellular processors. This experience has shown that description of the program graph construction process as the set of forms activated by events is in some sense simplier than encoding similar functionality in one of traditional programming languages. We can refer here to the following comparison. The main translation cycle of one of the most compact C99 translators -- Tiny C Compiler -- is encoded in 6200 lines of C99 code (not counting auxiliary functions). We have managed to implement comparable cycle in just 2500 lines of code for LiME system.

Thus we show in this work that principles in foundation of the RiDE distributed programming system are quite universal, powerful and adequate for description even of such complex interlinks between computational procedures as arise on some stages of program compilation. Besides that we were able to refine and develop the RiDE formal model itself making it more suitable for description of data flows with extreme complex structures.

Работа выполнена при поддержке Программы фундаментальных исследований УрО РАН “Информационные, управляющие и интеллектуальные технологии и системы”, проект 12-П-1-1034.


А также другие работы, которые могут Вас заинтересовать

13917. Съемочные камеры 1.35 MB
  Урок 12. Съемочные камеры В max имеется два типа камер: нацеленная Target камера характеризуется точкой съемки eye position в которой помещается сама камера и точкой нацеливания или мишенью target point то есть точкой в трехмерном пространстве на которую направлена ли
13918. Работа с диапазонами. Относительная адресация 33.88 KB
  Тема: Работа с диапазонами. Относительная адресация Тип урока: комбинированный Цели: Сформировать представление у учащихся о понятии диапазона сортировке таблицы; Познакомить с функциями обработки диапазона принципами относительной адресации; Выработка
13919. Права ребенка 70 KB
  Цель: 1. Обобщить знания учащихся об основных правах ребенка. 2. Развести понятия €œправо€ €œправа€ €œобязанности€ показать единство прав и обязанностей для детейподростков. 3. Повышать социальноправовую компетентность старшеклассников. 4. Воспитыват
13920. Защита прав потреителей 45 KB
  Цели урока: формировать у школьников рациональное потребительское поведение; развать интуицию самостоятельность гибкость мышления; научить отстаивать права потребителя; показать знания основ экономической теории; повторить статьи Закона €œО з...
13921. Технические средства мультимедиа 226 KB
  Тема урока: Технические средства мультимедиа. Цели урока: способствовать формированию у обучающихся устойчивых представлений по основным понятиям темы. развивать навыки работы учащихся на компьютере. Оборудование: мультимедийная презентация интерактивная ...
13922. Основы работы с CorelDRAW. Графические примитивы 328.5 KB
  сновы работы с CorelDRAW. Графические примитивы 1. Растровая и векторная графика. Понятие объекта в CorelDRAW CorelDRAW Х4 в настоящее время один из самых мощных редакторов векторной графики который широко используется в издательской деятельности полиграфии и рекл
13923. Клод Дебюсси «В лодке» 3 MB
  Урок 1. Клод Дебюсси В лодке Реквизит: Все музыкальные файлы урока. Все распечатанные материалы урока. Детский музыкальный инструмент – металлофон. Таз для воды и бумажные кораблики. Краски кисточка для раскрашивания. Клей ножницы для аппли
13924. Семьи Японии 140.7 KB
  Семьи Японии. Япония страна загадочная. Хранение традиций воспитание собственного достоинства склонность к созерцанию непонятная европейцу манят и завораживают. История страны не могла не отразиться на жизни японцев и на том как они строят семейные отношения. В жи...
13925. GLAD TO MEET YOU 472.5 KB
  GLAD TO MEET YOU Unit 1 1A Meeting people. Read the dialogues and guess who is speaking and where they are. Good afternoon Miss Bright Good afternoon Mr. Johnson Hows life No complaints. Thank you. How are things with you Nothing to boast of. I have loads of work again. Sorry to hear that Hi Mike Hey Jack Nice tracksuit Oh thank you. I jog in it every morning. H...