Научная статья

Информатика, кибернетика и программирование

Nowadays the development of CPU with new instruction set architecture (ISA) implies that translators to this ISA assembler from the certain set of high level programming languages should be developed too. If the basic principles of the ISA are close enough to the long time used traditional CISC



24.5 KB

1 чел.


M.O. Bakhterev

The Krasovski Institute for Mathematics and Mechanics of UrB RAS, Yekaterinburg

Nowadays the development of CPU with new instruction set architecture (ISA) implies that translators to this ISA assembler from the certain set of high level programming languages should be developed too. If the basic principles of the ISA are close enough to the long time used traditional CISC, RISC and VLIW systems, then the translators development could be simplified by using existing mature software packages such as LLVM and GCC. But if the ISA diverges from traditional ones, the usage of these packages may become inconveniently hard.

The wide known translation systems suggest abstractions for target CPU ISA description which imply that after write of the value V to the register, referred in command with the name N, all the next reads from N till the next write to N will issue the same value V.

This mirror one of the basic ideas for CISC, RISC and VLIW CPUs -- an idea about strict sequential modification of the computer's memory (register set and system RAM) state, through which the information is communicated between computation steps described by instructions. Keeping such an order of computation significantly limit the efficiency of traditional architecture solutions, in the framework of which the high information processing speeds may be achieved either by significant amounts of additional hardware and increase in power consumption (CPUs with out of order execution), or by limiting the class of efficiently executed programs (GPGPUs which could be viewed as vector RISC-machines).

However, as the experience of the multicelluar processor developmen shows, it is possible to maintain high speed in program execution without large amount of additional hardware and without limiting the class of effectively executed programs. To achieve that it is needed to abandon the semantic of sequential instruction execution with mutable state of system RAM and registers threading. But such an abandonment makes utilization of traditional translation systems impossible and frequently leads to refusal from research works on new architectural ideas because of high costs of translation systems development. But the multicellular processor architecture, beside increase in code execution efficiency, possesses the set of another important practical features such as program execution continuation under some of the functional units hardware failures and runtime functional units configuration optimization for certain tasks with the ability to shutdown some of the units for power saving. These features are quite attractive to initiate new translation system development.

The first of the most difficult tasks that should be solved due this development is the task of the source code translation into the intermediate form convenient for further processing. Such convenient form for multicellular processors is program representation as control flow dag in witch the nodes are linear blocks described with data flow dags.  

The difficulty of building such program representation by abstract syntax tree emerges from that for every tree node there are many variants of linking dags build for child subtrees into larger structure. And that is complicated by dependency of linking variants in the node on some tree level on results of linking in the nodes on upper and lower tree levels.

To construct links correctly it is necessary to analyze all that variants and when programming in traditional languages to describe that analysis by lengthy multilevel pattern matching structures. And this is most labor-intensive part of work on translator into intermediate representation, demanding from highly skilled programmers tiring attention to details.

However, some time ago we've formalized one approach to distributed computational processes description in the form of dynamically constructed data flow graphs. The approach was intended to solve the problem of managing complex crossreferences in data processed by parallel code. One of the our approach features allowing to lower the complexity of parallel code development is an ability to progressively build computation structure in some namespace by the named events of new results generation emerging during that computation. This feature, as it was expected, had to simplify both tasks of the distributed computation decomposition into independent parts and the following composition of their results in order to get overall result. In its turn the simplification of these tasks had to lower labor-intensity of parallel code development. We have tested our approach in the coding of some parallel algorithms, and it has payed our expectations off.

The problems emerging when constructing the dag in some node of abstract syntax tree from child subtrees dags are similar to those emerging in the composition and decomposition of distributed computation processes. It may be that such similarity has deep strict mathematical reason, since the structures of the abstractions of communicating sequential processes and of recursive mu-types are quite similar.

Having been following that similarity we have developed the LiME translation system based on the program graph dynamic construction with the same principles as those in the basis of the RiDE system for distributed processes programming. The graph is built by the process of the forms application which are similar to the RiDE rules. These applications are activated by the events of inputs availability. During the form application new arcs leading from existing graph through the inputs to the new graph part described by the form being activated are built. After form activation new outputs pointing to the nodes of grown graph and new forms potentially ready for further activation appear. This process closely mirrors the process of data flow graph dynamic construction in the RiDE system. And we suggest for such method of program graph construction the name event-driven translation.   

We used our system in practice in the development of translator from C99 language to the assembler of multicellular processors. This experience has shown that description of the program graph construction process as the set of forms activated by events is in some sense simplier than encoding similar functionality in one of traditional programming languages. We can refer here to the following comparison. The main translation cycle of one of the most compact C99 translators -- Tiny C Compiler -- is encoded in 6200 lines of C99 code (not counting auxiliary functions). We have managed to implement comparable cycle in just 2500 lines of code for LiME system.

Thus we show in this work that principles in foundation of the RiDE distributed programming system are quite universal, powerful and adequate for description even of such complex interlinks between computational procedures as arise on some stages of program compilation. Besides that we were able to refine and develop the RiDE formal model itself making it more suitable for description of data flows with extreme complex structures.

Работа выполнена при поддержке Программы фундаментальных исследований УрО РАН “Информационные, управляющие и интеллектуальные технологии и системы”, проект 12-П-1-1034.


А также другие работы, которые могут Вас заинтересовать

36732. Получить сигнал пилообразной формы и исследовать влияние фазы гармоник 85.01 KB
  Для решения воспользуемся формулой: где: Un напряжение k текущий номер гармоник n число гармоник w круговая частота t время Ход работы: Собираем установку.
  Структура таблицы СЕССИЯ Признак ключа Имя поля Тип поля Формат поля Размер поля Ключевое Номер Текстовое 5 Оценка 1 Числовое Фиксированный Длинное целое Оценка 2 Числовое Фиксированный Длинное целое Оценка 3 Числовое Фиксированный Длинное целое Оценка 4 Числовое Фиксированный Длинное целое Результат Текстовое 3 Таблица 2. Структура таблицы СТИПЕНДИЯ Признак ключа Имя поля Тип поля Формат поля Размер поля Ключевое Результат Текстовое 3 Процент Числовое Процентный Одинарное с плавающей точкой Заполните вновь созданные таблицы...
36734. Задания для лабораторных работ по дисциплине Информатика 892.5 KB
  Найти сумму элементов среди элементов расположенных на главной диагонали матрицы А.Найти максимальный элемент в 3ей строке матрицы 3.Поменять местами найденный максимальный элемент и значение S.Найти новый одномерный массив В из минимальных элементов каждой строки матрицы А.
36735. Методы тестирования аппаратных средств ПК 60.79 KB
  Включить компьютер и запустив программу SIW и используя ветвь Hrdwre произвести определение основных параметров следующих элементов ПК: Рабочую частоту ЦП размер кэшпамяти всех уровней Версию BIOS и его размер Параметры МВ тип чипсета тип сокета ЦП количество слотов шин расширения и памяти Объем тип используемой памяти ее производитель рабочая частота памяти максимальная емкость устанавливаемой памяти. Дату создания BIOS Тип BIOS Идентификационный номер BIOS Тип CPU и его рабочие частоты Размер BIOS ROM Установленный и максимально...
36736. Методика поиска неисправностей элементов БП ПК 120.33 KB
  Правильность работы схемы выработки сигнала PG. Порядок выполнения работы: Методика проверки работы сетевого выпрямителя и фильтра. Ознакомится с признаками исправной работы сетевого выпрямителя и фильтра для этого загрузить модель выпрямителя lb_3_1.
36737. Методика поиска неисправностей элементов Блока питания ПК 960.96 KB
  Неисправные элементы Измеренные значения Напряжение U1 В Потребляемый ток I1 Пульсация выпрямленного напряжения mВ С1 пробой 3.77 mV С1С2пробой 0 342. Переключив К2 в положение низ Последовательно введя неисправность конденсатора C1 и транзисторов VT2VT3 Shot пробой Open обрыв проанализировать поведение схемы и характер вырабатываемого сигнала PG зарисовать полученные осциллограммы для каждого случая. C1 Shot пробой C1 Open обрыв VT2 Shot пробой VT2 Open обрыв VT3 Shot пробой VT3 Open обрыв.
36738. Работа с таблицами в текстовом процессоре Word 478 KB
  Каждая ячейка таблицы может содержать текстовую числовую или графическую информацию. Можно производить любые манипуляции с данными таблицы например вставлять и удалять форматировать данные не нарушая при этом расположения столбцов. Создание таблицы Таблицу в текстовом редакторе Word можно создать двумя способами: В меню Таблица выбрать команду Вставить Таблица рис. Диалоговое окно Вставка таблицы Рис.
36740. Острое гнойное воспаление среднего уха у детей. Характер нарушения слуха по камертональным и аудиометрическим данным 15.37 KB
  Одно из наиболее частых заболеваний детей младшего возраста. В результате «незрелости» иммунных систем и отсутствия способности к ограничению воспалительного процесса острый средний отит у детей этого возраста характеризуется тяжелыми общими нарушениями.