Context Navigation

Changes between Version 15 and Version 16 of DsxDocumentation

Timestamp:: Jan 30, 2008, 10:32:42 AM (16 years ago)
Author:: Nicolas Pouillon
Comment:: Cosmetics, typos and generators

Legend:

: Unmodified
: Added
: Removed
: Modified

DsxDocumentation

-                      v15
+                      v16
 == A) Goals and general principles ==
 DSX stands for ''Design Space eXplorer''. It helps the system designer to map a multi-threaded software application
+DSX stands for ''Design Space Explorer''. It helps the system designer to map a multi-threaded software application
 on a multi-processor hardware architecture (MP-SoC) modeled with the SoCLib components.
 …
 A specific goal of DSX is to allow the system designer to control not only the placement of the
 tasks on the processors, but  the placement of the software objects (execution stacks,
+tasks on the processors, but also the placement of the software objects (execution stacks,
 communication buffers, synchronization locks, etc.) on the memory banks. In shared memory multi-processors
 architectures with several physically distributed memory banks, such control is mandatory to optimize
 both the performances and the power consumption.
 The two targeted application domains are the telecommunication applications (where the tasks are handling packets or packet descriptors), and multi-media applications (where the tasks are handling audio or video streams).
 The general principles of the DSX tool are the following:
+The two targeted application domains are the telecommunication applications (where the tasks are handling packets or packet descriptors), and multimedia applications (where the tasks are handling audio or video streams).
+The general principles of DSX are the following:
  * The coarse grain parallelism of the software application must be statically defined as  a '''Task & Communications Graph (TCG)'''. The number of tasks, and the communication channels between tasks should not change during execution.
  * The software tasks are supposed to be written in C or C++, but - for portability reasons - the tasks must use an abstract '''System Resource Layer (SRL)''' API to access the communication and synchronizations resources.
  * Each task in the TCG can be implemented as a '''software task''' (software running on an embedded processor), or can be implemented as an '''hardware task''', (running as a dedicated hardware coprocessor).
  * DSX allows the programmer to use unprotected shared memory spaces, but the prefered inter-tasks communication mechanism use the '''MWMR middleware'''. The MWMR (Multi-Writer, Multi-Reader)communication  channels, are implemented as software FIFOs and can be shared by ''software tasks'', and by ''hardware tasks''.
+ * Each task in the TCG can be implemented as a '''software task''' (software running on an embedded processor), or can be implemented as an '''hardware task''' (running as a dedicated hardware coprocessor).
+ * DSX allows the programmer to use unprotected shared memory regions, but the prefered inter-tasks communication mechanism use the '''MWMR middleware'''. The MWMR (Multi-Writer, Multi-Reader) communication  channels, are implemented as software FIFOs and can be shared by ''software tasks'' and by ''hardware tasks''.
  * DSX provides classical synchronization mechanisms such as '''barriers''' and '''locks''', but inter-task synchronisation is mainly done through the data availability in the MWMR channels.
  * The target hardware architecture is a '''shared memory multi-processor system on chip''' (MP-SoC) using the SoCLib library of IP cores. But - in order to validate the multi-threaded software application - DSX is able to generate an executable binary code for a standard POSIX workstation.
+ * The target hardware architecture is a '''shared memory multi-processor system on chip''' (MP-SoC) using the SoCLib library of IP cores. In order to validate the multi-threaded software application, DSX is able to generate an executable binary code for a standard POSIX workstation.
  * DSX supports the '''POSIX''' compliant [https://www-asim.lip6.fr/trac/mutekh Mutek]  OS kernel for embedded MPSoCs
  * Finally, DSX defines the '''DSX/L''' language, based on PYTHON, that allows the system designer to describe in a single file the Task & Communication Graph (TCG), the MP-SoC hardware architecture, and various mapping of the TCG on the MP-Soc architecture.
+ * DSX defines the '''DSX/L''' language, based on PYTHON, that allows the system designer to describe in a single file the Task & Communication Graph (TCG), the MP-SoC hardware architecture, and various mapping of the TCG on the MP-Soc architecture.
 The DSX/L script execution generates the binary code executable on the workstation, the
 SystemC model of the ''top cell'' correspondint to the MP-SoC architecture, and the binary
+simulator correspondint to the MP-SoC architecture, and the binary
 code that will be uploaded in the MP-Soc embedded memory.
 …
  * flush a MWMR channel
     * '''srl_mwmr_flush( )'''
  * Synchronization barrier
     * '''srl_barrier_wait( )'''
  * taking and releasing a lock
     * '''srl_loock_lock( )'''
+    * '''srl_lock_lock( )'''
     * '''srl_lock_unlock( )'''
  * accessing a shared memory space (address and size)
     * '''srl_memspace_addr( )'''
     * '''srl_memspace_size( )'''
 Three  platforms are presently supported :
+Three  platforms are currently supported :
  * Any Linux (or Unix)  workstation  supporting the POSIX threads,
  * MP-SoC architecture using the MUTEK/D operation system,
  * MP-SoC architecture using the MUTEK/S operating system,
 MUTEK/D is an embedded, POSIX compliant, distributed,  operating system for MP-SoCs,
 while MUTEK/S is an optimized version: the performances are improved, and the memory
+ * MP-SoC architecture using the Mutek/D operation system,
+ * MP-SoC architecture using the Mutek/S operating system,
+Mutek/D is an embedded, POSIX compliant, distributed,  operating system for MP-SoCs,
+while Mutek/S is an optimized version: the performances are improved, and the memory
 footprint is reduced, at the cost of loosing the POSIX compatibility.
 …
 [[Image(MjpegCourse:mjpeg.png)]]
 The two TG & RAMDAC tasks will be implemented as hardware coprocessors : the TG component implements a wire-less receiver for the MJPEG stream, and the RAMDAC component is a graphic display controller.
+The two TG & RAMDAC tasks will be implemented as hardware coprocessors : the TG component implements a wireless receiver for the MJPEG stream, and the RAMDAC component is a graphic display controller.
 The 5 other tasks can be implemented as ''software tasks'' or  as ''hardware tasks''. In this particular example,
 all MWMR communication channels have one single producer, and one
 single consumer, which is frequent for stream oriented multi-media applications.
+single consumer, which is frequent for stream oriented multimedia applications.
 === C1) Task Model definition ===
 As a software application can instanciate several instances of the same task, we must distinguish the task, and the task model. A task model defines the code associated to the task, and the task interface (corresponding to the system resources used by the task : MWMR communications channels, synchronization barriers, locks, and memspaces).
+As a software application can have several instances of the same task, we must distinguish the task, and the task model. A task model defines the code associated to the task, and the task interface corresponding to the system resources used by the task (MWMR communications channels, synchronization barriers, locks, memspaces, ...).
 {{{
 task_model = TaskModel( 'model_name',
 …
                     barriers = [ 'barrier_name', ... ] ,
                     memspaces = [ 'memspace_name', ... ] ,
-                    signals = [ 'signal_name', ... ] ,
                     impls = [ SwTask( 'func', stack_size = 1024 , sources = [ 'func.c' ] )
 }}}
 …
 . ''lock'' : lock protecting exclusive access
 === C4) Memspace definition
+=== C4) Memspace definition ===
 Direct communication through shared memory buffers is supported by DSX, but there is no protection mechanism, and the synchronization is the programmer responsability.
 …
 my_lock = Lock( 'lock_name' )
 }}}
+In the mapping section of the DSX/L program, the lock can be explicitely placed in the memory space.
+In the mapping section of the DSX/L program, 1 software object must be placed :
+. ''lock'' : Where to place the lock
 === C6) Task instanciation ===
 …
 . ''run'' : processor running the task
 === C8) TCG definition ===
+=== C7) TCG definition ===
 The Task and Communication Graph must be defined :
 …
 === D1) SoCLib components  ===
 In the present version of DSX, each hardware component must be described by a PYTHON
+In the current version of DSX, each hardware component must be described by a Python
 class that defines the component interface, and the component parameters.
 The list of available components can be found in SoclibComponents.
 …
 Depending on the component type, the port designation can vary:
  * When the number of ports is fixed, the ports are attributs : My_Proc0.cache define the cache port of the MIPS processor.
  * When the number of port is not fixed (typivally for interconnect component, the ports are accessed through a dedicated method : the getTarget() method of the !LocalCrossbar component returns a VCI target port.
+ * When the number of port is not fixed (typivally for interconnect component, the ports are allocated through a dedicated method : the getTarget() method of the !LocalCrossbar component allocates a VCI target port, the getInit() method allocates an VCI Initiator port.
 The following example describes asimple system with two processor and on e embedded memory:
 {{{
 …
 In any shared memory architecture, the address space is a shared resource.  This resource is structured in several segments. A segment has a name, a base address, a size
 (number of bytes), and a cacheability attribut (Boolean). A segment is a physical entity associated to a
+(number of bytes), and a cacheability attribute (boolean). A segment is a physical entity associated to a
 given VCI target. Several segments can be associated to the same VCI target, but a given segment cannot be distributed over several VCI targets.
 …
 # Instanciating a VCI target hardware component
 # and Linking  the segments to this component
+# and assigning  the segments to this component
 my_ram = MultiRam ( 'ram', seg_data1, seg_data2, seg_reset )
 }}}
 …
 === D4) Generic platforms ===
 As DSX/L is based on PYTHON, it is possible to define generic, parametrized architectutes, that can
+As DSX/L is based on Python, it is possible to define generic, parametrized architectutes, that can
 be reused for various applications. Those reusable architectures are derived classes
 from the basic '''Architecture''' class. The implementation is defined in the architecture() method.
+As an example we define a parameterized multi-processors architecture, called MultiProc, and containing
+ a variable number of processors. The parameter(s) must be named, and the actual parameter value is defined when the architecture is instanciated. The parameter is referenced with the ''getParam()'' method, and it is possible to define a default value.
+As an example we define a parameterized multi-processors architecture, called !MultiProc, and containing
+a variable number of processors. The parameter(s) must be named, and the actual parameter value is defined
+when the architecture is instanciated. The parameter is referenced with the ''getParam()'' method, and it
+is possible to define a default value.
 {{{
 #################################
 …
     def architecture(self):
     # segments definition
     self.reset = Segment( ’reset’, address = 0xbfc00000, type = Cached )
     self.code = Segment( ’code’, type = Cached )
     self.data = Segment( ’data’, type = Uncached )
     # components instanciation and connexion
     self.vgmn = Vgmn( ’vgmn’ )
     self.ram = MultiRam( ’ram’, self.reset, self.code, self.data )
     # processors and caches
     self.cpus = []
     for i in self.getParam( ’nbcpu’ ):
         m = Mips( ’mips%d’%i )
         self.cpus.append( m )
         c = Xcache( ’cache%d’%i )
         g:c.cache // m.cache )
         c.vci // self.vgmn.getTarget() )
     self.vgmn.getTarget() // self.c1
     self.vgmn.getTarget() // self.c2
     self.vgmn.getInit() // self.ram
     # base definition
     self.setBase( self.vgmn )
     # segment table initialization
     self.setConfig(’mapping_table’, MappingTable() )
+        # segments definition
+        self.reset = Segment( ’reset’, address = 0xbfc00000, type = Cached )
+        self.code = Segment( ’code’, type = Cached )
+        self.data = Segment( ’data’, type = Uncached )
+        # components instanciation and connexion
+        self.vgmn = Vgmn( ’vgmn’ )
+        self.ram = MultiRam( ’ram’, self.reset, self.code, self.data )
+        # processors and caches
+        self.cpus = []
+        for i in self.getParam( ’nbcpu’ ):
+            m = Mips( ’mips%d’%i )
+            self.cpus.append( m )
+            c = Xcache( ’cache%d’%i )
+            g:c.cache // m.cache )
+            c.vci // self.vgmn.getTarget() )
+        self.vgmn.getTarget() // self.c1
+        self.vgmn.getTarget() // self.c2
+        self.vgmn.getInit() // self.ram
+        # base definition
+        self.setBase( self.vgmn )
+        # segment table initialization
+        self.setConfig(’mapping_table’, MappingTable() )
 ####################################
 …
 === E1) Mapper declaration ===
 AS it is possible to define various mapping for a given TCG, and a given architecture, we must define a third object : this ''mapper'' will contain all the mapping directives defined by the system designer.
+As it is possible to define various mapping for a given TCG, and a given architecture, we must define a third object : this ''mapper'' will contain all the mapping directives defined by the system designer.
 {{{
 my_mapper = Mapper( my_tcg, my_architecture )
 …
 The mapper has a method ''map()'' that is used to assign a software object to an hardware component.
 An hardware component can b a processor, or a segment associated to an embedded memory bank,
+An hardware component can be a processor, a segment associated to an embedded memory bank,
 or a segment associated to an addressable peripheral.
 {{{
 …
 to various outputs : binary code for the software application, hardware architecture simulation model, etc.
+This involves a code generator. Several code generator exist, they may apply to different parts of you design:
+ * Software only (Tcg object)
+  * Posix() for generating native workstation code
+ * Software and hardware (Mapper object)
+  * MutekS() to use Mutek/S as supporting embedded OS
+  * MutekD() to use Mutel/D as supporting embedded OS
+  * any hardware generator (those on next lines), this will create a platform automatically loading the embedded software
+ * Hardware only (Hardware object)
+  * Caba() to create a CABA netlist (with SoCLib)
+  * Tlmt() to create a TLM-T netlist (with SoCLib)
+User may want to have a convenience Makefile in platform root which would build all code,
+it may be created passing all generators created to generate code to TopMakefile()
+Example: Let's create
+ * An application mapped on an hardware platform with CABA and TLM-T simulators
+ * a corresponding application for the workstation
+ * a top Makefile:
+{{{
+soft = Tcg( ... )
+hard = Hardware( ... )
+mapping = Mapper( hard, soft )
+mapping.map( ... )
+# Generators now:
+muteks_generator = MutekS()
+caba_generator = Caba()
+tlmt_generator = Tlmt()
+posix_generator = Posix()
+# MutekS and simulators (Caba / Tlmt) generates platform and embedded software for a mapping:
+mapping.generate( muteks_generator, caba_generator, tlmt_generator )
+# Posix generates code for a Tcg
+tcg.generate( posix_generator )
+# TopMakefile takes the used generators:
+TopMakefile( muteks_generator, caba_generator, tlmt_generator, posix_generator )
+}}}