[[PageOutline]] = TSAR virtual memory = The TSAR MMU can be used with any 32 bits, single instruction issue, processor. In order to be independent on the processor core choice, the TLB MISS are handled by an hardwired Finite State Machine (called a Table Walk), without software action. == 1. Page Table Organisation == The TSAR virtual memory uses a paged approach. The TSAR architecture defines two page sizes : - 4 Kbytes pages - 2 Mbytes pages The virtual address space size is 4 Gbytes (32 bits virtual addresses). The physical address space is limited to 1 Tbytes (40 bits physical addresses). The page table are build by the operating system, and are stored in memory. They are used for both the VPN (Virtual Page number) to PPN (Physical Page Number) translation, and for access rights checking. === 1.1 Two levels structure === As described below, the Page Table has a hierarchical two levels structure : The general mapping constraints are the following - All page tables (first & second level) must be aligned : the page table base adress must be a multiple of 8K bytes for a first level page table, and multiple of 4K bytes for a second level page table. - The page tables can be placed anywhere in the physical address space. - The PTPR register (located in the generic MMU, and initialised by the OS at each context switch) contains actually the 26 MSB bits of the first level page table base address. It must be extended (left-shifted) to 36 bits by the hardware. === 1.2 First Level Page Table Entry Format === Each entry in a first level page table can contain either a 2M bytes page descriptor (called PTE1), or a second level page table descriptor (called PTD1). It is implemented as a single 32 bits word : * PTE1 : ||V||T||L||R||C||W||X||U||G||D|| reserved (3 bits) || PPN1 (19 bits) || * PTD1 ||V||T|| reserved (2 bits) || PTBA (28 bits) || The various fields are defined as follows : || V || Valid bit || Valid entry when 1 (set by the OS) || || T || Type bit || PTD1 when 1 (set by the OS) || || L || Local access bit || Used by the OS for page replacement (set by the hardware) || || R || Remote access bit || Used by the OS for page replacement (set by the hardware) || || C || Cachable bit || The page is cachable in the L1 cache when 1 (set by the OS) || || W || Writable bit || The page is writable when 1 (set by the OS) || || X || eXecutable bit || The page can contain instructions when 1 (set by the OS) || || U || User bit || The page is accessible in user mode when 1 (set by the OS) || || G || Global bit || Entry not invalidated in TLB flush when 1 (set by the OS) || || D || Dirty bit || The page has been modified when 1 (set by the hardware) || || PPN1 || Physical Page Number || Concatened to the page offset to build the physical address || || PTBA || Page Table Base Address || Second level page table base address || The L, R, D bits are used by the operating system to implement the page replacement policy. * The D bit is set by the hardware, when a page is written and when it is not already set, using an atomic access (LL/SC). * The L bit is set by the hardware, when the page is accessed by a local processor or coprocessor, after a TLB miss, and when it is not already set. * The R bit is set by the hardware, when the page is accessed by a remote processor or coprocessor, after a TLB miss, and when it is not already set. These page table updates use atomic access (LL/SC). If the entry is a PTE1, the PPN1 value (19 bits) must be concatened with the page offset (21 bits) to build the 40 bits physical address. If the entry is a PTD1, the PTBA value (28 bits) must be left-shifted by 12 bits to define the base address of the level 2 page table. The page table being aligned in memory, the 12 LSB bits of this base address have a 0 value. === 1.3 Second Level Page Table Entry Format === Each entry in a second level page table contains a 4K bytes page descriptor (called PTE2). It is implemented as two 32 bits words: the first word contains the flags; the second word contains the 28 bits physical page number (PPN2). * PTE2 first word : ||V||T||L||R||C||W||X||U||G||D|| reserved (22 bits) || * PTE2 second word : || reserved (4 bits) || PPN2 (28 bits) || The various fields are defined as follows : || V || Valid bit || Valid entry when 1 (set by the OS) || || T || Type bit || Must be 0 for a PTE2 (set by the OS) || || L || Local access bit || Used by the OS for page replacement (set by the hardware) || || R || Remote access bit || Used by the OS for page replacement (set by the hardware) || || C || Cachable bit || The page is cachable in the L1 cache when 1 (set by the OS) || || W || Writable bit || The page is writable when 1 (set by the OS) || || X || eXecutable bit || The page can contain instructions when 1 (set by the OS) || || U || User bit || The page is accessible in user mode when 1 (set by the OS) || || G || Global bit || Entry not invalidated in TLB flush when 1 (set by the OS) || || D || Dirty bit || The page has been modified when 1 (set by the hardware) || || PPN2 || Physical Page Number || Concatened to the page offset to build the 40 bits address || The L, R, D bits are used by the operating system to implement the page replacement policy. * The D bit is set by the hardware, when a page is written and when it is not already set, using an atomic access (LL/SC). * The L bit is set by the hardware, when the page is accessed by a local processor or coprocessor, after a TLB miss, and when it is not already set. * The R bit is set by the hardware, when the page is accessed by a remote processor or coprocessor, after a TLB miss, and when it is not already set. These page table updates use atomic access (LL/SC). The PPN2 value (28 bits) must be concatened with the page offset (12 bits) to build the 40 bits physical address. == 2. Generic MMU == For each TSAR programmable processor, the generic MMU is implemented as an hardware component in the L1 cache controller. As the processor core can issue two simultaneous instruction and data requests, there is actually two separated hardware MMUs for instruction and data. Each MMU contains a TLB (Translation Look-aside Buffer). These TLBs are implemented as set-associative caches containing 64 entries (8 sets of 8 ways). Each TLB contains a mix of 4Kbytes and 2 Mbytes page descriptors. for 4 Kbytes & 4 Mbytes pages. == 3. I/O MMU == To be defined...