Context Navigation

← Previous Revision
Latest Revision
Next Revision →
Blame
Revision Log

40bits @ 690

Last change on this file since 690 was 1, checked in by alain, 7 years ago
First import
File size: 21.8 KB

Line
1	1-Event Manager:
2	The even manager has been changed in the fallowing ways :
3	- the API has been changed to avoid given a pointer to the second argumern to event_send we give only the
4	the gpid of the target processor.
5	- local event handling is gloablly the same
6	- remote event has been changed. Must be carrefully handled with physical access:
7	- the kfifo of the remote cpu is physically accessed
8	- the event_notifier must notify after copying the event structure (on the stack ?).
9	!!! This does not aplying that the pointer arguments are copied !!!
10	=> modify event handler
11
12	3-Processes/Thread Management
13
14	3.1) Task Management
15
16	There's one gloabal table that index all process by pid.
17	Replicaing the kernel will also replicate this structure...
18	The fatest way to corrdinate the replica :
19	- Consiste of using the table of cluster 0 as the main table. Each table will contain
20	a pointer to the task structure if the task is local, other ways the table entry is
21	either marked and contains the cluster id wich handle the task or a NULL value.
22	A NULL value at the main table mean that the entry has not been allocated.
23	A NULL value in other table must be confirmed by cheching the main table.
24
25	This imply a change in pid allocate, wich must allocate from the main table by setting
26	the cluster id and marking the entry with a remote flag (0x1). // TODO more on synch
27
28	The lookup need to check for the task in local, if we find a task address we return' it.
29	If we find a remote entry we go to the indicate cluster. If the remote cluster does not contain
30	the address of the task ...
31
32
33	Excuse me! the fastest way is to directly use the main table as the only table and ignore other tables!!!
34	This will require putting a physical addresse on the table to point to task structures and synchronised
35	with a lock.
36
37	3.2)Task descriptor
38
39	This structure contain the following structures that are shared between threads:
40	- vmm (Virtual Memory Manager) : see 2-.
41	- pointer to root/cwd file, fdinfo and binary file: use physical addresses
42	- pointer to task parent: use a parent cluster id field, or directly a physical addresse ?
43	- 2 list : one for the children and one for the siblings
44	- a signal manager (NULL par default ?)
45	- thread handling structures :
46	- a table containing a reference to all the threads (and a pointer to it's page struct)
47	- a list entry root that englobe all thread: not really used, can be deleted
48
49	How to manipulate these fields ? => by message passing ?
50	for thread's pointer we directly placed the ppn: thread's structures are aligned on a page.
51	Other pointers fd_info, bin are to locals strutures.
52	vmm handling is local if a single threaded task, otherwise the handling is done by message passing.
53
54	3.3) Thread Management
55	In the second stage...
56
57
58
59	2-Memory Management:
60	vmm: message passing to the processor that contain the main structure.
61	There is no need to acess the thread data except for statistique pupose!
62	There is a part that could be done by the thread with passing a message... look in the page table part
63
64	To think : page fault at the same addresses how to avoid them (merge the messages of page fault) ?
65
66	kmem: all remote allocation are forbiden except for user pages (at least for now).
67	(user remote allocation have been done in the second stage.)
68
69
70	3-Cluster Manager:
71	One cluster manager for each cluster: no need for the global table...no need for the replication of cluster addresses
72	...no needs for procs base addresses: the cpu_s are directly put into the cluster structures with the cluster base
73	address the compiler will offset to the good addresse.
74
75
76	3-DQDT:
77	initialisation has been modified to not require the remote allocation API. We set a table of two dqdt in the
78
79
80
81	4-Drivers
82
83
84
85
86	5-Boot:
87	A) The main proc of the boot cluster enter the boot loader, initiliase the stack bases for
88	all other procs in the boot_tbl and wait for all other proc to copy the needed info (kernel, boot,
89	boot info) from the boot cluster.
90
91	B) All other procs enter the boot_loader allocate a stack a the end of the cluster ram and
92	wait for their cluster to be initilised by their main proc. (except for those of the boot cluster ?)
93
94	C) The main procs of other clusters enter the boot loader, set the sp register to point to
95	the stack at the end of the current cluster ram.
96	(Enter the C code and) Copy the kernel/boot/boot_info from the main cluster to their cluster
97	increment a value and sleep back (wait instruction).
98
99	D) All procs sleep after incrementing a counter in the in the original BI (boot cluster).
100
101	E) Once the counter reach the number of proc, the main proc of the boot cluster can then
102	copy the kernel/boot/boot_info in the same cluster but at the correct adresses. This will erase the
103	code of the preloader. (the sp pointer of the boot cluster is good, the preloader should set it to
104	point
105
106	!!! moving the boot code is dangerous, since we are currently executing it ... and the
107	addreses in it are fixed ?!!! If we really want this space, we need to give up at this stage on the
108	boot code, BI can be kept.
109
110	***********************************************************************************************************
111	Initital memory content at boot (TODO):
112
113	------------------ 0x0000 0000
114	Preloader
115	------------------ ENDPRELOADER
116	Kernel
117	------------------ END kernel paddr (important here, the charging addresse (paddr) is differente than vaddr)
118	BI
119	------------------
120	Boot code
121	------------------ End of boot
122	.....
123	------------------ end of cluster space - 4 boot stacks
124	Boot stacks (x nbproc)
125	----------------- End of cluster space
126
127
128	Boot code role is to set:
129	1 - the kernel and boot info: deplace it to the address 0
130	2 - the stacks for each proc
131	3 - initialise the info structure (allocated on the stack): don't forget to put the BI info address
132	that resulted from displacing it.
133	4 - initialise the page table and the mmu if we are booting in virtual mode
134
135	***** Phase 1: boot proc of the boot cluster
136	- after the preloader, there's only the boot_proc who has a stack and all other proc are waiting (an WTI).
137	- jump to C code
138	- wakeup all proc (possible improvement: set a distributed wakeup)
139
140	***** Phase 2: all proc (the boot cluster proc)
141
142	- start by setting a stack at the end of current cluster
143	- jump to C code
144	- set info structure
145	- copy the kernel and BI info (only one proc per cluster) at the start of the current cluster
146	(- set the page table and the mmu)
147	- jump to the kernel (of the cuurrent cluster) and wait in it ? yes wait for the
148	kernel image of cluster 0 to be set (since we will probably synchronise with it)
149
150
151	? how to signal to the current cluster proc to jump to kernel ?
152
153	--> the boot code is now used by only the boot cluster
154	--> the preloader code is used by only the boot cluster other procs
155
156	***** Phase 3:
157	- put a barrier to be sure that all procs have entred the boot code
158	- all proc set their stack: to put after the boot proc stack
159	(- set the page table and the mmu)
160
161	--> the preloader code is no longer in use
162
163	- copy the kernel and BI of the boot cluster (by the boot proc)
164	- all proc of the boot cluster can now jump to the kernel and wait
165	- the boot proc enters the kernels and wake up a proc per cluster (except for the current
166	boot cluster)
167
168	--> the boot code is no longer in use
169
170
171	!! be careful the stack of all the procs are still in use !!
172	put the in the reserved zone ?
173
174	N.B.: all addresses of the boot cluster are found using compiled in variables
175
176
177	****** It's up to the kernel now...
178
179	Kernel boot:
180	***** Phase 1: only the boot proc
181	wake up one proc per cluster
182	!!! Must be sure that they are sleeping (waiting) ? !!!
183
184	***** Phase 2: A proc per cluster
185	Tasks: '->' indicate dependance
186
187	1) initialise task bootstap and task manager and set idle arg
188	reported: devfs/sysfs roots
189
190	2) initialise arch_memory : (initialise boot_tbl: done in the boot phase); initialise
191	clusters_tbl ?
192	initialise arch: initialise current cluster: initialise ppm (physical page memory) (can
193	alloc now!), cpu_init : event listener!
194	initialise devices (printk) (put all in cluster devices lists, root all
195	interruptions locally (to the proc handling them));
196	and register them (!!!to be reported!!!) -> depend on message api
197
198
199	3) --Barrier--: here we should wait that everybody has set there event listener.
200
201	only main proc of I/O cluster: initialise fs cache locally(?);
202	initialise devfs/sysfs roots and set devices.
203	----Barier ?------
204
205
206	4) task_bootstarp_finalize : should not be needed all should be done in task_init (TODO)
207	kdmsg_init : a set of lock (isr, exception, printk)_lock and printk_sync
208
209	5) DQDT init: arch_dqdt_init and dqdt_init (TODO)
210
211	6) Forget about bootstrap_replicate_task and cluster_init_table
212	a local cluster_init_cores is needed ?
213
214	register devices in fs ?
215
216	**** Phase 3: all procs (thread idle)
217
218	set thread_idle for each proc (parrallèle ?)
219
220	load thread_idle: free reserved page (the boot loader reserved pages...), create the event
221	thread event manager, kvfsd only in the cluster_IO.
222
223
224	*** Phase 4: kvfsd (only main cpu of cluster IO)
225
226	enable all irq.
227
228	initialise fs -> need __sys_blk
229	task load init require devfs file system
230	if failed set kminishell
231
232
233
234
235
236
237
238
239
240	Kernel Runing Modifications:
241
242	*** DQDT (Ressource handling): Must be set using physical addresses ?
243
244	*** FS support: Open, read, write ... all physical access or just when reaching the device ?
245
246	*** Memory support: when there is no multithreaded application (only kthread) the memory
247	management is all done localy. Cross cluster messaging for remote creation.
248
249	*** Task support: a replicated taskmanager, pids are now composed of XYN, X and Y are the
250	cluster offset, N is the offset in the current cluster. Remote creation can be done
251	only using message passing.
252
253	*** Device support:
254
255	*** Exception, Interruptions (not syscalls): all device interruptions are rooted and handled
256	locally, as configured at boot time.
257
258	Add cluster span to the arch config or in the cluster_entry. Yes it must be defined by the
259	boot code since this the code that decide the offset when using virtual memory.
260
261	*** User space support:
262	- Syscall: switch adressing mode in case we come or go from a syscall; save the coproc2 extend register
263	- copy from/to user must be modified
264
265
266	Developement process:
267	******** Phase one *****************
268	Consiste of boting the kernel with no user space support and with a dummy dqdt suport
269
270	*** Stage 0:
271	Rewrite the kernel in the view of passing to 40 bits:
272	Handle gloabal variables:
273	1) remove inused gloabl variables
274	2) simplifie the message passing API
275	3) do a simplified DQDT : replace dqdt files by a dummy : next_cluster++
276	4) task and thread creation only using message passing
277	5) do a true copy_from_usr and copy_to_usr !
278	6) delete the assumption that all address superior to KERNEL_OFFSET belongs
279	to the kernel(genarally used to verify if an address belongd to user space)
280	replace by checking that the address is between USR_OFFSET and _LIMIT
281	(see libk/elf.c) for an example.
282
283
284	*** Stage 1:
285	Develloping boot code: require setting the disk, develloping virtual memory for
286	the perspective of a dooble mode of booting, replicate the kernel and put it
287	at address zero.
288	Develloping kernel boot code (reaching kern_init and printing a message ?), setting boot_dmsg
289
290	*** Stage 2: (TODO: synchronise with stage 0)
291	Rewrite of most of the kernel:
292	******* GOALS:
293	1) delete (replace) reference to (unused) global variables:
294	- cluster_tbl : no such a thing as cluster init...
295	reference to cluster struct should be removed to keep only cids: this will affect allot of code!
296	No more a access to remote listener ptr (event.c), nor cpu struct, not cluster, nor ppm of remote cluster ...
297
298	- devfs_db replaced by the root dentry/inode of the devfs file system and since
299	there is only one kernel that use it => allocate it dynamically
300	- similar for sysfs_root_entry ?
301	- devfs_n_op/devfs_f_op : are const
302	- (dma,fb,icu,iopic,sda,timer,xicu,tty)_count are per cluster the
303	new naming scheme is device_cidn
304	- rt_timer: Not used => removet it
305	- (exception, isr, printk, boot)_lock: are all used to lock the printing
306	- kexcept_tty, klog_tty, tty_tbl, kisr_tty, kboot_tty ???
307
308	- soclib_(any device)_driver are RO: can we mark them as const ?
309	- boot_stage_done: not used
310
311	2) Event Manager to use physical adresses.
312	3) fs
313	4) all interruption (produced by device and WTI) are handled locally
314	5) all exception are handled locally (there is no remote page fault, since
315	there is no user space => no multithreaded user space application)
316	6) Complete the kernel boot phase.
317	7) Context switch: adding the saving of the coproc registers: DATA_extend register and Virtual
318	mode register
319	?cpu_gid2ptr: should be modified to fetch only local cluster, if a remote cluster is called, it should return an error
320
321	*** Stage N: user space support
322	$27 is now used by the kerntry code! user space should no longer used it
323	Switch to user space need to modify to set off the MMU and reset when going back...
324	set in init context the mmu register value (0xF)
325
326	*Structures modif (with study only struct fields)**
327
328	** NOTATIONS:
329	-PB: problem,
330	-HD: handled in the corresponding structure. This is for structure that are
331	directly embeded in the structure and wich contain a non local pointer
332	-NPB: no problem
333	-LPTR: local pointer
334	-RPTR: remote pointer
335	-RPTR-S: RPTR that is static => point to a structure that cannot be moved across clusters
336	-RPTR-D: RPTR that is dynmic=> point to a structure that can be moved across clusters
337	(which mean that the ptr need to be updated!)
338
339
340	** Structures that need to accessed remotly (and does they need to be dynamic ?)
341	struct task_s;
342	struct thread_s;
343
344
345	** Structures details
346
347	+ thread structure (thread.h)
348	struct thread_s
349	{
350	struct cpu_uzone_s uzone;//NOPB
351	spinlock_t lock;//PB:HD
352	thread_state_t state;//NOPB
353	...//NOPB
354	struct cpu_s lcpu;//NOPB: LPTR /! pointer to the local CPU description structure */
355	struct sched_s local_sched; //NOPB LPTR /! pointer to the local scheduler structure */
356
357	* struct task_s *task; //PB: RPTR
358
359	thread_attr_t type;//NOPB /! 3 types : usr (PTHREAD), kernel (KTHREAD) or idle (TH_IDLE) /
360	struct cpu_context_s pws;//NOPB /! processor work state (register saved zone) /
361
362	* struct list_entry list;//PB /! next/pred threads at the same state /
363	* struct list_entry rope;//OB /! next/pred threads in the __rope list of thread /
364	struct thread_info info;//HD /! (exit value, statistics, ...) /
365
366	uint_t signature;//NOPB
367	};
368	struct thread_info
369	{
370	...//NOPB
371	struct cpu_s *ocpu;//PB: RPTR-S
372	uint_t wakeup_date;//NOPB /! wakeup date in seconds /
373	void exit_value;//NOPB /! exit value returned by thread or joined one */
374	* struct thread_s join;//PB:RPTR-D /! points to waiting thread in join case */
375	* struct thread_s *waker;//RPTR-D
376	struct wait_queue_s wait_queue;//PB:HD
377	* struct wait_queue_s *queue;//PB
378	struct cpu_context_s pss;//NPB /! Processor Saved State /
379	pthread_attr_t attr;//HD
380	void *kstack_addr;//NPB:LPTR (always moved with the thread)
381	uint_t kstack_size;//OK
382	struct event_s *e_info;//never used!!!!!!!!!!!
383	struct page_s *page;//LTPR ?
384	};
385	typedef struct
386	{
387	...//NOPB
388	void *stack_addr;//user space ?
389	size_t stack_size;
390	void entry_func;//USP / mandatory */
391	void exit_func;//USP / mandatory */
392	void *arg1;//?
393	void *arg2;//?
394	void *sigreturn_func;//?
395	void *sigstack_addr;//USP todo
396	size_t sigstack_size;//?
397	struct sched_param sched_param;//PPB
398	...//NOPB /* mandatory */
399	} pthread_attr_t;
400
401	+ task structure
402	struct task_s
403	{
404	/* Various Locks */
405	mcs_lock_t block;//HD
406	spinlock_t lock;//HD
407	spinlock_t th_lock;//HD
408	struct rwlock_s cwd_lock;//HD
409	spinlock_t tm_lock;//local:HD
410
411	/* Memory Management */
412	struct vmm_s vmm;//HD
413
414	/* Placement Info */
415	struct cluster_s *cluster;//local
416	struct cpu_s *cpu;//local
417
418	/* File system *
419	//for cwd and root they should always point to the same
420	//file struct (not node) to avoid ensurring the coherence
421	//if one of them change
422	struct vfs_inode_s *vfs_root;//
423	struct vfs_inode_s *vfs_cwd;//
424	struct fd_info_s *fd_info;//embed it
425	struct vfs_file_s *bin;//Static: embed it
426
427	/* Task management */
428	pid_t pid;//NPB
429	uid_t uid;//NPB
430	gid_t gid;//NPB
431	uint_t state;//NPB
432	atomic_t childs_nr;//NPB
433	uint16_t childs_limit;//NPB
434
435	//make them of type faddr
436	struct task_s *parent;//PB
437	struct list_entry children;//PB
438	struct list_entry list;//PB
439
440	/* Threads */
441	uint_t threads_count;
442	uint_t threads_nr;
443	uint_t threads_limit;
444	uint_t next_order;
445	uint_t max_order;
446
447	//add a function to register after we creat it ?
448	BITMAP_DECLARE(bitmap, (CONFIG_PTHREAD_THREADS_MAX >> 3));
449	struct thread_s **th_tbl;//PB: TODO: local table of pointer to avoid the page below?
450	struct list_entry th_root;
451	struct page_s *th_tbl_pg;//NPB: local (make it local)
452
453	/* Signal management */
454	struct sig_mgr_s sig_mgr;
455
456	#if CONFIG_FORK_LOCAL_ALLOC
457	struct cluster_s *current_clstr;
458	#endif
459	};
460
461
462
463
464
465	File per file modif:
466	*** kern dir files:
467	*PKB - kern_init.c (see up)
468	- atomic.c: handle atomic_t and refcount_t objects (try to keep them local ? It will
469	be hard (fs, event handler?))
470	- barrier.c: used only by sys_barrier.c
471	- blkio.c: this layer allow an fs to send request to block device, all request pass
472	here.(manipulate a device struct. With this it's best to replicate the notion
473	of device ? NO). Use blkio async to send an IPI blokio request.
474	- cluster.c : some init functions, cid2ptr (to delete ?), manager, key_op (never used)
475	- cond_var.c: used only by user space (at least for now)
476	- cpu.c : the function cpu_gid2ptr access the cluster structure! see the structure for more
477	- do_exec : use copy_uspace!
478	- do_interupt : remove thread migration handling ?
479	- do_syscall : (N.B. the context save is for the fork syscall) remove thread migration ?
480	- event : modify to support the ...
481	- (keysdb/kfifo/mwmr/mcs_sync/radix/rwlock/semaphore/spinlock).c: their modification depend on the scope in which their are used
482	- kthread_create: local
483	- rr-sched: should be local
484	- scheduler: (thread_local_cpu) should be local
485	- task.c: should get simplified => removing replicate calls (they should may be used to see how the migration goes)
486	- thread_create.c: use of cpu_gid2ptr, should be local
487	- destroy\|dup\|idle.c: should be local
488	- thread migrate.c: need to be modified
489	SYSCALLS
490	Miscelanious:
491	- sys_alarm.c: manipulte local data (And thread migration is initiated by each thread indepndantly...)
492	*PKB - sys_barier.c: not local, to complexe for no reason except performance, we should pospone this functionnality?
493	I think we need such a struc for the boot. This mean that posponing is not possible, but we can simplify it
494	- sys_clock.c: local
495	- sys_cond_var.c:
496	- sys_dma_memcpy.c:
497	- sys_rwlock.c
498	- sys_sem.c
499	Task:
500	- sys_exec.c
501	- sys_fork.c
502	- sys_getpid.c
503	- sys_ps.c
504	- sys_signal.c
505	- sys_thread_create.c
506	- sys_thread_detach.c
507	- sys_thread_exit.c
508	- sys_thread_getattr.c
509	- sys_thread_join.c
510	- sys_thread_migrate.c
511	- sys_thread_sleep.c
512	- sys_thread_wakeup.c
513	- sys_thread_yield.c
514	Task Mem:
515	- sys_madvise.c
516	- sys_mcntl.c
517	- sys_mmap.c
518	- sys_sbrk.c
519	- sys_utls.c
520	FS:
521	- sys_chdir.c:
522	- sys_close.c:
523	- sys_closedir.c:
524	- sys_creat.c:
525	- sys_getcwd.c
526	- sys_lseek.c
527	- sys_mkdir.c
528	- sys_mkfifo.c
529	- sys_open.c
530	- sys_opendir.c
531	- sys_pipe.c
532	- sys_read.c
533	- sys_readdir.c
534	- sys_stat.c
535	- sys_unlink.c
536	- sys_write.c
537	mm/:
538	Task related:
539	vmm.c: munmap (region:detach, split, resize); mmap (shared, private \| file, anomymous);
540	skbrk(resize heap region TODO: use the resize func), madvise_migrate, vmm_auto_migrate,
541	(modify page table attribute and set migrate flag), madvise_will_need;
542	vm_region.c:...
543	Other:
544	ppm.c: handle page descriptors. We find some use of cluster ptr that should be removed ?
545	page.c: to make local
546
547
548
549	Question:
550	1) How is the time handled in almos. Does we use a unique timer to keep the time seamleese across node ?
551	2) Copy to/from userspace
552	3) Cross cluster lists: thread lists
553	4) Only the kentry is mapped when entring the kernel from user space... This is going to be a complexe point.
554	Three new things need to be done:
555	1- if from user space (or simply if we were in the vitual mode was actif), switch to physical mode of local cluster
556	2- if from kernel save DATA_EXT register. Also, if we do cpy_uspace by switchin to the virtulle space, we need to
557	do what we do in one.
558
559
560	Remark:
561	cross cluster access also keep in mind that other access are possible, like page based access (temporarely map a page).
562
563
564
565
566	Task and thread subsystems:
567	Per cluster vmm or per thread ?
568
569	Per cluster: 1) interest: less physical space cosummed and less coherence to handle
570	2) Problems: "resharing" of the stack zone could complex (make it impossible to migrate the stack of thread and thus the thread) ?
571
572	Per thread: 1) interest: virtualise the virtual mapping between thread become easy (for stack and TLS)
573	and temporary mappping are allot easier...
574	(and we could free the $27 register!)
575	2) Problems: to much use of physical space (cache) + require more coherence message
576
577
578
579	Files need to accessible across cluster, since they could be share by tasks!
580
581
582
583	TODO:
584	Check that we have mapped enough of the kernel, for the kentry ?
585
586
587
588	VMM replicated region (mapper) require that the insertion to the page cache be atomic!
589
590
591	DEveloppement results:
592	event_fifo.c
593	remote operation.c
594	arch_init.c:ongoing for devices
595	cluster_init.c
596	kentry.c:k1 is now used by the kernel! No, it's that we k1 only when we were in
597	kernel mode... Otherwise when coming in usert mode we don't need it and save it?

Note: See TracBrowser for help on using the repository browser.

Download in other formats:

Original Format