zephyr

mirror of https://github.com/zephyrproject-rtos/zephyr synced 2025-08-12 01:25:36 +00:00

Author	SHA1	Message	Date
Kumar Gala	7d35a8c93d	kernel: remove arch_mem_domain_destroy The only user of arch_mem_domain_destroy was the deprecated k_mem_domain_destroy function which has now been removed. So remove arch_mem_domain_destroy as well. Signed-off-by: Kumar Gala <kumar.gala@linaro.org>	2021-03-18 16:30:47 +01:00
Kumar Gala	3a6598054a	kernel: remove deprecated mem domain APIs Remove k_mem_domain_destroy and k_mem_domain_remove_thread as they've been deprecated for at least 2 releases now. Signed-off-by: Kumar Gala <kumar.gala@linaro.org>	2021-03-17 13:49:36 -05:00
Andrzej Głąbek	6de16d0013	kernel: Add missing verification for device_usable_check() system call so that this function and also device_is_ready() can be called from user mode. Signed-off-by: Andrzej Głąbek <andrzej.glabek@nordicsemi.no>	2021-03-15 10:45:20 -05:00
Enjia Mai	4aed856d7f	kernel: smp: Remove unused internal API z_smp_reacquire_global_lock() The internal function z_smp_reacquire_global_lock() has not used by anywhere inside zephyr code, so remove it. Fixes #33273. Signed-off-by: Enjia Mai <enjiax.mai@intel.com>	2021-03-14 18:32:26 -04:00
Peter Bigot	b29abe3710	device: add API to visit required devices The static device dependencies from devicetree are not the only ones that might be present at runtime. Add API that allows visiting required devices without assuming that handles for or pointers to them can be accessed as a static contiguous sequence. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-03-11 08:53:18 -05:00
Lauren Murphy	d88ce65463	kernel/sched: only send IPI to abort thread if hardware supports it Wrap arch_sched_ipi() call in z_thread_abort() with ifdef checking for hardware support of IPI. Fixes #32723 Signed-off-by: Lauren Murphy <lauren.murphy@intel.com>	2021-03-10 14:27:33 -05:00
James Harris	33c9be90cc	kernel: fix TOCTTOU issue in k_thread_name_set Previously, a racing write to the provided string could result in up to CONFIG_THREAD_MAX_NAME_LEN-2 bytes after the end of user-accessible memory being leaked into the thread name. For now, make a temporary copy. In an ideal world this could copy directly from userspace into the thread name, but that violates the current vrfy / impl split. Signed-off-by: James Harris <james.harris@intel.com>	2021-03-08 19:27:23 -05:00
Andy Ross	820c94e5dd	arch/xtensa: Inline atomics The xtensa atomics layer was written with hand-coded assembly that had to be called as functions. That's needlessly slow, given that the low level primitives are a two-instruction sequence. Ideally the compiler should see this as an inline to permit it to better optimize around the needed barriers. There was also a bug with the atomic_cas function, which had a loop internally instead of returning the old value synchronously on a failed swap. That's benign right now because our existing spin lock does nothing but retry it in a tight loop anyway, but it's incorrect per spec and would have caused a contention hang with more elaborate algorithms (for example a spinlock with backoff semantics). Remove the old implementation and replace with a much smaller inline C one based on just two assembly primitives. This patch also contains a little bit of refactoring to address the scheme has been split out into a separate header for each, and the ATOMIC_OPERATIONS_CUSTOM kconfig has been renamed to ATOMIC_OPERATIONS_ARCH to better capture what it means. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	deca2301f6	kernel/swap: Move arch_cohere_stacks() back under the lock Commit `6b84ab3830` ("kernel/sched: Adjust locking in z_swap()") moved the call to arch_cohere_stacks() out of the scheduler lock while doing some reorgnizing. On further reflection, this is incorrect. When done outside the lock, the two arch_cohere_stacks() calls will race against each other. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Eric Johnson	b4aeef4d5b	kernel: timer: Fix incorrect behavior for timers with K_FOREVER period Zephyr docs state that timers will act as one-shot timers when started with a period of K_NO_WAIT or K_FOREVER. However the code adjusting period was setting K_FOREVER timeout ticks to 1 which caused the timer to expire every tick. This adds a check to not adjust K_FOREVER periods Signed-off-by: Eric Johnson <eric@liveathos.com>	2021-03-07 08:00:08 -05:00
Flavio Ceolin	9b246aba78	power: Make pm_system_resume private This API is not intended to be public and it is called only from the idle thread. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-03-07 07:59:53 -05:00
Flavio Ceolin	2e9b583da9	idle: Remove weak function pm_system_resume is always implemented when PM is enabled. There is no need to have this weak function under an ifdef PM. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-03-07 07:59:53 -05:00
Flavio Ceolin	6307d19967	power: Remove unused / unimplemented code pm_system_resume_from_deep_sleep is not implemented or used anywhere. Just remove it and keep the code base cleaner. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-03-07 07:59:53 -05:00
Flavio Ceolin	e2771340af	power: Remove unnecessary pm_idle_exit_notification_disable api This function is useless and the state variable that it was controlling is also not necessary because the same logic is being handled by the variable post_ops_done.\ This reasonably simplifies idle thread logic. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-03-07 07:59:53 -05:00
Flavio Ceolin	b5e1336e83	power: s/POWER_STATE_ACTIVE/PM_STATE_ACTIVE Fix some references to old power state names. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-03-07 07:59:53 -05:00
Flavio Ceolin	10f29359d7	power: Make pm_system_suspend private pm_system_suspend is called only from the idle thread and should not be exported as a public API. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-03-07 07:59:53 -05:00
James Harris	53b8179371	kernel: sem: handle resets with outstanding waiting threads Previously, a k_sem_reset with any outstanding waiting threads would result in the semaphore in an inconsistent state, with more threads waiting in the wait_q than the count would indicate. Explicitly -EAGAIN any waiting threads upon k_sem_reset, to ensure safety here. Signed-off-by: James Harris <james.harris@intel.com>	2021-03-06 07:39:43 -05:00
James Harris	b10428163a	kernel: sem: add K_SEM_MAX_LIMIT Currently there is no way to distinguish between a caller explicitly asking for a semaphore with a limit that happens to be `UINT_MAX` and a semaphore that just has a limit "as large as possible". Add `K_SEM_MAX_LIMIT`, currently defined to `UINT_MAX`, and akin to `K_FOREVER` versus just passing some very large wait time. In addition, the `k_sem_*` APIs were type-confused, where the internal data structure was `uint32_t`, but the APIs took and returned `unsigned int`. This changes the underlying data structure to also use `unsigned int`, as changing the APIs would be a (potentially) breaking change. These changes are backwards-compatible, but it is strongly suggested to take a quick scan for `k_sem_init` and `K_SEM_DEFINE` calls with `UINT_MAX` (or `UINT32_MAX`) and replace them with `K_SEM_MAX_LIMIT` where appropriate. Signed-off-by: James Harris <james.harris@intel.com>	2021-03-05 08:13:53 -06:00
Spoorthy Priya Yerabolu	4118ed1d4d	kernel: sched: removing dead code Due to the recent changes to scheduler z_find_first_thread_to_unpend & z_remove_thread_from_ready_q are not used anymore. So removing the dead code. fixes: #32691 Signed-off-by: Spoorthy Priya Yerabolu <spoorthy.priya.yerabolu@intel.com>	2021-03-05 11:05:25 +03:00
Andy Ross	6400bb54d6	kernel/idle: Clean up and refactoring / remove TICKLESS_IDLE_THRESH While I'm in the idle code, let's clean this loop up. It was a really bad #ifdef hell: * Remove the CONFIG_TICKLESS_IDLE_THRESH logic (and the kconfig), which never did anything but needlessly increase latency. * Move the needed timeout logic from the main loop into pm_save_idle(), which eliminates the special case for !SYS_CLOCK_EXISTS. Behavior (modulo that one kconfig) should be completely unchanged, and now the inner part of the idle loop looks like: while (true) { (void) arch_irq_lock(); if (IS_ENABLED(CONFIG_PM)) { pm_save_idle(); } else { k_cpu_idle(); } } Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-04 14:31:12 -05:00
Andy Ross	39a8f3b4f9	kernel/idle: Replace stolen IRQ lock The removal of the abort handling also absconded with an IRQ lock that is required for reliable operation in the idle loop. Put it back. Once the idle loop has made a decision to enter idle, any interrupt that arrives needs to be masked and delivered AFTER the system enters idle. Otherwise we run the risk of races where the system accepts and processes an interrupt that should have prevented idle, but then goes to sleep anyway having already made the decision. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-04 14:31:12 -05:00
Peter Bigot	b706a5e999	kernel: remove old work queue implementation Now that the old API has been reimplemented with the new API remove the old implementation and its tests. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-03-03 20:06:00 -05:00
Peter Bigot	d1affd9118	kernel: default to new work API implementation Switch the default and clean up some test workarounds. This will enable final conversions necessary to transition to the new API. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-03-03 20:06:00 -05:00
Peter Bigot	dc34e7c6f6	kernel: add new work queue implementation This commit provides a complete reimplementation of the work queue infrastructure intended to eliminate the race conditions and feature gaps in the existing implementation. Both bare and delayable work structures are supported. Items can be submitted; delayable items can be scheduled for submission at a future time. Items can be delayed, queued, and running all at the same time. A running item can also be canceling. The new implementation: * replaces "pending" with "busy" which identifies the active states; * supports canceling delayed and submitted items; * prevents resubmission of a item being canceled until cancellation completes; * supports waiting for cancellation to complete; * supports flushing a work item (waiting for the last submission to complete without preventing resubmission); * supports waiting for a queue to drain (only allows resubmission from the work thread); * supports stopping a work queue in conjunction with draining it; * prevents handler-reentrancy during resubmission. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-03-03 20:06:00 -05:00
Peter Bigot	44539ed645	kernel: select work queue implementation Attempts to reimplement the existing work API using a new work implementation failed, primarily due to heavy use of whitebox testing in validating the original API. Add a temporary Kconfig that will select between the two implementations so we can use the same identifiers but select which implementation they reference. This commit just adds the selection infrastructure and uses it to conditionalize the existing implementation in anticipation of the new one in the next commit. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-03-03 20:06:00 -05:00
Peter Bigot	0259c864df	kernel: add private scheduler APIs These functions are a subset of proposed public APIs to clean up several issues related to safely handling waking of threads. They have been made private as they interface may change, but their use will simplify the reimplementation of the k_work functionality. See: https://github.com/zephyrproject-rtos/zephyr/pull/29668 Signed-off-by: Andrew Boie <andrew.p.boie@intel.com> Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-03-03 20:06:00 -05:00
James Harris	c7bb423f3e	kernel: fix race conditions with z_ready_thread Several internal APIs wrote thread attributes (return value, mainly) _after_ calling `z_ready_thread`. This is unsafe, at least in SMP, because another core could have already picked up and run the thread. Fixes #32800. Signed-off-by: James Harris <james.harris@intel.com>	2021-03-03 13:54:47 -05:00
James Harris	6543e06914	kernel: sched: avoid unnecessary lock in z_impl_k_yield `z_impl_k_yield` unlocked sched_spinlock, only to lock it again immediately, do a little bit more work, then unlock it again. This causes performance issues on SMP, where `sched_spinlock` is often fairly highly contended and cores often end up spinning for quite a while waiting to retake the lock in `z_swap_unlocked`. Instead directly pass the spinlock key to `z_swap` and avoid the extra lock+unlock. Signed-off-by: James Harris <james.harris@intel.com>	2021-03-02 14:35:21 -05:00
James Harris	2cd0f66515	kernel: sched: change to 3-way thread priority comparison `z_is_t1_higher_prio_than_t2` was being called twice in both the context-switch fastpath and in `z_priq_rb_lessthan`, just to dealing with priority ties. In addition, the API was error-prone (and too much in the fastpath to be able to assert its invarients) - see also #32710 for a previous example of this API breaking and returning a>b but also b>a. Replacing this with a direct 3-way comparison `z_cmp_t1_prio_with_t2` sidesteps most of these issues. There is still a concern that `sgn(z_cmp_t1_prio_with_t2(a,b)) != -sgn(z_cmp_t1_prio_with_t2(b,a))` but I don't see any way to alleviate this aside from adding an assert to the fastpath. Signed-off-by: James Harris <james.harris@intel.com>	2021-03-02 14:27:14 -05:00
James Harris	3330ab12d8	kernel: fix yielding between tasks with same deadline Previously two tasks with the same deadline and priority would always have `z_is_t1_higher_prio_than_t2` `true` in both directions. This is logically inconsistent, and results in `k_yield` not actually yielding between identical threads. Signed-off-by: James Harris <james.harris@intel.com>	2021-02-27 10:25:47 +01:00
Andy Ross	6fb6d3cfbe	kernel: Add new k_thread_abort()/k_thread_join() Add a newer, much smaller and simpler implementation of abort and join. No need to involve the idle thread. No need for a special code path for self-abort. Joining a thread and waiting for an aborting one to terminate elsewhere share an implementation. All work in both calls happens under a single locked path with no unexpected synchronization points. This fixes a bug with the current implementation where the action of z_sched_single_abort() was nonatomic, releasing the lock internally at a point where the thread to be aborted could self-abort and confuse the state such that it failed to abort at all. Note that the arm32 and native_posix architectures, which have their own thread abort implementations, now see a much simplified "z_thread_abort()" internal API. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	c0c8cb0e97	kernel: Remove abort and join implementation (UNBISECTABLE) THIS COMMIT DELIBERATELY BREAKS BISECTABILITY FOR EASE OF REVIEW. SKIP IF YOU LAND HERE. Remove the existing implementatoin of k_thread_abort(), k_thread_join(), and the attendant facilities in the thread subsystem and idle thread that support them. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	bf99f3105f	kernel/timeout: Correctly clamp z_clock_set_timeout() argument This function would correctly suppress attempts to set timeouts that were too soon for the driver or farther out than what was already set, but when it actually set the timeout it would use the requested value and not clamp it to the minimum of it and the current timeout expiration, leading to "too-long" timeouts being set at the driver. In uniprocessor configurations, that turns out to have been benign because something else would always come back along when timeout state changed and fix the broken value before the expiration. But in SMP, this opens up races. For example, the idle thread on one CPU can see that there are no active threads and schedule a maximum value timeout at the same time as the other thread adds a new timeout that expects a near-term expiration. The broken code here would see that the new timeout exists, decide that yes it needs to override, but then set the K_TICKS_FOREVER value it got from the idle thread! Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	419f37043b	kernel/sched: Clamp minimum timeslice when TICKLESS When the kernel is TICKLESS, timeouts are set as needed, and drivers all have some minimum amount of time before which they can reliably schedule an interrupt. When this happens, drivers will kick the requested interrupt out by one tick. This means that it's not reliably possible to get a timeout set for "one tick in the future"[1]. And attempting to do that is dangerous anyway. If the driver will delay a one-tick interrupt, then code that repeatedly tries to schedule an imminent interrupt may end up in a state where it is constantly pushing the interrupt out into the future, and timer interrupts stop arriving! The timeout layer actually has protection against this case. Finally getting to the point: in recent changes, the timeslice layer lost its integration with the "imminent" test in the timeout code, so it's now able to run into this situation: very rapidly context switching code (or rapidly arriving interrupts) will have the effect of infinitely[2] delaying timeouts and stalling the whole timeout subsystem. Don't try to be fancy. Just clamp timeslice duration such that a slice is 2 ticks at minimum and we'll never hit the problem. Adjust the two tests that were explicitly requesting very short slice rates. [1] Of course, the tradeoff is that the tick rate can be 100x higher or more, so on balance tickless is a huge win. [2] Actually it only lasts until a 31 bit signed rollover in the HPET cycle count in practice. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	a202670c18	kernel/sched: Remove now-spurious SWAP_NONATOMIC workaround Recent work to normalize use of the thread QUEUED state bit means that we never attempt to remove unqueued threads from the low-level run queue. So the old workaround for SWAP_NONATOMIC that was trying to detect this condition isn't necessary anymore. Which is serendipitous, because it was written to encode some very specific logic about the circumstances where _current could be dequeued that I'd like to be able to break. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	05c468f594	kernel/sched: Make z_ready_thread() safe vs. already-running threads This is part of the scheduler API, and was always just a synchronized wrapper around the internal ready_thread() function. But where the internal users seem to be careful not to call it on threads that are not known to be already queued or running, the general users in the IPC code seem to be less strict. Add a simple test to detect the case where a thread is already running. Right now this just loops over the array of CPUs, so is O(N) in the CPU count even though N is never more than four for us currently. But this is possible without modifying data structures. A more scalable way to do this if we ever need to run on very parallel systems would be to use another state bit for RUNNING, or to keep a backpointer in the thread struct to the CPU it's running on, etc... Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	6b84ab3830	kernel/sched: Adjust locking in z_swap() Swap was originally written to use the scheduler lock just to select a new thread, but it would be nice to be able to rely on scheduler atomicity later in the process (in particular it would be nice if the assignment to cpu.current could be seen atomically). Rework the code a bit so that swap takes the lock itself and holds it until just before the call to arch_switch(). Note that the local interrupt mask has always been required to be held across the swap, so extending the lock here has no effect on latency at all on uniprocessor setups, and even on SMP only affects average latency and not worst case. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	37866336f9	kernel/sched: Fix race between thread wakeup timeout and abort Aborted threads will cancel their timeouts, but the timeout subsystem isn't protected under the same lock so it's possible for a timeout to fire just as a thread is being aborted and wake it up unexpectedly. Check the state before blowing anything up. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andy Ross	8e16012ab7	kernel/thread: Initialize pended_on field of struct thread_base This got missed, leaving garbage there for restarted threads to trip on. Actually I see multiple uninitialized fields, which seems odd. This code deserves some rework, thread initialization isn't a performance path and we should probably be zeroing the struct out. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-24 16:39:15 -05:00
Andrei Emeltchenko	377456c5af	kernel: Move LOCKED() macro to kernel_internal.h Remove duplication in the code by moving macro LOCKED() to the correct kernel_internal.h header. Signed-off-by: Andrei Emeltchenko <andrei.emeltchenko@intel.com>	2021-02-22 14:56:37 -05:00
Daniel Leung	ece9cad858	kernel: add CONFIG_SRAM_OFFSET This adds a new kconfig CONFIG_SRAM_OFFSET to specify the offset from beginning of SRAM where the kernel begins. On x86 and PC compatible platforms, the first 1MB of RAM is reserved and Zephyr should not link anything there. However, this 1MB still needs to be mapped by the MMU to access various platform related information. CONFIG_SRAM_OFFSET serves similar function as CONFIG_KERNEL_VM_OFFSET and is needed for proper phys/virt address translations. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-22 14:55:28 -05:00
Daniel Leung	ec21c0b92f	kernel: mmu: fix boot address translation macros The Z_BOOT_VIRT_TO_PHYS() and Z_BOOT_PHYS_TO_VIRT() address translation macros are flipped in their calculations. The calculation is supposed to be: virt = phys + ((KERNEL_VM_BASE + KERNEL_VM_OFFSET) - SRAM_BASE_ADDRESS) So fix the them. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-22 14:55:28 -05:00
Anas Nashif	ecdc770c9b	kernel: thread: do not assert on tests remove assert that prevented us from testing non-userspace code on platforms that can do userspace. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-02-22 14:36:06 -05:00
Andy Ross	252764a4ba	kernel/timeout: Fix off-by-one in absolute timeouts The computation was using the already-adjusted input value that assumed relative timeouts and not the actual argument the user passed. Absolute timeouts were consistently waking up one tick early. Fixes #32499 Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-22 08:45:24 -05:00
Peter Bigot	d554d34137	device: add post-process of elf file to manage device handles Following the idiom used for system calls, add script support to read the initial application binary to identify which devices are defined, and to use their offset in the device array as their unique handle rather than the externally-defined ordinal from devicetree. The device dependency arrays are updated to use these handles. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-02-19 15:46:16 -05:00
Peter Bigot	d1a0568e11	device: store device pm busy status in the state structure Move the busy status from a global atomic bit sequence to atomic flags in the device PM state. While this temporarily adds 4 bytes to each PM structure the whole device PM infrastructure will be refactored and it's likely the extra memory can be recovered. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-02-19 10:11:20 -05:00
Peter Bigot	65eee5cb47	device: store initialization status in the state structure Separate the state indicator of whether the initialization function has been invoked from the success or failure of the initialization. This allows precise confirmation that the device is ready (i.e. it has been initialized, and that initialization succeeded). Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-02-19 10:11:20 -05:00
Peter Bigot	8d771f1d8e	device: move device power management state into common dynamic state This avoids the need for distinct object that uses flash to store its initializer. Instead the state is initialized when the kernel is starting up, before anything can reference it. In future refactoring the PM state could be accessed directly without storing an extra pointer in the static device state. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-02-19 10:11:20 -05:00
Peter Bigot	1cadd8b305	device: perform dynamic device initialization during system startup Initialize all device objects in a batch before invoking any code that might try to reference data in them. This eliminates a race condition enabled by the ability to resolve a device structure at build time, and reference it from one device's initialization routine before the device itself has been initialized. While the device is pulled from the sys_init records rather than static devices, all in-tree init_entry records that are associated with devices are produced via Z_DEVICE_DEFINE(), so there should be no static devices that would be missed by instead iterating over the device records. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-02-19 10:11:20 -05:00
Peter Bigot	5b36a01a67	device: binding lookup should return null for unsupported names A null device name should map to a null device. So should a name that is empty. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-02-16 14:39:53 -06:00
Andy Ross	c7d0cb6641	include/kernel_arch_interface.h: Redocument arch_switch() Some recent changes exposed some common "arch_switch() anti-patterns" in various architectures. The documentation technically described this all correctly, but probably wasn't as clear as it should have been. Rewrite, making clear exactly what needs to happen and how the fields should be interpreted. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-14 16:22:45 -05:00
Andy Ross	4ff457113e	kernel/sched: Fix rare SMP deadlock It was possible with pathological timing (see below) for the scheduler to pick a cycle of threads on each CPU and enter the context switch path on all of them simultaneously. Example: * CPU0 is idle, CPU1 is running thread A * CPU1 makes high priority thread B runnable * CPU1 reaches a schedule point (or returns from an interrupt) and decides to run thread B instead * CPU0 simultaneously takes its IPI and returns, selecting thread A Now both CPUs enter wait_for_switch() to spin, waiting for the context switch code on the other thread to finish and mark the thread runnable. So we have a deadlock, each CPU is spinning waiting for the other! Actually, in practice this seems not to happen on existing hardware platforms, it's only exercisable in emulation. The reason is that the hardware IPI time is much faster than the software paths required to reach a schedule point or interrupt exit, so CPU1 always selects the newly scheduled thread and no deadlock appears. I tried for a bit to make this happen with a cycle of three threads, but it's complicated to get right and I still couldn't get the timing to hit correctly. In qemu, though, the IPI is implemented as a Unix signal sent to the thread running the other CPU, which is far slower and opens the window to see this happen. The solution is simple enough: don't store the _current thread in the run queue until we are on the tail end of the context switch path, after wait_for_switch() and going to reach the end in guaranteed time. Note that this requires changing a little logic to handle the yield case: because we can no longer rely on _current's position in the run queue to suppress it, we need to do the priority comparison directly based on the existing "swap_ok" flag (which has always meant "yielded", and maybe should be renamed). Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-14 16:22:45 -05:00
Andy Ross	91946ef21c	kernel/sched: Refactor, unify management of QUEUED state The QUEUED state flag was managed separately from the run queue insertion/deletion, and the logic (while AFAICT perfectly correct) was tangled in a few places trying to keep them in sync. Put the management of both behind a queue_thread()/dequeue_thread() API for clarity. The ALWAYS_INLINE usage seems to be working to get the compiler to condense the resulting multiple assignments. No behavior change. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-14 16:22:45 -05:00
Andy Ross	dd43221540	kernel/sched: Fix race with switch handle The "null out the switch handle and put it back" code in the swap implementation is a holdover from some defensive coding (not wanting to break the case where we picked our current thread), but it hides a subtle SMP race: when that field goes NULL, another CPU that may have selected that thread (which is to say, our current thread) as its next to run will be spinning on that to detect when the field goes non-NULL. So it will get the signal to move on when we revert the value, when clearly we are still running on the stack! In practice this was found on x86 which poisons the switch context such that it crashes instantly. Instead, be firm about state and always set the switch handle of a currently running thread to NULL immediately before it starts running: right before entering arch_switch() and symmetrically on the interrupt exit path. Fixes #28105 Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-14 16:22:45 -05:00
Andy Ross	1ba7414029	kernel/sched: Correct coherence assert Some legacy spots in our IPC layer (legally) pass a NULL wait queue to pend(). Allow this in the coherence assertion. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-11 14:47:40 -05:00
Andy Ross	4dc6a0b89b	kernel/poll: Remove dummy waitq from stack The poll code uses a dummy wait queue so the threads have something to block on, but the previous coherence pass (which rearranged things to put the _poller data elsewhere) missed that this was on the stack, which is not allowed. It actually has no use except as a list, so make it a global static instead. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-11 14:47:40 -05:00
Andy Ross	1d51e888d8	kernel/z_swap: Remove on-stack dummy spinlock The z_swap_unlocked() function used a dummy spinlock for simplicity. But this runs afouls of checking for stack-resident spinlocks (forbidden when KERNEL_COHERENCE is set). And it's executing needless code to release the lock anyway. Replace with a compile time NULL, which will improve performance, correctness and code size. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-11 14:47:40 -05:00
Andy Ross	604f0f44b6	kernel/sched: Add missing lock around waitq unpend calls The two calls to unpend a thread from a wait queue were inexplicably* unsynchronized, as James Harris discovered. Rework them to call the lowest level primities so we can wrap the process inside the scheduler lock. Fixes #32136 * I took a brief look. What seems to have happened here is that these were originally synchronized via an implicit from an outer caller (remember the original Uniprocessor irq_lock() API is a recursive lock), and they were mostly implemented in terms of middle-level calls that were themselves locked. So those got ported over to the newer spinlock but the outer wrapper layer got forgotten. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-10 07:43:18 -05:00
Daniel Leung	371752bce3	kernel: tls: align tdata/tbss sections in stack This lets the linker tell us what kind of alignment is required for both tdata and tbss data when copying them into stack. If they are not aligned as expected by the toolchain, generated code would be accessing incorrect location for thread variables. Fixes #32015 Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-07 23:28:43 -05:00
Nicolas Pitre	f9461d1ac4	mmu: fix ARM64 compilation by removing z_mapped_size usage The linker script defines `z_mapped_size` as follows: ``` z_mapped_size = z_mapped_end - z_mapped_start; ``` This is done with the belief that precomputed values at link time will make the code smaller and faster. On Aarch64, symbol values are relocated and loaded relative to the PC as those are normally meant to be memory addresses. Now if you have e.g. `CONFIG_SRAM_BASE_ADDRESS=0x2000000000` then `z_mapped_size` might still have a reasonable value, say 0x59334. But, when interpreted as an address, that's very very far from the PC whose value is in the neighborhood of 0x2000000000. That overflows the 4GB relocation range: ``` kernel/libkernel.a(mmu.c.obj): in function `z_mem_manage_init': kernel/mmu.c:527:(.text.z_mem_manage_init+0x1c): relocation truncated to fit: R_AARCH64_ADR_PREL_PG_HI21 ``` The solution is to define `Z_KERNEL_VIRT_SIZE` in terms of `z_mapped_end - z_mapped_start` at the source code level. Given this is used within loops that already start with `z_mapped_start` anyway, the compiler is smart enough to combine the two occurrences and dispense with a size counter, making the code effectively slightly better for all while avoiding the Aarch64 relocation overflow: ``` text data bss dec hex filename 1216 8 294936 296160 484e0 mmu.c.obj.arm64.before 1212 8 294936 296156 484dc mmu.c.obj.arm64.after 1110 8 9244 10362 287a mmu.c.obj.x86-64.before 1106 8 9244 10358 2876 mmu.c.obj.x86-64.after ``` Signed-off-by: Nicolas Pitre <npitre@baylibre.com>	2021-02-05 17:19:56 -05:00
Carlo Caione	302a36a115	kernel: mmu: Fix trivial typos Otherwise the memory scheme is confusing to read. Signed-off-by: Carlo Caione <ccaione@baylibre.com>	2021-02-04 14:00:36 -05:00
Martin Åberg	612dad264c	kernel: Decouple TICKS_PER_SEC from TICKLESS_CAPABLE The SYS_CLOCK_TICKS_PER_SEC default may depend on the kernel config for tickless, rather than the capability. Signed-off-by: Martin Åberg <martin.aberg@gaisler.com>	2021-02-04 12:34:23 -05:00
Ioannis Glaropoulos	40aab3276c	Revert "kernel: init: activate FPU for main thread" Activating K_FP_REGS flags introduces stack memory overhead for the main thread in Cortex-M architecture. Several ARM platforms experience main thread stack overflows when building with FPU_SHARING=y. Enabling FPU sharing in main thread should not be the default configuration. Users are welcome to enable FP sharing on the main thread in the application code, in main(). This reverts commit `8453a73ede`. Signed-off-by: Ioannis Glaropoulos <Ioannis.Glaropoulos@nordicsemi.no>	2021-02-03 17:22:50 -05:00
Anas Nashif	39f632e7f0	kernel: fix usage of KERNEL_COHERENCE macro Add missing CONFIG_ to KERNEL_COHERENCE usage in code. Fixes #30380 Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-02-03 10:42:04 -05:00
Daniel Leung	d1495e98e2	kernel: fix arch_mem_coherent() call in spinlock The call to arch_mem_coherent() inside spinlock.h when spinlock validation and memory coherence enabled is causing build error as spinlock.h does not include kernel_arch_func.h directly. However, simply including that file does not work either as this creates the chicken-or-egg in the chain of include files. In order to make spin validation work with kernel coherence enabled, a separate function is created to break the circular dependencies of include files. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-03 10:42:04 -05:00
Daniel Leung	079bc64c16	kernel: fix _kernel argument to arch_mem_coherent Argument to arch_mem_coherent() is a pointer so pass a pointer to _kernel. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-03 10:42:04 -05:00
Andy Ross	887e1abace	kernel/timeout: Fix timeout "sooner" computation There was an edge case in the timeout handling (exposed by, but not strictly related to, the recent timeslice fix): the next_timeout() computation would include time slice expiration as a clamp on the result, but this would be invoked also on the z_set_timeout_expiry() path which gets hooked on entry to a new thread which is needed to set the timeout in the first place. So if no other timer interrupt was scheduled, it was possible to miss the first timeslice interrupt after thread scheduling. The explanation is much longer than the fix (use <= as the comparator instead of <). In practice this was only being hit in the existing test suite on riscv miv running under renode using non-default clock rates. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-02 17:58:40 -05:00
Andy Ross	544475d8a7	kernel/timeout: Schedule zero-time timeouts Fix an edge case that snuck in with the recent fix: if timeslicing is enabled, the CPU's slice_ticks will be zero, and thus match a timeout object's dticks value of zero, and thus get suppressed (because "we already have a timeout scheduled for that") incorrectly. Fixes #31789 Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-02 17:58:40 -05:00
Alexandre Bourdiol	8925af94f2	kernel: Kconfig: increase test default MAIN_STACK_SIZE for ARM Cortex M There are more and more tests that fail due to stackoverflow. Increasing MAIN_STACK_SIZE to fix those issues. Signed-off-by: Alexandre Bourdiol <alexandre.bourdiol@st.com>	2021-02-02 10:05:46 -05:00
Flavio Ceolin	148769c715	sched: timeout: Do not miss slice timeouts Time slices don't have a timeout struct associated and stored in timeout_list. Time slice timeout is direct programmed in the system clock and tracked in _current_cpu->slice_ticks. There is one issue where the time slice timeout can be missed because the system clock is re-programmed to a longer timeout. To this happens, it is only necessary that the timeout_list is empty (any timeout set) and a new timeout longer than remaining time slice is set. This is cause because z_add_timeout does not check for the slice ticks. The following example spots the issue: K_THREAD_STACK_DEFINE(tstack, STACK_SIZE); K_THREAD_STACK_ARRAY_DEFINE(tstacks, NUM_THREAD, STACK_SIZE); K_SEM_DEFINE(sema, 0, NUM_THREAD); static inline void spin_for_ms(int ms) { uint32_t t32 = k_uptime_get_32(); while (k_uptime_get_32() - t32 < ms) { } } static void thread_time_slice(void p1, void p2, void p3) { printk("thread[%d] - Before spin\n", (int)(uintptr_t)p1); / Spinning for longer than slice / spin_for_ms(SLICE_SIZE + 20); / The following print should not happen before another * same priority thread starts. / printk("thread[%d] - After spinning\n", (int)(uintptr_t)p1); k_sem_give(&sema); } void main(void) { k_tid_t tid[NUM_THREAD]; struct k_thread t[NUM_THREAD]; uint32_t slice_ticks = k_ms_to_ticks_ceil32(SLICE_SIZE); int old_prio = k_thread_priority_get(k_current_get()); / disable timeslice / k_sched_time_slice_set(0, K_PRIO_PREEMPT(0)); for (int j = 0; j < 2; j++) { k_sem_reset(&sema); / update priority for current thread / k_thread_priority_set(k_current_get(), K_PRIO_PREEMPT(j)); / synchronize to tick boundary / k_usleep(1); / create delayed threads with equal preemptive priority / for (int i = 0; i < NUM_THREAD; i++) { tid[i] = k_thread_create(&t[i], tstacks[i], STACK_SIZE, thread_time_slice, (void )i, NULL, NULL, K_PRIO_PREEMPT(j), 0, K_NO_WAIT); } /* enable time slice (and reset the counter!) / k_sched_time_slice_set(SLICE_SIZE, K_PRIO_PREEMPT(0)); / Spins for while to spend this thread time but not longer / / than a slice. This is important / spin_for_ms(100); printk("before sleep\n"); / relinquish CPU and wait for each thread to complete / k_sleep(K_TICKS(slice_ticks (NUM_THREAD + 1))); for (int i = 0; i < NUM_THREAD; i++) { k_sem_take(&sema, K_FOREVER); } /* test case teardown / for (int i = 0; i < NUM_THREAD; i++) { k_thread_abort(tid[i]); } / disable time slice */ k_sched_time_slice_set(0, K_PRIO_PREEMPT(0)); } k_thread_priority_set(k_current_get(), old_prio); } Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-01-27 16:55:58 -05:00
Andrew Boie	14c5d1f1f7	kernel: add CONFIG_ARCH_MAPS_ALL_RAM Some arches like x86 need all memory mapped so that they can fetch information placed arbitrarily by firmware, like ACPI tables. Ensure that if this is the case, the kernel won't accidentally clobber it by thinking the relevant virtual memory is unused. Otherwise this has no effect on page frame management. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	6c97ab3167	mmu: promote public APIs These are application facing and are prefixed with k_. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	c7be5dddda	mmu: backing stores reserve page fault room If we evict enough pages to completely fill the backing store, through APIs like k_mem_map(), z_page_frame_evict(), or z_mem_page_out(), this will produce a crash the next time we try to handle a page fault. The backing store now always reserves a free storage location for actual page faults. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	60d306642e	kernel: add z_num_pagefaults_get() Simple counter of number of successfully handled page faults by the core kernel. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	611b626b39	mmu: pin the whole kernel This will enable testing of the implementation until the critical set of pages is identified and known to the kernel. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	a5cb878144	kernel: add demand paging implementation Implement runtime APIs for pinning, paging in, and evicting memory, as well as the page fault hook called from architecture code. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	431b7c0fe5	kernel: add demand paging internal interfaces APIs used by backing store and eviction algorithms. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	a6eca9fab6	kernel: add demand paging arch interfaces Architecture layer hooks for demand paging. See doxygen for these API definitions for more details. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	ecb25fec51	mmu: ensure gperf data is mapped Page tables created at build time may not include the gperf data at the very end of RAM. Ensure this is mapped properly at runtime to work around this. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	299a2cf62e	mmu: arch_mem_map() may no longer fail Pre-allocation of paging structures is now required, such that no allocations are ever needed when mapping memory. Instantiation of new memory domains may still require allocations unless a common page table is used. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	5db615bb38	mmu: add k_mem_free_get() Return the amount of physical anonymous memory remaining. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	8ccec8eba6	kernel: add k_mem_map() interface Allows applications to increase the data space available to Zephyr via anonymous memory mappings. Loosely based on mmap(). Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	e35f179db3	kernel: add page frame management Initialize the page frame ontology at boot and update it when we do memory mappings. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Andrew Boie	73a3e05e40	kernel: add CONFIG_ARCH_HAS_RESERVED_PAGE_FRAMES We will need this to run on x86 with PC-like hardware. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-23 19:47:23 -05:00
Peter Bigot	affa7a1c7e	Revert "device: add post-process of elf file to manage device handles" This reverts commit `40d3653758`. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-01-23 18:01:03 -05:00
Nicolas Pitre	a2011d8af9	z_heap_aligned_alloc(): avoid memory wastage The strategy used in z_heap_aligned_alloc() was to allocate an extra align-sized memory block for storing a pointer to the memory heap. This is wasteful in terms of memory usage when alignment is larger than a pointer width. A loop is needed to find the initial memory start when freeing it which isn't optimal either. Instead, let's have sys_heap_aligned_alloc() rewind a pointer after it is aligned to make just enough room for storing our heap reference. This way the heap reference is always located immediately before the aligned memory and any unused memory is returned to the heap. The rewind and alignment values may coincide in which case only the alignment is necessary anyway. Signed-off-by: Nicolas Pitre <npitre@baylibre.com>	2021-01-22 10:04:43 -05:00
Flavio Ceolin	d21cfd5f36	power: Remove power management conditionals from code Remove conditionals (PM_DEEP_SLEEP_STATES and PM_SLEEP_STATES) from power management code. Now these features are always available when power management is enabled. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-01-22 09:31:20 -05:00
Flavio Ceolin	579f7049c7	power: Move pm subsystem to new power states Migrate the whole pm subsystem to use new power states information from power_state.h and get states and residency properties from device tree. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-01-22 09:31:20 -05:00
Peter Bigot	0ab314f705	kernel: const-qualify objects used to calculate delay values The internal API to measure time until a delay expires does not modify the referenced timeout. Make the functions that call it take pointers to const objects, so that they can be used with pointer to const-qualified containers. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-01-22 08:05:26 -06:00
Anas Nashif	db0732f11d	Revert "kernel: add CONFIG_ARCH_HAS_RESERVED_PAGE_FRAMES" This reverts commit `9d2ebfff58`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	8e84eaf73e	Revert "kernel: add page frame management" This reverts commit `2ca5fb7e06`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	0417b97257	Revert "kernel: add k_mem_map() interface" This reverts commit `69d39af5e6`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	6b82664a5a	Revert "mmu: add k_mem_free_get()" This reverts commit `9111ec2c19`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	a2ec139bf7	Revert "mmu: arch_mem_map() may no longer fail" This reverts commit `db56722729`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	d887e078f9	Revert "mmu: ensure gperf data is mapped" This reverts commit `e9bfd64110`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	65122b776a	Revert "kernel: add demand paging arch interfaces" This reverts commit `b8ae437967`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	cd0beca292	Revert "kernel: add demand paging internal interfaces" This reverts commit `3e51a7a775`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	752934b3c8	Revert "kernel: add demand paging implementation" This reverts commit `2fe1fc53c8`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	75ebe4c7bd	Revert "mmu: pin the whole kernel" This reverts commit `a45486e1d5`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	c2c87c99c7	Revert "kernel: add z_num_pagefaults_get()" This reverts commit `d7e6bc3e84`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	5e978d237c	Revert "mmu: backing stores reserve page fault room" This reverts commit `7a642f81ab`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Anas Nashif	ef17f889dc	Revert "mmu: promote public APIs" This reverts commit `63fc93e21f`. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-22 08:39:45 -05:00
Daniel Leung	d3218ca515	debug: coredump: remove z_ prefix for stuff used outside subsys This removes the z_ prefix those (functions, enums, etc.) that are being used outside the coredump subsys. This aligns better with the naming convention. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-01-21 22:08:59 -05:00
Flavio Ceolin	d6d4a832a4	kernel: build: Make TICKLESS_KERNEL depends on TICKLESS_CAPABLE Tickless kernel option depends on tickless capable being selected by the target. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-01-21 17:20:32 -05:00
Andrew Boie	63fc93e21f	mmu: promote public APIs These are application facing and are prefixed with k_. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	7a642f81ab	mmu: backing stores reserve page fault room If we evict enough pages to completely fill the backing store, through APIs like k_mem_map(), z_page_frame_evict(), or z_mem_page_out(), this will produce a crash the next time we try to handle a page fault. The backing store now always reserves a free storage location for actual page faults. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	d7e6bc3e84	kernel: add z_num_pagefaults_get() Simple counter of number of successfully handled page faults by the core kernel. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	a45486e1d5	mmu: pin the whole kernel This will enable testing of the implementation until the critical set of pages is identified and known to the kernel. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	2fe1fc53c8	kernel: add demand paging implementation Implement runtime APIs for pinning, paging in, and evicting memory, as well as the page fault hook called from architecture code. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	3e51a7a775	kernel: add demand paging internal interfaces APIs used by backing store and eviction algorithms. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	b8ae437967	kernel: add demand paging arch interfaces Architecture layer hooks for demand paging. See doxygen for these API definitions for more details. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	e9bfd64110	mmu: ensure gperf data is mapped Page tables created at build time may not include the gperf data at the very end of RAM. Ensure this is mapped properly at runtime to work around this. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	db56722729	mmu: arch_mem_map() may no longer fail Pre-allocation of paging structures is now required, such that no allocations are ever needed when mapping memory. Instantiation of new memory domains may still require allocations unless a common page table is used. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	9111ec2c19	mmu: add k_mem_free_get() Return the amount of physical anonymous memory remaining. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	69d39af5e6	kernel: add k_mem_map() interface Allows applications to increase the data space available to Zephyr via anonymous memory mappings. Loosely based on mmap(). Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	2ca5fb7e06	kernel: add page frame management Initialize the page frame ontology at boot and update it when we do memory mappings. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Andrew Boie	9d2ebfff58	kernel: add CONFIG_ARCH_HAS_RESERVED_PAGE_FRAMES We will need this to run on x86 with PC-like hardware. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2021-01-21 16:47:00 -05:00
Peter Bigot	40d3653758	device: add post-process of elf file to manage device handles Following the idiom used for system calls, add script support to read the initial application binary to identify which devices are defined, and to use their offset in the device array as their unique handle rather than the externally-defined ordinal from devicetree. The device dependency arrays are updated to use these handles. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2021-01-21 14:49:04 -06:00
Carlo Caione	e77c841023	cache: Expand the APIs for cache flushing The only two supported operations for data caches in the cache framework are currently arch_dcache_flush() and arch_dcache_invd(). This is quite restrictive because for some architectures we also want to control i-cache and in general we want a finer control over what can be flushed, invalidated or cleaned. To address these needs this patch expands the set of operations that can be performed on data and instruction caches, adding hooks for the operations on the whole cache, a specific level or a specific address range. Signed-off-by: Carlo Caione <ccaione@baylibre.com>	2021-01-19 14:31:02 -05:00
Anas Nashif	989ebf6c35	kernel: add vrfy hooks to support userspace with condvar Add needed vrfy hooks for userspace support. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-19 08:55:47 -05:00
Anas Nashif	06eb489c45	kernel: add condition variables Introduce condition variables similar to how they are done in POSIX with a mutex. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-01-19 08:55:47 -05:00
Ningx Zhao	ff7ec0ce40	kernel: poll: remove unreachable code register_event always returns 0, so the conditional will always take the first branch and code in the else part is never reached. Fixes #31282 Signed-off-by: Ningx Zhao <ningx.zhao@intel.com>	2021-01-18 11:02:59 -05:00
Enjia Mai	53ca709828	tests: coverage: exclude the CODE UNREACHABLE of code coverage 1. Exclude the CODE UNREACHABLE line while generating coverage report. 2. Exclude the memory domain deprecated API when calculating code coverage. Signed-off-by: Enjia Mai <enjiax.mai@intel.com>	2021-01-15 12:42:00 -05:00
Nicolas Pitre	7a22a4bdf6	heap: clean up some size related issues First, the maximum heap size must fit in 31 bits worth of chunks because the internal 32-bit field holding the size is shared with the `used` bit. Then the mention of a 256-byte block in the doc is no longer relevant. That pertained to the previous allocator implementation. And ditto for the HEAP_MEM_POOL_MIN_SIZE kconfig option. Signed-off-by: Nicolas Pitre <npitre@baylibre.com>	2021-01-15 12:08:20 -05:00
Andy Ross	ef626571b2	kernel/sched: Optimize deadline comparison Needing to check the current cycle time (which involves a spinlock and register read on most architectures) is wasteful in the scheduler priority predicate, which is a hot path. If we "burn" one bit of precision (and document the rule), we can do the comparison without knowing the current time. 2^31 cycles is still far longer than a live deadline thread in any legitimate realtime app should ever live before being scheduled. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-01-15 11:35:50 -05:00
Maureen Helm	f63385204c	linker: arm: Add cortex_m itcm section Adds a linker section for Cortex-M instruction tightly coupled memory (ITCM), similar to the existing section for DTCM. A new executable MPU region is not added as there isn't currently a need to make this section accessible to user mode. This section can be enabled by setting a device tree chosen node zephyr,itcm. Signed-off-by: Maureen Helm <maureen.helm@nxp.com>	2021-01-15 14:51:20 +01:00
Andy Ross	e956639dd6	kernel: Remove CONFIG_LEGACY_TIMEOUT_API This was a fallback for an API change several versions ago. It's time for it to go. Fixes: #30893 Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-01-14 21:33:16 -05:00
Daniel Leung	fe477ea6d3	kernel: userspace: aligned memory allocation for dynamic objects This allows allocating dynamic kernel objects with memory alignment requirements. The first candidate is for thread objects where, on some architectures, it must be aligned for saving/restoring registers. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-01-13 09:43:55 -08:00
Daniel Leung	0c9f9691c4	kernel: mempool: add z_thread_aligned_alloc This adds a new z_thread_aligned_alloc() to do memory allocation with required alignment. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-01-13 09:43:55 -08:00
Enjia Mai	8d5a22c3c1	tests: enable the code coverage report for qemu_x86_64 Enable the code coverage report for qemu_x86_64 platform. See issue #17991 please. Signed-off-by: Enjia Mai <enjiax.mai@intel.com>	2021-01-05 10:32:38 -08:00
Peter Bigot	19cb44bab7	kernel: idle: fix builds with PM but no system clock PM depends on SYS_CLOCK_EXISTS in Kconfig but several boards have Kconfig overrides that allow the dependency to be ignored, so CONFIG_PM=y even though CONFIG_SYS_CLOCK_EXISTS=n. Fix the code so that the true dependency is reflected in the generated code. Signed-off-by: Peter Bigot <peter.bigot@nordicsemi.no>	2020-12-27 18:18:52 +01:00
Christopher Friedt	135ffaff74	kernel/k_malloc: add k_aligned_alloc This change adds z_heap_aligned_alloc() and k_aligned_alloc() and changes z_heap_malloc() and k_malloc() to be small wrappers around the aligned variants. Fixes #29519 Signed-off-by: Christopher Friedt <chrisfriedt@gmail.com>	2020-12-27 18:17:07 +01:00
Marcin Niestroj	11cb1cf336	kernel: sched: fix legacy timeout calculation in z_tick_sleep Ticks should be assigned directly to timeout value in case of CONFIG_LEGACY_TIMEOUT_API=y, just as they were before referenced patch. Fixes: `7a815d5d99` ("kernel: sched: Use k_ticks_t in z_tick_sleep") Signed-off-by: Marcin Niestroj <m.niestroj@grinn-global.com>	2020-12-18 14:03:25 -05:00
Andrew Boie	d2ad783a97	mmu: rename z_mem_map to z_phys_map Renamed to make its semantics clearer; this function maps physical memory addresses and is not equivalent to posix mmap(), which might confuse people. mem_map test case remains the same name as other memory mapping scenarios will be added in the fullness of time. Parameter names to z_phys_map adjusted slightly to be more consistent with names used in other memory mapping functions. Signed-off-by: Andrew Boie <andrew.p.boie@intel.com>	2020-12-16 08:55:55 -05:00
Daniel Leung	8ad5ad2214	kernel: don't unlock and then lock immediately in idle loop Inside the idle loop, in some configuration, IRQ is unlocked and then immediately locked again. There is a side effect: 1. IRQ is unlocked in middle of the loop. 2. Another thread (A) can now run so idle thread is un-scheduled. 3. Thread A runs to its end and going through the thread self-abort path. 4. Idle thread is rescheduled again, and continues to run the remaining loop when it eventuall calls k_cpu_idle(). The "pending abort" path is not being executed on thread A at this point. 5. Now, thread A is suspended, and the CPU is in idle waiting for interrupts (e.g. timeouts). 6. Thread B is waiting to join on thread A. Since thread A has not been terminated yet so thread B is waiting until the idle thread runs again and starts executing from the beginning of while loop. 7. Depending on how many threads are running and how active the platform is, idle thread may not run again for a while, resulting in thread B appearing to be stuck. To avoid this situation, the unlock/lock pair in middle of the loop is removed so no rescheduling can be done mid-loop. When there is no thread abort pending, it simply locks IRQ and calls k_cpu_idle(). This is almost identical to the idle loop before the thread abort code was introduced (except the check for cpu->pending_abort). Fixes #30573 Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2020-12-15 08:27:27 -05:00
Anas Nashif	3d33d767f1	twister: rename in code Some code had reference to sanitycheck, replace with twister. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-11 14:13:02 -05:00
Enjia Mai	00d7f2cfb6	kernel: thread: make offload_sem visible for releasing it outside In order to release irq_offload semaphore outside kernel/thread.c, we make it visible by modifying it non-static under ztest. This would be needed such as when call irq_offload() to enter interrupt context and a fatal error happened, then you have to release it in your fatal handler, or the irq_offload will still be locked and no longer be using again. Signed-off-by: Enjia Mai <enjiax.mai@intel.com>	2020-12-09 21:52:09 -05:00
Anas Nashif	98cef8376c	power: cleanup kernel idle and power hooks Cleanup code for power management and remove some duplication and isolate power management code from the kernel code. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-09 15:18:29 -05:00
Anas Nashif	8a97717c72	power: rename _pm_idle_exit_notification_disable Rename API to be more consistent with guidelines. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-09 15:18:29 -05:00
Anas Nashif	f9238d6370	power: set_kernel_idle_time_in_ticks -> _set_kernel_idle_time_in_ticks Rename internal function. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-09 15:18:29 -05:00
Anas Nashif	e3937453a6	power: rename _sys_suspend/_sys_resume Be consistent in PM namespaces. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-09 15:18:29 -05:00
Anas Nashif	e0f3833bf7	power: remove SYS_ and sys_ prefixes Remove SYS_ and sys_ from all PM related functions and defines. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-09 15:18:29 -05:00
Anas Nashif	dd931f93a2	power: standarize PM Kconfigs and cleanup - Remove SYS_ prefix - shorten POWER_MANAGEMENT to just PM - DEVICE_POWER_MANAGEMENT -> PM_DEVICE and use PM_ as the prefix for all PM related Kconfigs Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-09 15:18:29 -05:00
Anas Nashif	87ddddae52	Revert "kernel: fix usage of KERNEL_COHERENCE macro" This reverts commit `67c5e6b0c0`. This is causing build issues on some platforms. Revert for now. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-08 14:27:27 -05:00
Maximilian Bachmann	34d7c78f4f	kernel: add k_heap_aligned_alloc k_heap did not have an aligned alloc function, even though this is supported by the internal sys_heap. Signed-off-by: Maximilian Bachmann <m.bachmann@acontis.com>	2020-12-08 13:21:26 -05:00
Anas Nashif	e1d42724e5	kernel: make KERNEL_COHERENCE depend on ARCH_HAS_COHERENCE We can't enable KERNEL_COHERENCE is architecture does not support it. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-08 09:30:02 -05:00
Anas Nashif	67c5e6b0c0	kernel: fix usage of KERNEL_COHERENCE macro Add missing CONFIG_ to KERNEL_COHERENCE usage in code. Fixes #30380 Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2020-12-08 09:30:02 -05:00
Andy Ross	3c2c1d85b0	kernel: Remove z_mem_pool wrapper internals These implemented a k_mem_pool in terms of the now universal k_heap utility. That's no longer necessary now that the k_mem_pool API has been removed. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2020-12-07 21:50:14 -05:00
Andy Ross	c844bd87b3	kernel: Remove legacy mem_pool usage The mailbox and msgq utilities had API variants that could pass old mem_pool blocks through the data structure. That API is being deprected (and the features were obscure), so remove the internal support. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2020-12-07 21:50:14 -05:00
Andy Ross	c770cab1a3	kernel: Make thread resource pools into sys_heaps The k_mem_pool allocator is no more, and the z_mem_pool compatibility API is going away. The internal allocator should be a k_heap always. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2020-12-07 21:50:14 -05:00

1 2 3 4 5 ...

2467 Commits