spin-based implementation for no_std #61

matklad · 2019-10-08T06:54:45Z

We currently don't provide sync module in no_std, as it requires OS-level support for blocking. What lazy_static does in this case is that it has a spin feature, which replaces locking with spinning.

I think just silently replacing OS mutexes with busy waiting is the wrong approach, this is an important distinction which should be reflected in the type system.

So, we should instead add an opt-in spin module, which has the same API as sync, but is based on the spin crate. That is, with both std and spin features enabled, the user should be able to use both once_cell::sync and once_cell::spin.

The text was updated successfully, but these errors were encountered:

richardanaya · 2019-10-22T04:20:47Z

I just ran into needing this tonight and felt bad having to turn to lazy static :( thanks for opening this! once_cell FTW!

josephlr · 2019-12-23T22:24:46Z

So notes on why we might want this (both in once_cell and eventually merged into libstd/libcore):

ring needs a no_std compatible replacement for spin-rs (which is unmaintained), spin-rs no longer maintained (dependency) briansmith/ring#921
getrandom (a dependency of rand) needs one-time initialization for holding onto file handles, checking CPUID, checking OS support for various functionality, etc... Right now, we implement this on our own but ideally we wouldn't have this custom implementation. As we want to eventually make getrandom part of the libstd (see Use getrandom crate for retrieving system entropy? rust-lang/rust#62079 and Use getrandom crate rust-lang/rust#62082), it cannot depend on any std features. So using spinlocks is the best bet.

matklad · 2020-01-01T16:59:25Z

So using spinlocks is the best bet.

I think I disagree with this. Using spin-locks if there's a real operating system around seems bad. If two threads race to run initialization, one thread enters a critical section and is scheduled out of the CPU, the other will be busy waiting for a long time. Moreover, if the first thread is a low-priority one, and the second one has a high priority, we get priority inversion!

Moreover, I don't think std uses blocking at all at the moment when getting random data? I think std only needs randomness for hash maps (is this true?) and there, it uses tls for caching:

https://github.com/rust-lang/rust/blob/e380efa5ecdef714dad72c473fc0933ff4d59283/src/libstd/collections/hash/map.rs#L2459-L2461

Could the getrandom be designed in such a way that it's the client who manages the state?

Something like this

pub struct SysRandom { ... }

impl SysRandom {
    pub fn init() -> SysRandom { ... }
    pub getrandom(&self, dest: &mut [u8]) { ... }
}

The std would then stuff it into a tls, and rand could use a global synchronized OnceCell<SysRandom>.

matklad · 2020-01-01T21:57:19Z

@josephlr here's a demonstration that, in extremely unfortunate cases, the current spin-lock based implementation in getrandom leads to extremely horrible results: https://github.com/matklad/spin-of-death

richardanaya · 2020-01-01T21:59:30Z

To add some context, my desire for spin lock was for no_std + alloc web assembly.

…

On Wed, Jan 1, 2020 at 1:57 PM Aleksey Kladov ***@***.***> wrote: @josephlr <https://github.com/josephlr> here's a demonstration that, in extremely unfortunate cases, the current spin-lock based implementation in getrandom leads to extremely horrible results: https://github.com/matklad/spin-of-death — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#61?email_source=notifications&email_token=AACHZGQS36XCCWGCUTEF2QTQ3UGT7A5CNFSM4I6NXAS2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEH5NKXQ#issuecomment-570086750>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AACHZGVAGLKHDJSFO2LRPF3Q3UGT7ANCNFSM4I6NXASQ> .

matklad · 2020-01-01T22:06:13Z

@richardanaya I'd love to hear more details about your use-case! I have a theory that one actually never wants a spin lock :)

Am I correct that your use-case is basically "I statically know that there always is exactly one thread, so no synchronization is necessary, and I want to use spin just to work around the annoying compiler errors, although I statically know that we'll never actually spin"?

matklad · 2020-01-01T22:41:52Z

@richardanaya if my assumption sounds right, could you check if #82 fulfills your use-case?

matklad · 2020-01-02T04:00:17Z

See also https://matklad.github.io//2020/01/02/spinlocks-considered-harmful.html and https://probablydance.com/2019/12/30/measuring-mutexes-spinlocks-and-how-bad-the-linux-scheduler-really-is/

mark-i-m · 2020-01-04T04:01:54Z

I just read the blog post. I disagree with the argument about interrupt handlers. Doing something potentially blocking in an interrupt handler is just wrong; interrupt handlers are supposed to be short and do minimal work because they are stealing time from the scheduled task. Personally, I think adding a spin-based no_std feature would be fine, and the implementation can be improved later if needed...

RKennedy9064 · 2020-02-22T03:58:26Z

@matklad I'd also be interested in a #![no_std] implementation using spin-locks. I saw this crate recently and wanted to try it out, but without a #![no_std] feature I can't. My main use case would be to see if I could replace lazy_static! in the Rust OS tutorials here https://os.phil-opp.com/vga-text-mode/#lazy-statics.

Since it's building an OS, it's built with #![no_std]. I'm assuming my use case would be extremely small, but figured it couldn't hurt to bring it up. Let me know what you think.

matklad · 2020-02-22T11:39:14Z

Interesting example! I think this is also a case of miss-use of spinlocks. We only need runtime initialization there because unsafe { &mut *(0xb8000 as *mut Buffer) } is not const evaluatable yet.

Rather than using lazy_static, I would suggest doing the following:

pub struct Writer {
    column_position: usize,
    color_code: ColorCode,
    buffer_addr: usize,
}

impl Writer {
    unsafe fn buffer(&self) -> &'static mut Buffer {
        unsafe { &mut (self.buffer_addr as *mut Buffer) }
    }
}

This is slightly less ergonomic (as you need a function call, and not a field access) but has massively simpler runtime behavior.

RKennedy9064 · 2020-02-22T17:21:52Z

@matklad Does this still hold true if there's only ever one instance of Writer, that needs be be accessed by multiple sources and always writes to 0xb8000? The end result is eventually used to create printing macros that are used to display things from the kernel, as well as in hardware interrupts like so.

#[macro_export]
macro_rules! print {
    ($($arg:tt)*) => ($crate::vga_buffer::_print(format_args!($($arg)*)));
}

#[macro_export]
macro_rules! println {
    () => ($crate::print!("\n"));
    ($($arg:tt)*) => ($crate::print!("{}\n", format_args!($($arg)*)));
}

#[doc(hudden)]
pub fn _print(args: fmt::Arguments<'_>) {
    use core::fmt::Write;
    use x86_64::instructions::interrupts;

    interrupts::without_interrupts(|| {
        WRITER.lock().write_fmt(args).unwrap();
    });
}

#[panic_handler]
fn panic(info: &PanicInfo<'_>) -> ! {
    println!("{}", info);
    loop {}
}

extern "x86-interrupt" fn double_fault_handler(
    stack_frame: &mut InterruptStackFrame,
    _error_code: u64,
) -> ! {
    panic!("EXCEPTION: DOUBLE FAULT\n{:#?}", stack_frame);
}

Is something like this still possible without having a public static reference like so?

use spin::Mutex;

lazy_static! {
    pub static ref WRITER: Mutex<Writer> = Mutex::new(Writer {
        column_position: 0,
        color_code: ColorCode::new(Color::Yellow, Color::Black),
        buffer: unsafe { &mut *(0xb8000 as *mut Buffer) },
    });
}

Using your implementation, wouldn't you still need to create a pub static ref to the writer so that i can be accessed everywhere, including interrupts? Then wouldn't I still need lazy_static to make this possible since there's #![no_std] support?

I'm definitely open to different approaches for something like this, just didn't want to throw to many details in my first post. Let me know your thoughts/suggestions.

matklad · 2020-02-24T12:18:18Z

Yeah, I still think that lazy_static ideally should not be here. Specifically,

    interrupts::without_interrupts(|| {
        WRITER.lock().write_fmt(args).unwrap();
    });

does synchronization three times:

first, we synchronize on lazy-static initialization
then, we disable interrupts (which is also synchronizatio)
finally, we lock a lock

I'd say that two of three synchronizations are unnecessary. They don't necessary make your program worse, but they are not the minimal solution.

Here's how I'd do this API:

/// Kernel Spin Lock.
////
/// This locks disables local interrupts and, on multiprocessor systems, 
/// additionally locks a global atomic flag.
/// This lock is safe to use in any context
/// 
/// See also https://www.kernel.org/doc/Documentation/locking/spinlocks.txt
pub struct KSpinLock<T> {
    #[cfg(has_two_cores)]
    locked: AtomicBool,
    value: T,
}

impl  KSpinLock<T> {
    pub const fn new(value: T) -> KSpinLock<T> { ... }

    pub fn lock(&self) -> KSpinLockGuard<'_, T> {
        disable_interrupts();
        #[cfg(has_two_cores)] while !self.locked.cas(false, true, Acquire) {}
        KSpinLockGuard { ... }
    }
}

pub struct KSpinLockGuard<T> {
    // this probably should store prev value of interrupt flags?
}

impl Drop for KSpinLockGuard<T> {
    fn drop(&mut self) {
        restore_interrupts();
        #[cfg(has_two_cores)] self.locked.store(false, Release)
    }
}

/// ...

pub struct Writer<'a> {
  column_position: usize,
  buffer: &'a mut Buffer,
}

// This struct we need to work-around lack of const-fn
struct WriterState {
  column_position: usize,
  buffer_addr: usize
}

pub fn with_writer(f: impl FnOnce(Writer<'_>)) {
    static WRITER_STATE: KSpinLock<WriterState> = 
        KSpinLock::new(WriterState { column_position: 0, buffer_addr: 0xb8000 }); 
    let mut state = WRITER_STATE.lock();
    let buffer: &mut Buffer = unsafe {&mut *(state.buffer_addr as *mut Buffer) };
    let writer = Writer { column_position: state.column_position, buffer };
    f(writer);
}

Specifically:

the global data doesn't really need any initaization, in theory it can just be there in our kernel binary
so we use available const-fn powers to get the binary we want, even if the Rust API is pretty (callback instead of &'static mut)
we also abstract synchronization into a dedicated kernel mutex, which manages both interrupts (which is also required) and global "locked" flags (which is only required if there are several CPUs).

matklad · 2020-02-24T12:20:07Z

I do feel like this is a wall of code, in comparison with just using off the shelf lazy_static,mutex,etc. But I believe this wall of code has an important advantage in that it achieves the runtime behavior you really want. In particular, it doesn't do unnecessary late initialization, and it also does the exactly appropriate amount of synchronization.

matklad · 2020-02-24T12:22:10Z

Hm, I guess a more elegant hack would be to do this:

#[repr(C)]
pub struct Writer<'a> {
  ...
  buffer: &'a mut Buffer,
}

[repr(C)]
struct WriterState {
  ...
  buffer_addr: usize
}

and then mem::transmute usize-state to an &'a mut state in the with_writer function.

RKennedy9064 · 2020-02-25T04:58:42Z

@matklad Wow thanks for taking the time to come up with such a detailed example. I should hopefully have time in the next few days to test this out and see how it goes. I did have a few clarifying questions if you had the time.

If I understand correctly, is with_writer basically creating the lock, along with the writer and calling it once so that it just exists in the kernel binary? Then I could assign the writer using with_writer in order to call my write functions in various places and lock as needed?

Also, I noticed that one of the comments mentions needing the struct because of a lack of const-fn in Rust. Since I'm using nightly, would I be able to leverage #![feature(const_fn)] instead using WriterState, and if so, would it be advisable?

Again thanks for taking the time to look into this and provide detailed examples. This approach definitely looks promising and I like it more then having to rely on an untyped macro.

tdonovan4 · 2020-04-06T18:51:05Z

Any update on this?

matklad · 2020-04-06T19:29:17Z

@tdonovan4 not really, I still haven't seen a case which wouldn't become better by replacing spinlock with a more appropriate synchronization.

tdonovan4 · 2020-04-06T21:35:31Z

@matklad thinking about it, you might be right. I believe I could adapt my case to use #82 instead once it's merged. Thanks.

zesterer · 2020-10-10T14:02:55Z

As a point of interest, spin-rs is now maintained again.

richardanaya · 2020-11-14T01:23:30Z

Still curious about this issue :) I use spin-rs all the time in WebAssembly land. Mostly trying to get binary sizes down as much as possible.

richardanaya · 2020-11-14T01:25:48Z

FYI, to describe my use case, there's a lot of asynchronous code in web lang. WebAssembly is multi-entrant so I often times end up in scenarios where one function needs to have a common global mutexed-object. (e.g. like a common game state that render function and click handler can modify).

zesterer · 2020-11-15T13:08:22Z

spin-rs is now trying to mirror many of the APIs in std::sync so we'd definitely be happy to accept a spinning implementation.

rockboynton · 2022-10-10T22:51:58Z

Jump bumping this for interest

matklad · 2022-10-22T19:34:58Z

Closed by #195

josephlr mentioned this issue Jan 2, 2020

use_file: Remove use of spin-locks rust-random/getrandom#125

Merged

RKennedy9064 mentioned this issue Feb 25, 2020

Switch to different locking crate (as reported by RUSTSEC-2019-0031) phil-opp/blog_os#737

Closed

jplatte mentioned this issue Dec 30, 2020

Switch from lazy_static to once_cell tokio-rs/tracing#1165

Closed

This was referenced Jan 11, 2021

AlmostOnceCell #133

Closed

Switch from lazy_static to once_cell tkaitchuck/aHash#65

Closed

jplatte mentioned this issue Jun 14, 2022

Replace lazy_static with once_cell clap-rs/clap#3828

Closed

matklad closed this as completed Oct 22, 2022

mkroening mentioned this issue Nov 6, 2022

Support custom mutexes via lock_api on no_std #207

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spin-based implementation for no_std #61

spin-based implementation for no_std #61

matklad commented Oct 8, 2019

richardanaya commented Oct 22, 2019

josephlr commented Dec 23, 2019

matklad commented Jan 1, 2020

matklad commented Jan 1, 2020

richardanaya commented Jan 1, 2020 via email

matklad commented Jan 1, 2020 •

edited

matklad commented Jan 1, 2020

matklad commented Jan 2, 2020

mark-i-m commented Jan 4, 2020

RKennedy9064 commented Feb 22, 2020

matklad commented Feb 22, 2020

RKennedy9064 commented Feb 22, 2020

matklad commented Feb 24, 2020

matklad commented Feb 24, 2020

matklad commented Feb 24, 2020 •

edited

RKennedy9064 commented Feb 25, 2020

tdonovan4 commented Apr 6, 2020

matklad commented Apr 6, 2020

tdonovan4 commented Apr 6, 2020

zesterer commented Oct 10, 2020

richardanaya commented Nov 14, 2020 •

edited

richardanaya commented Nov 14, 2020

zesterer commented Nov 15, 2020

rockboynton commented Oct 10, 2022

matklad commented Oct 22, 2022

spin-based implementation for no_std #61

spin-based implementation for no_std #61

Comments

matklad commented Oct 8, 2019

richardanaya commented Oct 22, 2019

josephlr commented Dec 23, 2019

matklad commented Jan 1, 2020

matklad commented Jan 1, 2020

richardanaya commented Jan 1, 2020 via email

matklad commented Jan 1, 2020 • edited

matklad commented Jan 1, 2020

matklad commented Jan 2, 2020

mark-i-m commented Jan 4, 2020

RKennedy9064 commented Feb 22, 2020

matklad commented Feb 22, 2020

RKennedy9064 commented Feb 22, 2020

matklad commented Feb 24, 2020

matklad commented Feb 24, 2020

matklad commented Feb 24, 2020 • edited

RKennedy9064 commented Feb 25, 2020

tdonovan4 commented Apr 6, 2020

matklad commented Apr 6, 2020

tdonovan4 commented Apr 6, 2020

zesterer commented Oct 10, 2020

richardanaya commented Nov 14, 2020 • edited

richardanaya commented Nov 14, 2020

zesterer commented Nov 15, 2020

rockboynton commented Oct 10, 2022

matklad commented Oct 22, 2022

matklad commented Jan 1, 2020 •

edited

matklad commented Feb 24, 2020 •

edited

richardanaya commented Nov 14, 2020 •

edited