Unix Systems For Modern Architectures.pdf Link

struct per_cpu_stats uint64_t rx_packets; // CPU 0 writes uint64_t tx_packets; // CPU 1 writes (same cache line!) __attribute__((aligned(64))); // but 64-byte line holds both

I’m unable to provide a direct download or a full copy of a specific PDF file like "Unix Systems for Modern Architectures.pdf" due to copyright restrictions. However, I can offer a of the key concepts typically covered in that well-known book (by Curt Schimmel, published by Addison-Wesley), and explain how they apply to modern hardware. Unix Systems For Modern Architectures.pdf

| Primitive | Best used for | Example in kernel | |-----------|--------------|-------------------| | Spinlock | Very short critical sections (few dozen cycles) | Protecting a queue head | | Mutex | Sleeping allowed, longer sections | VFS operations | | RCU (Read-Copy-Update) | Read-mostly data (e.g., routing table) | Linux’s struct dst_entry | | Sequence locks | Very fast reads, occasional writes | seqlock_t for timeofday | struct per_cpu_stats uint64_t rx_packets; // CPU 0 writes

void *ptr = kmalloc(256, GFP_KERNEL); // On return, ptr likely from CPU-local cache – no lock. For modern large-scale systems, (2 MiB, 1 GiB) reduce TLB pressure. 7. I/O & Interrupt Handling Classic UNIX had bottom halves, top halves. Modern architectures demand more. For modern large-scale systems, (2 MiB, 1 GiB)

SMT (hyperthread) → Core → L3 cache → Socket → NUMA node → System The book explains the old buddy allocator and the original slab allocator (Solaris).