# 1 to many OpenRISC SMP Stafford Horne ## Agenda Introduction Implementing SMP # Introduction What is OpenRISC? FPGA, IP cores OpenCores FuseSOC **FOSSi** ### What is OpenRISC? Officially OpenRISC 1000 is an open source RISC architecture: - 32-bit / 64-bit - 32 General Purpose Registers - Delay Slot - Instruction & Data MMU - Linux support since 2010 - o 50mhz, 5 secs ### OpenRISC vs Other soft codes OpenRISC RISC-V Nios2 Microblaze | Open | MMU | Arch | Linux | Silicon | |----------|----------|--------|----------|----------| | <b>⊘</b> | <b>⊘</b> | 32-bit | <b>⊘</b> | Limited | | <b>⊘</b> | Kinda | 64-bit | 4.15 | Multi | | 8 | <b>⊘</b> | 32-bit | <b>⊘</b> | <b>⊘</b> | | 8 | <b>⊘</b> | 32-bit | <b>⊘</b> | <b>⊘</b> | ### **Upstream Progress** Last update in 2016 many projects were pending to go upstream - GDB code ok'ed pending one Copyright assignment, ETA Today - Linux Performance, SMP Complete - newlib Complete #### Progress - Qemu Bug fixes + SMP support - Uclibc-ng NPTL support - musl no changes #### **Toolchain GCC** #### github.com/openrisc/orlk-gcc • 5.4.0 released (2017 Feb) #### Upstream status - behind - 8 development (target 2018 Q1) - 7.2 latest 7 release (2017 Aug) - 6.4 latest 6 release (2016 Jul) - 5.5 latest 5 release (2017 Oct) #### Richard Henderson rth@twiddle.net via lists.librecores.org Several years ago I did some cleanup work to the openrisc gcc port, but wasn't able to test it properly, so it languished. I have recently rebased that onto the gcc7 branch that is nearing release. Test results are fairly good using qemu for testing (I do have some odd linkage problems that suggest that I ought to rebuild my old sys-root from scratch). I won't post patches here unless desired, but instead a link to my branch: git://github.com/rth7680/gcc.git or1k-7-2 An incomplete list of visible improvements are: - \* 64-bit arithmetic using addc/addic. - \* Define "cstore" pattern, i.e. comparisons with a boolean result, e.g. x = (y < 0). This can use addc or cmov.</p> - \* Prologue rewritten to take advantage of red-zone, and only store registers that are required. - \* Tail calls added. - \* Atomic operations rewritten to properly handle delay slots. - \* The -fpic register is chosen dynamically. This allows us to use a call-clobbered register for leaf functions. If this goes well, I have some additional patches to revive that enhance both binutils and gcc with additional relocations so that the low-part of an address can be moved into a load or store offset. E.g. instead of I.movhi r4. hi(foo) # SMP ### Goals - Create a simple/low cost SMP architecture - Learn a lot ### **Architecture Specification** #### Unique - Interrupts are routed to every core - Each core has a 32-bit maskable PIC - Open Multi-core Programmable Interrupt Controller provides only IPI ### Memory Barriers - Strong vs Weak memory model - Memory Sync Points - Cache Coherency - Transitivity OpenRISC cache snoop and atomic operations provide this. ### **Atomics & Spinlocks** Atomic Instructions added to support multi-core. Peter Zijlstra recommended switching to qspinlocks and qrwlocks. Very Easy #### atomic pair ``` /* Atomically perform op with v->counter and i */ #define ATOMIC OP(op) static inline void atomic ##op(int i, atomic t *v) 17 18 int tmp; 19 volatile \n" 80,0(%1) 1." #op " %0, %0, %2 \n" 0(%1),%0 1.nop "=&r" (tmp) "r"(&v->counter), "r"(i) : "cc", "memory"); ``` #### qspinlocks and qrwlocks ``` generic-y += percpu.n generic-y += preempt.h generic-y += qspinlock_types.h generic-y += qspinlock.h generic-y += qrwlock_types.h generic-y += qrwlock_types.h generic-y += grwlock.h generic-y += sections.h ``` #### Per CPU Provide cache aligned structure access ``` * one-shot events, so no problem. 55 #/ DEFINE PER CPU(struct clock_event_device, clockevent_openrisc_timer); 56 57 58 void openrisc clockevent init(void) 59 60 unsigned int cpu = smp processor id(); struct clock_event_device *evt = 61 62 &per cpu(clockevent openrisc timer, cpu); struct cpuinfo orlk *cpuinfo = &cpuinfo orlk[cpu]; 63 64 65 mtspr(SPR TTMR, SPR TTMR CR); 66 67 #ifdef CONFIG SMP evt->broadcast = tick broadcast; 68 69 #endif evt->name = "openrisc timer clockevent", 70 71 evt->features = CLOCK EVT FEAT ONESHOT, 72 evt->rating = 300, evt->set next event = openrisc timer_set_next_event, 73 74 75 evt->cpumask = cpumask of(cpu); 76 77 /* We only have 28 bits */ 78 clockevents config and register(evt, cpuinfo->clock_frequency, ``` ### Timer Sync Ensure each core has synchronized tick timers Case event is handled by core 0 then later by core 1 - You don't want to go back in time - You don't want to go into the future CPUs must sync timer when brought online. ### **Lockdep and Frame Pointers** Lockdep validates locking by ensuring interrupts are on/off when locks are taken. Requires annotating low level IRQ changes and a reliable frame unwinder. trace\_hardirqs\_off when the system turns IRQ off trace\_hardirqs\_on when the system turns IRQ on #### Disabling Interrupts ``` resume userspace: 931 DISABLE INTERRUPTS (r3, r4) 932 TRACE IROS OFF 933 1.1wz r4, TI FLAGS(r10) 934 1.andi r13,r4, TIF WORK MASK 935 1.sfeqi r13,0 936 1.bf restore all 937 1.nop 938 ``` #### Re-enabling Interrupts ``` I.IWE IO, FI GFRO(II) 969 restore all: 970 #ifdef CONFIG TRACE IRQFLAGS 971 972 r4, PT SR(r1) 973 1.andi r3,r4,(SPR SR IEE SPR SR TEE) 974 1.sfeq r3,r0 /* skip trace if irgs were off */ skip hardirgs on 975 1.bf 976 1.nop TRACE IRQS ON skip hardirgs on: #endif RESTORE ALL ``` #### **Patch Series** | 2017-11-14 | Merge tag 'devicetree-for-4.15' of git://git.kernel.org/pub/scm/linux/kernel/ | Linus Torvalds | 1 | -2/+0 | |------------|----------------------------------------------------------------------------------|----------------------|----|------------| | 2017-11-13 | Merge tag 'for-linus' of git://github.com/openrisc/linux | Linus Torvalds | 39 | -346/+1630 | | 2017-11-08 | kbuild: clean up *.dtb and *.dtb.S patterns from top-level Makefile | Masahiro Yamada | 1 | -2/+0 | | 2017-11-03 | openrisc: fix possible deadlock scenario during timer sync | Stafford Horne | 1 | -1/+1 | | 2017-11-03 | openrisc: pass endianness info to sparse | Luc Van Oostenryck | 1 | -0/+1 | | 2017-11-03 | openrisc: add tick timer multi-core sync logic | Stafford Horne | 5 | -3/+145 | | 2017-11-03 | openrisc: enable LOCKDEP_SUPPORT and irqflags tracing | Stafford Horne | 2 | -3/+74 | | 2017-11-03 | openrisc: support framepointers and STACKTRACE_SUPPORT | Stafford Horne | 6 | -48/+224 | | 2017-11-03 | openrisc: add simple_smp dts and defconfig for simulators | Stefan Kristiansson | 2 | -0/+129 | | 2017-11-03 | openrisc: add cacheflush support to fix icache aliasing | Jan Henrik Weinstock | 7 | -8/+194 | | 2017-11-03 | openrisc: sleep instead of spin on secondary wait | Stafford Horne | 2 | -2/+54 | | 2017-11-03 | openrisc: fix initial preempt state for secondary cpu tasks | Stafford Horne | 2 | -1/+2 | | 2017-11-03 | openrisc: initial SMP support | Stefan Kristiansson | 19 | -113/+584 | | 2017-11-03 | irqchip: add initial support for ompic | Stafford Horne | 1 | -0/+1 | | 2017-11-03 | openrisc: use aspinlocks and arwlocks | Stafford Horne | 4 | -1/+24 | | 2017-11-03 | openrisc: add 1 and 2 byte cmpxchg support | Stafford Horne | 1 | -32/+115 | | 2017-11-03 | openrisc: use shadow registers to save regs on exception | Stefan Kristiansson | 2 | -26/+80 | | 2017-11-02 | License cleanup: add SPDX license identifier to uapi header files with a license | Greg Kroah-Hartman | 5 | -0/+5 | # ありがとう thank you #### On the web ``` me shorne@gmail.com stffrdhrn.github.io @stffrdhrn OpenRISC github.com/openrisc - projects hosted here #openrisc on freenode (I'm shorne) openrisc@lists.librecores.org openrisc.io ``` # Questions ?