Nov 14, 2013 Josef Weidendorfer (replacement today for Prof. Gerndt)
2 / 15
for easy integration with other parts into custom chips for embedded market: SoC
includes GPU, network, controllers,
WS12/13
3 / 15
WS12/13
4 / 15
WS12/13
5 / 15
WS12/13
6 / 15
Apple A5
used in iPad2, iPhone 4S manufactured by Samsung Cortex A9, ARMv7 ISA
Others
Texas Instruments OMAP3/4, NVIDIA Tegra2/3
7 / 15
WS12/13
WS12/13
8 / 15
ARMv7 ISA
RISC (Reduced Instruction Set Computing)
few, simple instructions (~32) fixed format (4 bytes for 32bit encoding)
load/store architecture
explicit instructions for memory access simple + indexed addressing modes multiple load/store with modifying index
9 / 15
Special
r13: stack r14: link register (return address for function calls) r15: program counter CPSR: status register (+ SPSR saved)
including N/Z/C/V
NEON
separate 32 64bit / 16 128bit vector registers
10 / 15
WS12/13
BL <subroutine>
Stores return address in LR Returning implemented by restoring the PC from LR For non-leaf functions, LR will have to be stacked func1 func2
: : BL func1 : : STMFD sp!, {regs,lr} : BL func2 : LDMFD sp!, {regs,pc} : : : : : MOV pc, lr
13 / 15
WS12/13
14 / 15
WS12/13
Cortex A9
1-4 cores, 16-64 kB L2I/D, 0-8 MB L2 speculative out-of-order Apple A5, OMAP4 (Pandaboard), Tegra3 (Nexus 7)
Cortex A15
more aggressive OoO, more stages, VFP-4, OMAP5, Samsung Exynos 5x (Nexus 10) can compete with current x86 Atom processors
15 / 15
WS12/13
for massively parallel server loads (simple, but lots of cores?) implementations expected for 2014 by AMD, Broadcom, Samsung,
16 / 15
WS12/13
More information
ARM Architecture Reference Manual (ARM ARM) CRE podcast about ARM, Wikipedia @TUM: new introductionary lab course with recently sponsored BeagleBoards (~40) inauguration event at Nov 23, 2013 with representatives from ARM, TI, STMicro
17 / 15
WS12/13
19 / 15
WS12/13