Arm's Cortex X925: Reaching Desktop Performance
Apple Silicon vs Cortex-X925 and Other ARM Cores
- Many commenters find it odd the article doesn’t compare X925 to Apple’s M-series, given Apple’s status as ARM performance leader and similar “desktop” positioning.
- Counterpoint: Apple doesn’t sell its CPUs as components, so they’re irrelevant for “industry” use; the article focuses on licensable ARM cores.
- Rebuttal: The piece already compares to AMD/Intel, so excluding Apple feels arbitrary; X925-based systems (e.g., GB10/DGX Spark) compete with Macs as products.
- Benchmarks cited from Geekbench: M5 beats a Ryzen 9 9950X in single-thread but lags in multi-thread; M4 Max/Mac Studio offer strong laptop/desktop performance.
- Some see Apple as single-thread “king,” but note that server chips like Graviton 4 or AmpereOne win on core count and different workloads.
Ecosystem, OS, and Linux Concerns
- Several participants reject Apple hardware despite performance due to macOS UX, ecosystem lock-in, repairability, and lack of official Linux support.
- Others highlight Asahi Linux’s progress but note poorer battery life and power management vs macOS.
- Virtualization on Apple Silicon is praised (Apple’s framework, containers, QEMU), though Rosetta 2’s future changes raise concerns.
Relevance of “Desktop Performance” and Product Timing
- Debate over what “desktop performance” even means today given diversity of form factors (mini PCs, laptops, workstations).
- Some argue power and perf/W should be central to any desktop comparison; others prioritize raw performance when plugged into mains.
- Criticism that ARM core announcements often precede real products by years; others note X925-based hardware is already shipping and that long pre-briefing isn’t unique to ARM.
Memory Models and Concurrency Bugs
- Discussion on whether wider ARM desktop use will expose more race conditions due to ARM’s weaker memory model vs x86’s stronger ordering.
- Explanation that C/C++ memory models allow behaviors x86 happens not to expose, so bugs may surface when moving to ARM.
- Historical note: multiple architectures (PowerPC, MIPS, Alpha, Windows NT’s early multi-ISA design) were useful for uncovering such portability bugs.
SIMD, Cache, and Microarchitecture Details
- X925 praised for high IPC but seen as disadvantaged vs Zen 5 in heavy vector/FP workloads due to 128-bit SIMD width and unbalanced FMA vs load throughput.
- Some argue wide SIMD (e.g., AVX-512) matters mainly for specialized workloads; everyday desktop use is more latency/UI bound.
- Separate discussion on 4KB vs 16KB page sizes constraining L1 cache design on x86 vs recent ARM cores, and how that affects performance and power.