2024-08-27

"Tinyboxes finally have a buy it now button"

Hardware & Specs

Two main variants: “red” with 6×7900 XTX GPUs (~$15k) and “green” with 6×4090 GPUs (~$25k).
Air‑cooled chassis, roughly 3200W max power draw, large under‑desk style case rather than dense rackmount.
CPU is described only generically (EPYC‑class); several commenters find the CPU/RAM specs vague and possibly underpowered relative to the GPU count.
PCIe 4.0 x16 links for each GPU; discussion around bandwidth marketing (32 GB/s per direction vs “64 GB/s” bidirectional spec).
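The 32 vs “64” GB/s disagreement comes down to whether one direction or both are counted. A minimal sketch of that arithmetic, assuming the standard PCIe 4.0 signalling figures (16 GT/s per lane, 128b/130b encoding) and ignoring protocol overhead, so real throughput will be lower:

```python
# Assumed PCIe 4.0 link parameters (standard figures, not from the thread).
GT_PER_S_PER_LANE = 16.0         # giga-transfers per second, one bit each
ENCODING_EFFICIENCY = 128 / 130  # 128b/130b line encoding
LANES = 16


def pcie_gb_per_s_one_way(gt_per_s=GT_PER_S_PER_LANE,
                          lanes=LANES,
                          efficiency=ENCODING_EFFICIENCY):
    """Raw one-direction bandwidth in GB/s, before protocol overhead."""
    return gt_per_s * lanes * efficiency / 8  # divide by 8: bits to bytes


one_way = pcie_gb_per_s_one_way()
both_ways = 2 * one_way  # the "bidirectional" marketing number

print(f"per direction: {one_way:.1f} GB/s")
print(f"bidirectional: {both_ways:.1f} GB/s")
```

Under these assumptions the link carries about 31.5 GB/s each way, so a “64 GB/s” figure only holds when traffic in both directions is summed.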

Performance, Bottlenecks & Workloads

Some see it as strong GPU‑per‑dollar compared with high‑end multi‑4090 workstations.
Concerns that limited CPU and RAM could bottleneck workloads with heavy preprocessing.
Debate over PCIe and interconnect: fine for many workloads on a single box, but much slower than InfiniBand for large distributed training.
Several comments say it cannot realistically fine‑tune a 70B model; VRAM is borderline and interconnect is too weak for multi‑box scaling at that size.
One view: OCP 3.0 slot with ~200 Gbps NIC is adequate up to a few‑billion‑parameter models, but not beyond; 70B remains “unclear/likely no.”
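The “VRAM is borderline” claim for 70B can be checked with rough arithmetic. This sketch assumes 24 GB per card (the stock capacity of both the 7900 XTX and the 4090), 2 bytes per parameter for fp16/bf16 weights, and the common rule of thumb of about 16 bytes per parameter for full fine-tuning with mixed-precision Adam (weights, gradients, and fp32 optimizer state). Activations and framework overhead are ignored, and the byte counts are assumptions, not figures from the thread:

```python
GPUS = 6
VRAM_PER_GPU_GB = 24   # assumed: stock 7900 XTX / 4090 capacity
PARAMS = 70e9          # the 70B model discussed in the thread

BYTES_WEIGHTS_ONLY = 2     # assumed: fp16/bf16 weights
BYTES_FULL_FINETUNE = 16   # assumed: mixed-precision Adam rule of thumb


def footprint_gb(params, bytes_per_param):
    """Memory footprint in decimal GB for a given per-parameter cost."""
    return params * bytes_per_param / 1e9


total_vram_gb = GPUS * VRAM_PER_GPU_GB
weights_only_gb = footprint_gb(PARAMS, BYTES_WEIGHTS_ONLY)
full_finetune_gb = footprint_gb(PARAMS, BYTES_FULL_FINETUNE)

print(f"total VRAM:        {total_vram_gb} GB")
print(f"weights only:      {weights_only_gb:.0f} GB")
print(f"full fine-tune:    {full_finetune_gb:.0f} GB")
print(f"weights fit:       {weights_only_gb <= total_vram_gb}")
print(f"fine-tune fits:    {full_finetune_gb <= total_vram_gb}")
```

On these assumptions the half-precision weights alone nearly fill the box, which matches the commenters' view that full fine-tuning at 70B is out of reach without quantization, parameter-efficient methods, or offloading.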

Power, Electrical & Cooling Concerns

3200W draw triggers extensive discussion of residential power: circuit derating to 80%, AFCI nuisance trips, transient GPU spikes, and code limits on continuous loads.
US users may need two separate 120V circuits or a dedicated 240V/20–50A feed; EU users note standard 230V outlets are not rated for 3.2kW 24/7.
Suggestions: dedicated circuits, mini‑split AC, separate shed, or basement placement; some argue this extra infrastructure undercuts the “cheap compute” story.
Significant concern about dumping 3kW of heat into typical homes, especially in hot climates.
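The circuit and heat concerns above reduce to a few multiplications. A rough sketch, using the 80% continuous-load derating cited in the thread, typical US breaker sizes chosen here for illustration, and the standard conversion of about 3.412 BTU/h per watt. It assumes the load could be split evenly across circuits, which depends on how the PSUs are wired:

```python
import math

LOAD_W = 3200              # max draw quoted for the box
CONTINUOUS_DERATE = 0.80   # 80% continuous-load figure from the thread
BTU_PER_HR_PER_WATT = 3.412


def continuous_capacity_w(volts, amps, derate=CONTINUOUS_DERATE):
    """Continuous watts a circuit supports after derating."""
    return volts * amps * derate


def circuits_needed(load_w, capacity_w):
    """Whole circuits required if the load splits evenly (an assumption)."""
    return math.ceil(load_w / capacity_w)


# Illustrative circuit sizes, not taken from the thread.
circuits = {
    "120V/15A": continuous_capacity_w(120, 15),
    "120V/20A": continuous_capacity_w(120, 20),
    "240V/20A": continuous_capacity_w(240, 20),
}

for name, capacity in circuits.items():
    count = circuits_needed(LOAD_W, capacity)
    print(f"{name}: {capacity:.0f} W continuous, {count} circuit(s) needed")

heat_btu_per_hr = LOAD_W * BTU_PER_HR_PER_WATT
print(f"heat output: {heat_btu_per_hr:.0f} BTU/h")
```

With these numbers, two 120V circuits only suffice if they are 20A; a single 240V/20A feed covers the load on its own. The heat figure gives a starting point for sizing any added cooling.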

Value, Pricing & Alternatives

Raw GPU cost of the red model is ~⅓ of total; the rest is attributed to CPU, motherboard, RAM, storage, PSUs, custom case, assembly, and support.
Some feel the box is overpriced versus DIY builds (e.g., ~$4k dual‑GPU rigs) or professional alternatives (Supermicro + H100 in colo, or cloud).
Others argue hobbyists undervalue engineering, integration, and “works out of the box” convenience.

Target Use Cases & Scalability

Seen as appealing for individuals or small teams wanting lots of local GPU without cloud.
Skepticism about using many of them as a cluster: networking is limited, form factor is bulky (≈15U each), and operations costs rise quickly.
Questions remain about long‑term business viability and support; it is unclear how often the hardware will be refreshed.
