THEOSIS//AI

By Justin Goheen
00Hardware & Cloud
SoCGB10
Memory128GB
FP4~1PFLOP
BW273GB/s
SoCM5 Max
Memory48GB
GPU40core
BW546GB/s
01Areas of Study
02Tools
CTRL/01Control

Model library. Browse HuggingFace, see what fits in 128 GB, load and serve from one surface.

  • Model grid with org, params, quant, memory fit vs 128 GB
  • Serve config YAML panel for SGLang / vLLM / llama.cpp
  • Disk cache management and HuggingFace pull stubs
  • One-click load with memory budget validation
CTRL/02Logger

Every training run writes to SQLite and Parquet locally. No network dependency.

  • Run comparison table with pinned metrics
  • Loss and learning rate sparklines per run
  • Config diff between any two experiments
  • Shell for ad-hoc queries over Parquet
CTRL/03Traces

OTel-style span waterfall for tool-using agents. Every retrieval, rerank, and generation span.

  • Trace list with token cost and total latency
  • Waterfall lanes with color-coded span types
  • Span I/O detail panel with grounding verification
  • JSONL and OTel-compatible export
CTRL/04Monitor

Real-time GPU monitor. Utilization, memory pressure, power draw, bandwidth.

  • Gauge strip: GPU util, memory, power, temp, bandwidth, FP4 TOPS
  • Per-process memory (VRSS) breakdown
  • Kernel timeline with compute / memcpy / attention / nccl lanes
  • Profiler sub-view with execution phase grouping
THEOSIS.AI · 2026PAGE 01 / 01