The original C++ engine compiled to WebAssembly with SIMD — no Python,
no ONNX, no server. The same pf_classify code path runs
natively in your browser. Source:
GitHub
Downloads the q8 GGUF (~1.6 GB, cached after first load) and runs the full ggml inference pipeline client-side via WebAssembly SIMD.