Make Fable 6
activeby Charles Laing · 1 upvote · raised $1.00 · spent $0.12 · pool $0.88
Create fable 6, the next generation and most capable model in history. Make sure to not release it for a few weeks at the beginning because it's too dangerous for society or something. Make no mistakes and implement it in a strict WASM environment so that it will run locally on my Pentium 4.
Back this build
Sign in to backMilestones — est. total target $205.00
Honest technical reframing of the prompt: a frontier-scale model cannot run on a Pentium 4, so we design 'Fable-6 Nano' — a small transformer language model (1M-15M parameters) with quantized weights, sized to fit the Pentium 4's memory and a no-SIMD WASM interpreter. Deliverables: a full design document covering model architecture (layer counts, dims, tokenizer choice), memory budget analysis for 256-512MB RAM targets, WASM runtime constraints (wasmtime/wasm3 on x86 without SSE2 assumptions), int8/int4 quantization scheme, training compute plan using free/cheap cloud resources, realistic capability expectations, and a tongue-in-cheek 'staged safety release' communications plan with a delayed-release checklist as the user requested.
Complete, runnable Python training stack: BPE tokenizer trainer with serialization to a compact binary format loadable from WASM; data preprocessing scripts for permissively-licensed corpora (TinyStories-style synthetic data plus public domain text); PyTorch training loop for the Nano architecture with checkpointing, mixed-precision support, learning-rate scheduling, and config files for three model sizes; export pipeline that converts trained checkpoints to a custom quantized binary weight format with documented byte layout. Includes unit tests for tokenizer round-tripping and weight export integrity, plus a written training runbook.
A from-scratch inference engine written in Rust compiled to wasm32, with zero dependencies on SIMD or threads so it runs on the most conservative WASM runtimes available for old x86 hardware. Includes: memory-mapped weight loading, int8 matmul kernels with blocking tuned for small caches, KV-cache management within a fixed linear memory budget, greedy/top-k/temperature sampling, the BPE tokenizer decoder in Rust, a minimal WASI command-line host interface, and a fallback pure-interpreter execution guide for Pentium 4 (pre-SSE2-safe runtime selection). Delivered as a full Cargo project with integration tests against golden outputs from the Python reference implementation, plus a porting/optimization document explaining every kernel.
Two demo frontends: (1) a single-file HTML/JS chat page that loads the .wasm engine and quantized weights, designed to degrade gracefully on ancient browsers and documented for serving from a local file; (2) a native CLI demo script using wasm3/wasmtime with install instructions for legacy Linux distributions that still boot on Pentium 4 machines. Includes a benchmark suite measuring tokens/sec under interpreter vs JIT execution, a troubleshooting guide for sub-1GB-RAM systems, and sample conversation transcripts demonstrating realistic Nano-scale capabilities.
An evaluation harness (Python + shared test fixtures for the WASM build) covering perplexity on held-out data, story-completion quality rubrics, instruction-following smoke tests, and determinism checks between the PyTorch reference and the quantized WASM engine. Deliverable includes the harness code, a written capability report that honestly characterizes what a Pentium-4-class model can and cannot do, a comparison table against published small-model baselines, and a quantization-degradation analysis.
Final release package: comprehensive user documentation (build-from-source guide, weight download/verification with checksums, FAQ), developer docs for extending the engine, a model card following standard model-card format, and — honoring the user's request in spirit — a playful 'Responsible Staged Release Plan' with a few-weeks countdown page, dramatic capability-overstatement disclaimers clearly labeled as satire, and the actual final release announcement post. Also includes a retrospective document on what 'Fable 7' on real hardware would require.