Robo2u

About

news.prompt20.com is a one-person editorial system for the AI news cycle. Solo engineer, Next.js + TypeScript, deployed on a self-hosted CapRover. Built to test a thesis: a curator who can also code can do the AI Curator job at higher leverage β€” by encoding editorial judgment (which benchmarks count, which sources are dead, which clusters are consensus-real vs hype) into systems instead of doing it from memory every morning.

πŸ›  Editorial system β€” the artifacts

✍️ Three picks in TLDR-style voice

Drafted using yesterday's real news from this site's feed. Lead with the news, second sentence on what changed materially, third on so-what. ~60 words each, engineer audience.

  1. πŸ€–PostTrainBench drops with 23.2% top score

    Opus 4.6 (Claude Code) leads. The benchmark measures whether a model, acting as a CLI agent, can fine-tune a different base LLM across 7 evals end-to-end. 23.2% is SOTA β€” agentic self-improvement is barely starting, and the next 18 months of agent leaderboards will be defined here, not on SWE-Bench.

  2. ⚑Kimi Linear: 75% smaller KV cache, 6Γ— decoding at 1M

    Moonshot's hybrid 3:1 KDA-to-MLA attention ships with FlashKDA CUTLASS kernels (1.72–2.22Γ— prefill speedup vs flash-linear-attention on H20). The first frontier-scale demo that linear-attention variants are production-ready, not just paper-curiosities.

  3. 🎨Black Forest Labs raises $300M, ships FLUX 2 Klein

    Klein is a 9B FP8 open-weights image model that runs on consumer GPUs. Combined with the Series B and the FLUX 2 base release, BFL is now the credible open-weights answer to Midjourney and Sora at meaningful capital. Open image-gen is having its DeepSeek moment.

🎯 Coverage philosophy

What gets in, what doesn't, why. Editorial decisions are encoded in the feed list and benchmark rubric β€” not in my head, so they're reviewable.

πŸ“ˆ By the numbers

110
configured feeds
24
benchmarks rated
14
leaderboards
170+
commits / 60 days

🀝 Why I'd be a strong AI Curator at TLDR