May 16, 2026

Hacker News Evening Brief: 2026-05-16

Tonight’s Hacker News front page leaned into the practical edges of software: web semantics, Unicode failures, storage behavior, mobile control, security research, AI tooling, and several deeply specific side projects. The stronger discussions were less about novelty and more about what holds up under real use: browser compatibility, hardware constraints, competition rules, licensing, methodology, and whether a tool’s promise survives the messy systems around it.

AI & Tech Policy

SANA-WM, a 2.6B open-source world model for 1-minute 720p video

Summary: NVIDIA’s SANA-WM page presents an efficient minute-scale world model for generating or modeling one-minute 720p video. The title describes it as a 2.6B-parameter open-source world model, but the compact project page excerpt exposes only the heading rather than benchmarks, examples, or license details. The interesting claim is that longer-horizon video/world modeling may be possible at a relatively small model size, though the available pack does not establish the release status of the weights.

HN Discussion: Commenters immediately pulled the idea toward games, asking whether generated worlds can replace hand-authored environments where every object placement is intentional. Skeptics challenged the open-source label if weights are only promised “soon,” while optimists focused on local GPU feasibility and possible future use in games or VR.

DeepSeek-V4-Flash means LLM steering is interesting again

Summary: Sean Goedecke argues that activation steering becomes newly practical if DeepSeek-V4-Flash is good enough for local, agentic coding experiments. The post connects that renewed interest to antirez’s DwarfStar 4 project, which exposes steering as a first-class local-model capability. It explains steering as finding a concept direction in a model’s activations, such as terse responses, then boosting or suppressing that direction during inference.

HN Discussion: Several commenters focused on the safety-sensitive side of steering vectors, especially their use for removing refusals or “abliteration.” Others were more interested in interaction design: if local models expose hidden knobs that hosted frontier labs keep private, steering could become a workflow control rather than a research demo. A correction also pushed back on calling DwarfStar 4 a stripped-down llama.cpp.

After 8 years, I rewrote my open-source PyTorch curvature library

Summary: The repository is pytorch-hessian-eigenthings, a PyTorch tool for efficient Hessian eigendecomposition. The author says the original 2018 package targeted GPU-accelerated curvature analysis of neural-network loss landscapes and has now been rewritten. The practical problem is that Hessian eigenvalues are useful for studying flat minima and low-rank structure, but a full Hessian has memory cost that grows quadratically with parameter count.

HN Discussion: The visible thread is mostly the author’s own context-setting, so the discussion is thinner than the submission title suggests. The concrete technical theme is feasibility: how to estimate curvature metrics for modern models without materializing an impossibly large Hessian. Readers treated it as a maintained niche research tool rather than a general-purpose ML framework.

The sigmoids won’t save you

Summary: Scott Alexander critiques a common AI argument: because exponentials eventually flatten into sigmoids, current capability curves can be safely dismissed. He accepts the technical point that growth cannot continue forever, using epidemics and airspeed records as examples of curves that eventually hit limits. His argument is that invoking a future plateau does not identify where it is, when it arrives, or whether it comes before AI reaches high capability.

HN Discussion: Commenters leaned into prediction humility, arguing that neither trend graphs nor slogans settle whether AI stalls, accelerates, or collapses. A methodological thread debated Lindy’s Law as a default for trends with unknown limits, while others objected that trends are not static cultural artifacts. Several comments also challenged whether current AI benchmarks measure intelligence or only trainable task performance.

Image-blaster: Creates 3D environments, SFX, and meshes from a single image

Summary: image-blaster is a GitHub project described as an image-to-world skillset for Claude. The submission says it can create 3D environments, sound effects, and meshes from one input image. The repository excerpt is mostly GitHub navigation, but the discussion identifies World Labs as a central service behind the workflow and mentions related AI modeling pieces such as texturing and auto-rigging.

HN Discussion: Commenters compared the project with older multi-image reconstruction systems such as Microsoft’s PhotoSynth, with the main leap being a single-image input. The strongest skepticism was about hallucinated geometry outside the visible scene: if the unseen world does not make physical sense, single-image generation may be less useful than it looks. Game developers also asked about adjacent pipelines for consistent isometric sprites.

Show HN: Watch a neural net learn to play Snake

Summary: tinyppo-snake is an in-browser lab for training a neural network to play Snake using PPO. The interface exposes training and watch modes, live weights, environment grids, rollout metrics, score charts, learning-rate sweeps, multi-seed presets, and run comparison. Rather than a static article, the page is an interactive training surface where users can start runs, watch trained-policy rollouts, reset environments, and compare experiments.

HN Discussion: Commenters reported live behavior from their own runs, including one case where a model reached a high score and then appeared to collapse back to zero-scoring play. Another noticed that switching between train and watch modes temporarily hurt training performance, raising questions about UI state or the training loop. The thread also included a related terminal UI Snake AI project and a report of Bitdefender suspicious-page alerts.

Security & Privacy

Gaining control of every projector and camera on campus

Summary: Edna describes discovering that Colorado School of Mines DNS assigns visible subdomains to devices connected to campus Wi-Fi. The post starts from the idea of tracing devices by resolving internal DNS names to IP addresses, then considers approaches such as zone transfers and certificate logs. The visible excerpt frames the project as network exploration that escalated toward campus projectors and cameras, with DNS behavior as the initial weak point.

HN Discussion: Commenters compared the PTR-record enumeration trick with hotel and public networks where guest clients can still resolve internal hostnames through shared DNS. A former Vaddio product manager explained that newer firmware required non-default passwords and that the company shipped tooling to mass-upgrade firmware and change default auth. The thread also questioned how access worked if routing was blocked, and noted a reported campus DNS outage caused by aggressive querying.

A 0-click exploit chain for the Pixel 10

Summary: Google Project Zero describes adapting a previously published Pixel 9 zero-click-to-root exploit chain to the Pixel 10. The first stage updates a Dolby CVE-2025-54957 exploit, mostly by recalculating offsets for the Pixel 10 library. A Pixel 10 mitigation difference mattered: RET PAC replaced a stack-protector target, so the exploit used an overwriteable initialization path instead of __stack_chk_fail.

HN Discussion: Commenters connected the exploit surface to AI-powered phone features that decode message media before a user opens it, reviving old worries about automatic parsing of untrusted content. Patch timing also drew attention because one Android driver bug was reportedly patched within 90 days, reassuring some readers about Google while worrying them about broader Android vendor response. Others wondered whether exploit publication feels more frequent because of real changes or AI security hype.

Frontier AI has broken the open CTF format

Summary: Kabir argues that frontier AI has broken open capture-the-flag competitions because scoreboards no longer cleanly measure human security skill. The author grounds the claim in his own CTF history, including Australian wins and top placements with an international team through 2025. The post says the shift came as AI tools improved enough to solve or accelerate tasks that previously distinguished competitors, making the old online format unrecoverable.

HN Discussion: Readers drew parallels to education, where LLMs can teach but also make it hard to distinguish learning from outsourcing. Competitive programming and code golf were cited as adjacent domains where AI agents are starting to rival top humans if they cannot be banned. Some comments explored whether harder challenges, obfuscation, or model-resistant tasks can help, while asking when that difficulty destroys the original competition.

Where to buy a non-Apple, non-Google smartphone

Summary: The Register surveys smartphone alternatives as Apple and Google impose tighter limits on their phone operating systems. The article frames the issue around the Keep Android Open campaign and the desire to avoid the Apple-Google mobile duopoly. The excerpt indicates coverage of vendors or platforms such as Murena, Punkt, and Volla, while commenters note that several listed options are still Android-based.

HN Discussion: Commenters argued that buying an alternative phone is easier than living with one because banking, government, authentication, postal, and transit apps can force a mainstream device. GrapheneOS came up as a major omission, with one commenter calling a Pixel running GrapheneOS the best practical non-Google option despite the hardware origin. Others compared Huawei HarmonyOS integration with Librem 5/PureOS freedom and app compatibility pain.

Web & Infrastructure

You don’t know HTML Lists

Summary: Frank M. Taylor’s article digs into list semantics beyond the everyday ul and ol examples. It walks through ordered, unordered, description, menu/control, and form-choice patterns, focusing on how to represent collections of content rather than how to style them. The piece is aimed at developers who already write HTML but may not have looked closely at the list-related corners of the HTML specification.

HN Discussion: The most concrete comments came from people testing the examples in browsers, especially datalist and disabled optgroups on Mobile Safari. That turned the thread from pure semantics into compatibility and accessibility tradeoffs. Another line of discussion asked whether HTML linters can enforce semantic tag selection rather than merely catching invalid markup.

Moving away from Tailwind, and learning to structure my CSS

Summary: Julia Evans describes migrating a few small sites away from Tailwind toward semantic HTML and vanilla CSS. She does not frame Tailwind as a mistake; instead, she says it helped when her CSS was chaotic and taught her useful structural habits. The migration became a way to think directly about layout, fonts, colors, common components, and the systems a CSS codebase needs to avoid decay.

HN Discussion: Accessibility-focused commenters argued that developers should start with semantic HTML and then style it, rather than letting utility classes drive the document shape. A recurring debate compared Tailwind with CSS Modules, cascade layers, naming conventions, and other ways to avoid class clashes or dead CSS. Several readers also praised Evans’ explanatory style, while sharper critiques treated Tailwind enthusiasm as a symptom of shallow CSS knowledge.

My Favorite Bugs: Invalid Surrogate Pairs

Summary: George Mandis recounts a rare production bug from migrating a legacy editor to a collaborative stack using TipTap, ProseMirror, and Yjs. Users could keep typing, but after an invalid Unicode surrogate-pair state appeared, edits silently stopped syncing to the Yjs document. The post includes an interactive tool for exploring surrogate pairs, turning the bug into a concrete lesson about string encoding assumptions.

HN Discussion: Commenters shared similar Unicode failures, including environment-dependent corruption caused by SSH changing LANG settings on a server. A technical thread criticized CRDT behavior at the code-unit level and debated safer units such as Unicode scalar values versus extended grapheme clusters. The broader theme was that “normal” string handling becomes fragile across encodings, regexes, collation changes, and collaborative editing.

Clusters become personal (like PCs did)

Summary: Aranya argues that individuals will soon consume a cluster’s worth of compute, with AI chat already making personal requests trigger large backend systems. The post defines a cluster as many computers operated as one entity by orchestration software such as Kubernetes, used for scale and failure tolerance. It predicts a shift from people merely consuming enterprise-hosted clustered services toward using personal clusters at work and eventually at home.

HN Discussion: Commenters pushed back on the premise, asking why an individual needs a cluster when a single Linux VM or several isolated VMs can scale far enough. Cost and complexity were central objections: vertical scaling with a workstation or bare-metal server looked simpler for personal workloads. The thread questioned whether the pooling advantages of clusters still matter when usage is personal rather than shared across unpredictable tenants.

Academic & Research

Δ-Mem: Efficient Online Memory for Large Language Models

Summary: The arXiv paper proposes delta-mem, a lightweight online memory mechanism for LLMs used in long-term assistants and agent systems. Instead of only expanding context windows, it augments a frozen full-attention backbone with a compact associative memory state updated online. The abstract says the method compresses past information into a fixed-size state matrix using delta-rule learning, aiming for lower cost and better reuse of history.

HN Discussion: One precise thread objected that HN title normalization changed the intended lowercase Greek delta symbol. Technical skepticism centered on whether a fixed-size memory state solves capacity and retrieval problems when small query variations can produce different activations. Commenters also wanted practical resource reporting: RAM requirements, time to first token, throughput, latency, and cost comparisons alongside model metrics.

Fecal transplants for autism deliver success in clinical trials

Summary: Refractor reports on Arizona State University work examining fecal transplants and the gut-brain connection in autistic children with gastrointestinal issues. The article says a two-year study found symptom reductions of up to 45 percent and highlights microbial diversity as a possible mechanism. It also notes that the story was originally published earlier and updated with newer information current as of April 2025.

HN Discussion: Commenters stressed confounders around diet and comorbid gastrointestinal problems, including examples of limited diets causing severe nutritional deficiencies. A methodology-focused comment identified a likely clinical trial registration with N=60 and placebo controls, while cautioning that many small autism trials have failed larger replication. Several readers argued the headline can mislead by implying autism itself is treated, when the safer claim is about GI health and associated behavior or comfort.

How to Write to SSDs [pdf]

Summary: The submitted item is a VLDB PDF, so the guard could not extract the paper text directly from the file. The first author linked an extended arXiv version and said its appendix contains additional details and FAQ-style answers beyond the VLDB paper. Discussion quotes the paper as introducing NoWA, a “No Write Amplification” pattern intended to guarantee an SSD write amplification factor of 1 even at full device utilization.

HN Discussion: Commenters were impressed that the NoWA pattern reportedly works on regular commodity SSDs from multiple vendors. A storage-systems thread asked about missing references to Zoned XFS and related filesystem-level data placement work. Database-focused comments highlighted analysis of write amplification, slowdown, and wear for large MySQL and Postgres deployments, with speculation about applying similar analysis to SQLite.

Tech Tools & Projects

Windows 9x Subsystem for Linux

Summary: The Codeberg repository implements a Windows 9x subsystem for Linux and is written mostly in Assembly with a substantial C portion. The excerpt shows recent work on boot/loading, VxD macros, a Linux submodule, build simplification, and spawning a shell on a console. It reads as a compact experimental retro-computing project rather than a polished compatibility layer.

HN Discussion: Commenters connected the submission to a previous HN thread and treated it as part of a niche project lineage. A technical theme was surprise that 2026 Linux kernels can still target i386-era systems despite wider distribution and kernel deprecations. Some comments joked or speculated about whether old Windows environments could have practical maintenance uses.

Accelerate

Summary: AccelerateHS/accelerate is a GitHub project for an embedded Haskell language targeting high-performance array computations. The repository description positions it as a way to express array programs in Haskell while enabling JIT compilation, vectorization, CPU parallelism, or GPU offload. The pack excerpt is mostly GitHub page chrome, but the discussion indicates the project is mature and has existed for roughly a decade.

HN Discussion: Commenters translated the purpose for newcomers as roughly NumPy plus a JIT compiler with mostly standard Haskell syntax and type-signature changes. A small language-design thread contrasted Accelerate with array languages such as APL and J, arguing that Haskell’s types may make similar ideas more approachable. Naming confusion came up because several tools and Apple’s Accelerate.framework use similar names.

Greek Alphabet Cards

Summary: The project is a set of Greek alphabet cards made by a parent to help young children learn letters through visual associations. Each card uses an object whose Greek name begins with the target letter and whose drawing also resembles that letter, linking shape and vocabulary. The author describes using GreekLex, a dictionary with more than 35,000 Modern Greek words and frequency data, to find good candidate objects.

HN Discussion: Commenters compared the idea to Letterland for English and to informal shape mnemonics used in math education for Greek symbols. Some suggested grammar-book exercises as a more traditional way to make the Greek alphabet stick. The thread’s tone was mostly appreciative, including Greek-language praise for the visual design.

Futhark by Example

Summary: Futhark by Example is a hands-on introduction to Futhark, a purely functional, high-performance, data-parallel array language. The page organizes commented programs in increasing complexity, from factorials and primitive values through reductions, scans, polymorphism, gather/scatter, loops, and array flattening. It points readers to an interpreter, implemented benchmarks, projects using Futhark, and a more conventional book-length introduction.

HN Discussion: Commenters praised Futhark’s typed array-shape ideas, especially carrying vector and matrix lengths in type information to prevent CUDA-style mistakes. The name prompted confusion with Elder Futhark runes, with some users expecting a page about reading ancient script. Several comments framed Futhark as a welcome alternative to C/C++-style low-level GPU programming.

Charity – Categorical programming language (1998)

Summary: The linked GitHub document is for Charity, a categorical programming language associated with a 1998-era project. The compact excerpt is mostly GitHub page chrome rather than README content, so details of the language design are not visible in the pack. The visible comments identify one central property: Charity computations are described as terminating, making totality part of the language’s identity.

HN Discussion: Commenters reacted to the language’s surface feel by comparing it to a mix of Python and OCaml. The main technical observation was that guaranteed termination means the language is not Turing-complete in the usual unrestricted sense. The discussion is sparse, so the submission functions mostly as a pointer to an unusual historical language.

I designed a nibble-oriented CPU in Verilog to build a scientific calculator

Summary: FPGA-Calculator is a from-gates-to-scientific-calculator project implemented in Verilog. The submission centers on a nibble-oriented CPU designed specifically to build a scientific calculator rather than relying on an off-the-shelf processor core. Discussion points to a separate detailed write-up, including a chapter on instruction-set design, as an important companion to the GitHub repository.

HN Discussion: Commenters connected the project to HP calculator history, praising construction, accuracy, and RPN culture around models such as the 11C and 15C. Other builders compared it with smaller calculator projects, including a MicroPython calculator, and admired the depth of the FPGA implementation. One technical aside corrected a caption involving HP-71B and HP-48GX hardware history, noting their Saturn CPU connection.

History & Science

How an Australian Teen Team Is Making Radio Astronomy Affordable for Schools

Summary: The OpenRockets Magazine page is titled as a profile of an Australian teen team working on affordable radio astronomy for rural schools. The compact excerpt exposes almost no article body beyond the publication date and login/navigation text, so specific hardware, cost, or deployment claims are not available in the pack. The story is therefore best read cautiously as an education-and-radio-astronomy project whose implementation details were not recoverable here.

HN Discussion: Commenters criticized the article as too thin for such an interesting title and asked for an official site, repository, or technical page. A thread questioned whether the article was AI-generated or promotional. Technically minded readers wanted system architecture and cost-reduction details, comparing it with prior low-cost radio telescope projects that used GNSS receiver chips as RF front ends.

Accelerando (2005)

Summary: The linked page is the 2005 science-fiction work Accelerando, but the guard could not fetch the article text because certificate verification failed. Comment excerpts identify the work as near-future fiction concerned with personal agents, neural networks, rapid social change, and a society accelerating under technological pressure. Because the article body is unavailable in the pack, the safe summary comes from the title and discussion rather than the page text.

HN Discussion: Readers focused on how prescient the book feels in 2026, especially its depiction of wearable agent systems that handle research and tasks for the protagonist. Several compared it with The Quantum Thief, The Culture, and William Gibson, emphasizing futures that feel both strange and causally reachable. Quoted passages about billion-node neural networks and automatic language learning made some 2005 ideas feel less remote.

Project Gutenberg – keeps getting better

Summary: Project Gutenberg’s homepage describes a library of more than 75,000 free ebooks, focused on older works whose U.S. copyright has expired. The site offers EPUB, Kindle, and online reading options, with volunteer-digitized and proofread texts, newest releases, frequently downloaded lists, categories, and volunteer-curated reading lists. A Gutenberg programmer in the thread said the site has received substantial improvements over recent months and that more changes are coming.

HN Discussion: Readers discussed Project Gutenberg’s long history, including its origin in 1971 with Michael Hart digitizing the U.S. Declaration of Independence. Personal comments emphasized the service’s practical value for older readers and people who want frictionless access to classic books. A recurring product complaint was that ebook reader vendors rarely provide a first-class Gutenberg store.

What Were Ancient Greco-Roman Curse Tablets?

Summary: The submitted History.com article is about ancient Greco-Roman curse tablets, but the guard excerpt mostly captured a Sky History landing page rather than the article body. HN comments quote the article as saying archaeologists have recovered more than 1,500 tablets secretly aimed at rivals. Another quoted passage describes the logic of binding someone by curse, such as making them unattractive, ineffective in speech, or causing a chariot wheel to fail.

HN Discussion: One commenter reported a forced redirect to a regional History TV site even with scripts blocked, so access behavior became part of the discussion. Most visible comments treated the subject playfully, asking whether ancient newspapers could verify if curses worked or riffing on familiar curse jokes. A more reflective comment compared curse tablets with prayer, manifestation, and the tendency to credit improbable outcomes to chosen rituals.

The Physics–and Physicality–Of Extreme Juggling (2018)

Summary: The Wired piece is a 2018 article about the physics and bodily demands of extreme juggling. The compact excerpt is mostly page styling and metadata, so the pack does not expose much of the article’s argument or reported detail. The visible discussion indicates one focal example: a 2017 attempt in which Barron reportedly flashed 14 beanbags, a feat the commenter says has not been repeated or exceeded.

HN Discussion: The single visible comment narrows the discussion to record verification and repeatability rather than broad physics analysis. It distinguishes a one-time flash from a sustained or repeated achievement, implying that extreme juggling records depend heavily on definitions. Because the thread is thin, the discussion should be read as a small note on measurement rather than a developed debate.

Business & Industry

Nearly 50 Years Later, WKRP in Cincinnati Becomes a Real Radio Station

Summary: Open Culture reports that Cincinnati-area FM station The Oasis acquired and adopted the WKRP call letters nearly 50 years after the sitcom debuted. The call letters came from a North Carolina nonprofit station that auctioned them as part of a fundraising effort. The station marked the launch by playing the TV show’s theme song for six straight hours and plans to keep playing classic rock from the 1960s through the 1980s, with actor Gary Sandy recording promos.

HN Discussion: Commenters discussed nostalgia for radio work and how modern station programming differs from the free-form DJ ideal portrayed by the show. A fan archive combining show station breaks and song announcements from the show with full-length songs drew attention as a clever preservation project. Licensing problems around the sitcom’s original soundtrack remained a sore point for streaming availability.

Other

Impeaching Every Federal Judge and Justice Who Endorsed Unitary Executive Theory

Summary: Christopher Armitage’s Substack essay argues that federal judges and Supreme Court justices associated with the Federalist Society or unitary executive theory should face impeachment and removal. The author frames the claim as serious enough to require a long treatment grounded in doctrine, history, rulings, and constitutional standards. The excerpt says the argument includes hundreds of federal judges and six sitting Supreme Court justices, but does not expose the detailed legal reasoning.

HN Discussion: The visible HN discussion is very thin, with one commenter shifting the focus from federal judiciary ideology to local judges and bail decisions. That comment treats judicial accountability through a public-safety lens rather than the author’s constitutional-theory lens. The pack does not support a broader claim about community consensus or a developed legal debate.