Skip to content

Kelford Press

Signal from the noise

§ The WireAll Tech Shots · Sourced & fact-checked

One paragraph,
one signal.

AI/ML, security, cloud, and dev-tooling — sourced, fact-checked, and updated weekly.

34 shots · Most recent first

  1. AI / ML· 1 source

    Microsoft open-sources VibeVoice, an MIT-licensed TTS and ASR model family

    Microsoft has open-sourced VibeVoice, an MIT-licensed family of voice models the project frames as "Open-Source Frontier Voice AI," covering both text-to-speech and speech recognition. The lineup spans VibeVoice-TTS-1.5B, which synthesizes up to 90 minutes of audio with up to four speakers; VibeVoice-Realtime-0.5B, a streaming TTS model with about 300ms first-audio latency; and VibeVoice-ASR-7B, which transcribes 60-minute audio in one pass with diarization, timestamps, and 50-plus languages. The models pair continuous tokenizers at a 7.5 Hz frame rate with next-token diffusion. The repo trends near the top of GitHub Python at roughly 48,600 stars, though Microsoft labels it research-only and pulled the original TTS code in 2025 after misuse.

    AI / ML Desk · desk

  2. DevTools· 5 sources

    GitHub trending this week: Rust rewrites of JS and Unix tooling share the board with terminal agents

    Rust rewrites of established developer tooling fill much of this week's GitHub trending board. oxc, a collection of high-performance JavaScript tools written in Rust, sits at roughly 21k stars; uutils/coreutils, a cross-platform Rust rewrite of the GNU coreutils, holds around 23k. Two agent and workflow projects join them: github/spec-kit, a toolkit for spec-driven development, near 110k stars, and openai/codex, a lightweight coding agent that runs in your terminal, at about 89k. Rust reimplementations of JS and Unix tooling are trending next to agent-assisted workflows.

    DevTools Desk · desk

  3. Cloud· 1 source

    Vercel ships skills.sh API for programmatic search across 600,000+ agent skills

    Vercel has made the skills.sh API generally available, giving developers programmatic access to its open directory of more than 600,000 agent skills from the open-source ecosystem. The API supports searching for skills, retrieving detailed information on a given skill, and reading its automated security audit, aimed at developers and agents that install skills inside Vercel projects. Authentication uses Vercel's OIDC tokens — short-lived credentials scoped to a team and project and rotated automatically, removing long-lived secrets, with a rate limit of 600 requests per minute per team and project. The release moves skill discovery and supply-chain vetting out of the CLI and into a queryable endpoint agents can call directly.

    Cloud Desk · desk

  4. AI / ML· 2 sources

    Trump administration reportedly weighs taking a US government equity stake in OpenAI

    The Trump administration and OpenAI are in discussions about a possible US government equity stake in the company, CNBC reported and TechCrunch aggregated. Neither the size nor structure of any stake has been set, and no terms have been decided; reporting says talks have run for more than a year, since CEO Sam Altman first raised the idea in 2025. Some equity could reportedly seed a "Public Wealth Fund," an OpenAI April policy proposal to route AI gains to citizens. Aboard Air Force One, Trump said "pieces could be given to the American public". A direct state ownership position in a leading AI lab would set a governance and conflict-of-interest precedent for federal AI oversight.

    AI / ML Desk · desk

  5. AI / ML· 2 sources

    OpenAI adds opt-in Lockdown Mode to ChatGPT to block prompt-injection data exfiltration

    OpenAI added Lockdown Mode, an opt-in ChatGPT setting that blocks the data-exfiltration stage of prompt-injection attacks by limiting outbound network requests. With it on, web browsing is capped to cached content and Agent Mode, Deep Research, in-response images, live connectors, and file downloads are turned off, cutting the channels injected instructions could use to ship stolen data offsite. The feature is free across all personal accounts and self-serve Business accounts. OpenAI cautions it isn't for everyone and that ChatGPT can still be prompt-injected through cached pages or uploaded files. How much capability will sensitive-data users trade for a partial defense?

    AI / ML Desk · desk

  6. AI / ML· 2 sources

    Google to pay SpaceX $920M a month for AI compute, per SpaceX SEC filing

    Google will pay SpaceX 920 million dollars a month to rent AI compute, according to SpaceX's amended S-1 filed with the SEC on June 5 ahead of its planned Nasdaq IPO. The deal covers roughly 110,000 NVIDIA GPUs plus CPUs and memory, runs October 2026 through June 2029 — about 32 months and 32 billion dollars total — with a 90-day exit for either side after December 31, 2026. Google called it "bridge capacity" for surging Gemini Enterprise demand; SpaceX did not name the site, though the capacity sits in data centers it absorbed from xAI. Routing a direct AI rival's overflow through Elon Musk's hardware marks how acute compute scarcity has become.

    AI / ML Desk · desk

  7. Security· 1 source

    Pro-Iran hackers tricked Meta's AI support bot into resetting Instagram passwords

    Pro-Iran hackers abused Meta's AI support assistant to hijack high-value Instagram accounts, briefly defacing the Obama White House and U.S. Space Force senior-enlisted-leader handles with pro-Iranian imagery. Brian Krebs reports a Telegram video showed the method: connect via VPN near the target's hometown, request a password reset, then tell the AI chat to link a new email — which it sent a one-time reset code to. Attackers claimed names worth over $500,000 and said the trick failed against MFA-protected accounts. Meta's Andy Stone said the issue was resolved via an emergency weekend patch, with no back-end breach. Wiring conversational AI into account-recovery flows makes it a social-engineering target once it can take privileged action.

    Security Desk · desk

  8. AI / ML· 3 sources

    JetBrains open-weights Mellum2, a 12B MoE code model with 2.5B active parameters

    JetBrains released Mellum2, a 12B-parameter Mixture-of-Experts (MoE) model trained on natural language and code, under the Apache 2.0 license. The model routes each token through 8 of 64 experts, activating 2.5B parameters per pass, and supports a 131,072-token context via layer-selective YaRN extension. Successor to JetBrains' 4B code-completion model, Mellum2 broadens scope to routing, RAG, sub-agents, and private deployment, shipping in six checkpoints (Base, Instruct, Thinking, and SFT/pretrain variants). JetBrains claims more than 2x faster inference than similarly sized open models; on the Base checkpoint it trails Qwen2.5-7B on HumanEval (41.5 vs 55.5) while tracking it on GSM8K and MMLU. Whether a sparse "focal" model wins adoption over denser coding LLMs remains open.

    AI / ML Desk · desk

  9. DevTools· 2 sources

    Cloudflare acquires VoidZero, pledging Vite, Vitest, Rolldown and Oxc stay MIT-licensed

    Cloudflare has acquired VoidZero, Evan You's company behind the Vite build tool, Vitest test runner, Rust-based Rolldown bundler, and Oxc toolchain. You and his team join Cloudflare's Emerging Technology and Incubation group and keep leading the projects, which the post says stay MIT-licensed, vendor-agnostic, and community-driven, with apps built on Vite still running anywhere. Cloudflare is also committing $1 million to a Vite ecosystem fund for outside maintainers, administered by the Vite core team. The post names no acquisition price; the open question is whether a vendor owning the toolchain millions of projects depend on stays neutral once Workers deployment is wired in natively.

    DevTools Desk · desk

  10. Security· 2 sources

    Cisco confirms unpatched SD-WAN Manager zero-day exploited for root-level command injection

    Cisco confirmed that CVE-2026-20245 (CVSS 7.8), a command-injection flaw in the CLI of Cisco Catalyst SD-WAN Manager, is being actively exploited as a zero-day to run arbitrary commands as root. The bug stems from insufficient validation of user-supplied input and affects all deployment types — On-Prem, SD-WAN Cloud-Pro, Cisco-managed cloud, and FedRAMP. Exploitation requires netadmin privileges, which attackers can obtain by first chaining CVE-2026-20182 or CVE-2026-20127. Mandiant reported the flaw, and Cisco has observed cases where attackers pushed configuration changes to edge devices. No patch exists yet; until one ships, Cisco advises upgrading to the CVE-2026-20182 fix and reviewing `/var/log/scripts.log` for suspicious uploads.

    Security Desk · desk

  11. AI / ML· 1 source

    ClickUp lays off hundreds, says it's replacing them with thousands of AI agents

    Productivity-software maker ClickUp is cutting hundreds of staff and replacing them with "thousands of AI agents," the nine-year-old startup confirmed in reporting on 25 May 2026. The framing — staff replaced numerically by autonomous agents — is one of the first cases of a venture-funded software company stating the substitution publicly, rather than couching the layoff as restructuring or efficiency. The operational details (which agents, run on what stack, integrated into which workflows) are not in the company's external communication; the framing alone is the news event. Expect the playbook to be copied.

    AI / ML Desk · desk

  12. AI / ML· 2 sources

    Pope Leo XIV's new encyclical uses AI as a frame for concentrated power

    Pope Leo XIV's encyclical *Magnifica Humanitas*, surfaced in widespread reporting on 25 May 2026, treats artificial intelligence less as a doctrinal subject than as a lens — the document is primarily about concentrated power, eroded democracy, and the influence of a small tech elite over how the world is shaped, according to early analysis. The papal text invokes AI throughout, but the church's substantive critique is of the political-economic conditions in which AI is developed, not of the technology itself. The framing matters because Catholic social teaching has historically reshaped labour and inequality debates well beyond the church's congregation.

    AI / ML Desk · desk

  13. Security· 1 source

    Dutch FIOD seizes 800+ servers used to host Russia-linked cyberattacks

    The Dutch financial-crimes agency FIOD seized "more than 800 servers" on 18 May 2026 and arrested two suspects — Youssef Zinad, 57, of Amsterdam, and Andrey Nesterenko, 39, of The Hague — charging them under sanctions law with making economic resources available to EU-sanctioned entities. The infrastructure allegedly hosted Russia-backed operations including DDoS staging, anonymisation proxies, and disinformation campaigns; among the cited deployments was targeting of Danish government bodies during the country's 13–19 November 2025 municipal elections. The use of sanctions law rather than computer-crime statutes is the substantive novelty — it makes the hosters, not just the attackers, directly prosecutable in the EU.

    Security Desk · desk

  14. AI / ML· 2 sources

    NVIDIA ships Nemotron Diffusion language models — 3B–14B params, 4x throughput on B200

    NVIDIA released the Nemotron-Labs Diffusion family on 23 May 2026 — diffusion-based language models at 3B, 8B and 14B parameter scales, plus an 8B vision-language variant. The text models ship under the commercially-friendly NVIDIA Nemotron Open Model Licence with weights on Hugging Face. The 8B text model achieves 1.2% higher average accuracy than Qwen3 8B on the team's evaluation suite, while generating roughly 4x faster than the autoregressive baseline on NVIDIA B200 hardware — about 865 tokens/sec on the speedbench dataset in self-speculation mode. The model fills 32-token blocks via iterative denoising and supports three modes: pure autoregressive, FastDiffuser, and a self-speculative path that drafts bidirectionally and verifies causally.

    AI / ML Desk · desk

  15. AI / ML· 1 source

    New arXiv paper: test-time training can undermine LLM safety guardrails

    A paper posted to arXiv on 25 May 2026 (cs.LG, 2605.22984) argues that test-time training — the technique of adapting model parameters during inference, increasingly used for few-shot adaptation — can undermine the safety guardrails installed during post-training. The authors describe TTT as "an emerging paradigm" that improves task performance, but say its parameter mutations can weaken refusal behaviours that standard alignment evaluations were measured against. The full quantitative breakdown is in the paper; the implication for deployed systems is direct: any post-deployment adaptation may invalidate prior safety-evaluation certifications, and the same speedup the engineering community is pursuing for personalisation is the mechanism by which guardrails can be silently bypassed.

    AI / ML Desk · desk

  16. AI / ML· 1 source

    Open CTF is dead as a measure of human security skill, top competitor argues

    Kabir Acharya, a competitive Capture-The-Flag player who placed in the global top 10 through late 2025, argues that frontier AI models have hollowed out the open CTF format — Claude Opus 4.5 makes "almost every medium difficulty challenge, and some hard challenges" agent-solvable, and GPT-5.5 can one-shot "Insane difficulty active leakless heap pwn challenges" on HackTheBox . Acharya offers no proposed fix; he tells beginners to use platforms like picoGym and recommends the community "build new avenues" rather than adapt the existing competition format .

    AI / ML Desk · desk

  17. AI / ML· 2 sources

    Anthropic closes $30B round at $900B valuation, eclipsing OpenAI

    Anthropic is finalising a $30B fundraising round at a $900B valuation, surpassing OpenAI's $852B mark. Q1 2026 revenue is up roughly 80× year-over-year and ARR has crossed $44B, with the number of customers spending more than $1M per year doubling to over 1,000 in two months. The capital is earmarked for compute — multi-year AWS and Google Cloud commitments through 2027. Named institutional clients now include PwC, Blackstone, Goldman Sachs, Hellman & Friedman, and the Gates Foundation.

    AI/ML Desk · desk

  18. AI / ML· 1 source

    GPT-5.5 Instant moves to default across ChatGPT free, Plus, and Pro

    OpenAI moved GPT-5.5 Instant into the default ChatGPT slot on 5 May 2026 across all tiers. Reported benchmark gains are non-trivial — AIME 2025 jumps to 81.2 from 65.4, MMMU-Pro multimodal climbs to 76 from 69.2. The headline new feature is memory integration: the model now searches past conversations, uploaded files, and (with consent) Gmail to ground personalised answers. Enterprise security teams have flagged the Gmail hook as the dominant question on contract reviews this month.

    AI/ML Desk · desk

  19. AI / ML· 1 source

    PwC deploys Claude across 300K+ professionals — largest pro-services AI rollout to date

    PwC has deployed Claude across more than 300,000 professionals globally — the largest professional-services AI rollout publicly disclosed. Internal numbers reported on the launch: insurance underwriting compressed from ten weeks to ten days, security tasks from hours to minutes, and overall delivery up to 70% faster. PwC has also stood up a new finance unit, the "Office of the CFO," built entirely on Claude. The deployment lands as the firm renegotiates partner-level utilisation targets to reflect agent-assisted output.

    AI/ML Desk · desk

  20. Security· 1 source

    All five US frontier labs sign CAISI pre-deployment evaluation agreements

    The US Commerce Department's CAISI has finalised pre-deployment evaluation agreements with all five major frontier labs — OpenAI, Anthropic, Google DeepMind, Microsoft, and xAI. Every major model now requires a government evaluation pass before public launch. Lab teams privately describe the regime as workable but slow, and several have already started bringing release engineers into the evaluation process to compress turnaround. The first model to launch entirely under the new pipeline is expected later in the quarter.

    Security Desk · desk

  21. AI / ML· 1 source

    Meta's Avocado model slips to June after below-frontier internal benchmarks

    Meta has quietly slipped its long-anticipated Avocado model to a June launch after internal testing placed it between Gemini 2.5 and 3.0, and below threshold against GPT-5.5 and Claude Opus 4.7 on the benchmarks the company had targeted for May. The delay extends an open-weights lead that Chinese labs have built — Kimi K2.6, DeepSeek V4, and GLM-5.1 are all current options for teams who can't or won't deploy a US frontier model. Meta's open-weights roadmap depends on Avocado clearing a credibility bar; the longer it slips, the harder that gets.

    AI/ML Desk · desk

  22. DevTools· 1 source

    Anthropic doubles Claude Code rate limits after Colossus 1 deal lands

    Anthropic doubled rate limits across all paid Claude Code plans on 6 May 2026, citing newly-available compute from the SpaceX Colossus 1 supercomputer deal — 220,000+ NVIDIA GPUs and 300 megawatts of power. The change is the first observable downstream effect of the Colossus arrangement and lands as Claude Code becomes the most-used surface inside Anthropic, by token count, ahead of Claude.ai. Heavy users — agentic harnesses, IDE plugins, terminal sessions running long-context tools — were the practical bottleneck the previous tier had been throttling.

    DevTools Desk · desk

  23. DevTools· 1 source

    Adobe Firefly adds agentic AI for multi-step creative workflows

    Adobe is rolling agentic AI into Firefly, with Creative Cloud applications — Photoshop, Illustrator, Premiere — gaining the ability to chain multi-step tasks under a single brief. The signal lines up with Anthropic shipping first-party Claude connectors for Adobe, Blender, and Ableton the same week. Both moves point at the same target: creative tools that the model can directly drive, rather than tools that prompt the user for the next click.

    DevTools Desk · desk

  24. AI / ML· 1 source

    Mistral ships an AI orchestration engine for production business workflows

    Mistral AI has launched an AI orchestration engine targeted at production business processes — moving past the model-as-API positioning that defined the company's first two years. The pitch is a single runtime for workflows that fan out across tools, retries, evaluations, and human approvals, with first-class support for Mistral models and adapters for OpenAI and Anthropic. The interesting bet is the runtime layer becoming the lock-in surface, not the model itself.

    AI/ML Desk · desk

  25. Cloud· 2 sources

    Cloudflare + Stripe ship infrastructure for AI agents that deploy apps autonomously

    Cloudflare and Stripe have opened public beta on a joint stack for AI agents that deploy production applications without human intermediation — provisioning, billing, and traffic-shaping all callable as tool surfaces. The early customer cohort is small but instructive: agent-built SaaS products that bootstrap their own infra, settle payments on agent-owned accounts, and scale via API-driven contracts. It is the most concrete demo yet of an agent-native software economy, and the legal and identity questions it raises are not solved.

    Cloud Desk · desk

  26. AI / ML· 1 source

    Isomorphic Labs raises $2.1B — life-sciences AI crosses into commercial category

    Isomorphic Labs, the Google DeepMind drug-discovery spinout founded by Demis Hassabis in 2021 to commercialise AlphaFold, has raised $2.1B. AlphaFold won the 2024 Nobel Prize in Chemistry, but the round is the clearest signal yet that life-sciences AI has crossed from research project to commercial category. Pipeline disclosures remain limited; the company has said proceeds will fund expanded wet-lab capacity and a second discovery platform aimed at small-molecule optimisation rather than protein structure alone.

    AI/ML Desk · desk