Fresh daily

AI News

Latest AI tool releases, research breakthroughs, and industry news.

All Releases Research Funding Tutorials Opinion

Earlier this week

Jersey Mike’s IPO illustrates how bad the AI hype has become

Just for kicks, I took a look at Jersey Mike's IPO documents. Surely a sandwich shop would have no need to mention AI. But low and behold.

TechCrunch AI·Jul 2opinion

Newly discovered PamStealer isn't your typical macOS malware

The discovery underscores the increased effort being poured into Mac infostealers.

Ars Technica·Jul 2research

llm-coding-agent 0.1a0

Release: llm-coding-agent 0.1a0 Another Fable 5 experiment. Now that my LLM library has evolved into more of an agent framework it's time to see what a simple coding agent would look like built on it. I started a new Python library using my python-lib-template-repository GitHub template repository, then ran these two prompts (here's the Claude Code for web transcript ): Write a spec.md for this project - it will depend on the latest “llm” alpha from PyPI and implement a Claude code style coding agent complete with tools for reading and editing files and executing commands Then: Commit the spec, then build it using red/green TDD in a series of sensible commits (each with passing tests and updated docs) - occasionally manually test it using the OpenAI API key in your environment Here's the resulting README file and the sequence of commits . I've shipped a slop-alpha to PyPI, so you can run the new agent like this: uvx --prerelease=allow --with llm-coding-agent llm code It's pretty good for a first attempt! Here's the (Fable-authored) README , which lists recipes like llm code --yolo and llm code --allow "pytest*" --allow "git diff*" . It also presents a Python API based around a CodingAgent(model="gpt-5.5", root="/path", approve=True).run("Fix the failing test in tests/test_parser.py") class which I didn't ask for but I'm delighted to see implemented. Here's the suite of tools it implemented , listed using uvx ... llm tools : CodingTools_edit_file(path: str, old_string: str, new_string: str, replace_all: bool = False) -> str Replace an exact string in a file. old_string must match the file contents exactly (including whitespace) and must identify a unique location unless replace_all is true. Returns a diff of the change so it can be verified. CodingTools_execute_command(command: str, timeout: int = 120) -> str Run a shell command in the session root directory. Returns combined stdout and stderr followed by an Exit code line. timeout is in seconds (maximum 600); on tim

Simon Willison·Jul 2release

Meta quietly launches vibe-coded gaming app Pocket

Meta has quietly launched Pocket, an experimental AI app that lets users generate and share interactive mini games using text prompts.

TechCrunch AI·Jul 2release

Anthropic is discussing a new custom chip with Samsung

The news comes about a week after OpenAI announced its own custom AI chip in a partnership with Broadcom.

TechCrunch AI·Jul 2release

Using DSPy to evaluate and improve Datasette Agent's SQL system prompts

Research: Using DSPy to evaluate and improve Datasette Agent's SQL system prompts One of this morning's AIE keynotes covered dspy , which reminded me I've been meaning to see if it could help me improve the system prompt used by Datasette Agent - so I fired off an asynchronous research task in Claude Code for web using Claude Fable 5: Pip install the latest Datasette alpha and datasette-agent and dspy - then figure out how to use dspy to evaluate and improve the main system prompts used by Datasette Agent for the feature where it can execute read only SQL queries to answer user questions about data. Fable chose to test using GPT 4.1 mini and nano, and identified several promising looking directions for improvements. I particularly like this one: The schema listing gives only table names; the "don't call describe_table if you already have the information" advice caused column-name guessing (page_count, o.order_id, first_name) and error-retry loops in baseline traces. Either include column names in the prompt's schema listing or soften that advice. Tags: ai , datasette , generative-ai , llms , evals , dspy , datasette-agent , claude-mythos

Simon Willison·Jul 2research

How GitHub used secret scanning to reach inbox zero

GitHub had 20,000+ secret scanning alerts across 15,000 repositories. Here's how we separated signal from noise, built remediation workflows, and reached inbox zero in nine months. The post How GitHub used secret scanning to reach inbox zero appeared first on The GitHub Blog.

GitHub Blog·Jul 2tutorial

Achieving operational excellence with AI

Frameworks like Lean Six Sigma and business process management (BPM) first gained traction because they promised clarity in the chaos—a structured way to bring order to messy, sprawling operations. Lean Six Sigma emphasized statistical rigor and quality control; BPM created end-to-end maps of how work should flow across departments. Both offered a repeatable way to…

MIT Tech Review·Jul 2opinion

OpenAI proposed donating 5% of its equity to a US sovereign wealth fund

OpenAI CEO Sam Altman has reportedly proposed giving 5% of the company’s equity to a U.S. sovereign wealth fund, reviving discussions about letting the public share in the financial gains from the AI boom.

TechCrunch AI·Jul 2opinion

Microsoft launches its own AI deployment company with $2.5 billion commitment

Microsoft follows Amazon, OpenAI and Anthropic with its new AI deployment group.

TechCrunch AI·Jul 2funding

Teaching AI to run with the turbines

Artificial intelligence may have captured the public imagination through chatbots and image generators, but some of its most consequential use cases are unfolding far from consumer-facing tools. In industries where physical infrastructure, operational continuity, and safety are paramount, AI is becoming a core operating layer. With its sprawling industrial systems and constant stream of operational…

MIT Tech Review·Jul 2research

Yep, we’re using OpenClaw to date now

Ben Guez has "a bunch of potential international wives in [his] DMs," thanks to an automated script he set up using OpenClaw, Claude code, and Instagram trials.

TechCrunch AI·Jul 2opinion

OpenAI floats giving Trump administration 5 percent cut of AI boom

OpenAI has floated giving the US government a 5 percent ownership stake as a way of easing tensions with the Trump administration and blunting mounting public backlash against AI, according to the Financial Times. CEO Sam Altman argued that giving the public a financial interest in the company would be the best way to share the upside of AI, the FT reported, citing two unnamed people familiar with the talks. He's said to have first pitched the idea to Trump early last year. Altman reportedly suggested the 5 percent figure. Based on OpenAI's latest funding round, which ended with the company valued at $852 billion, that stake would be worth … Read the full story at The Verge.

The Verge AI·Jul 2funding

Indian tech tycoon bets $30M of his own money to build AI alternative to Microsoft Office

Neo is Bhavin Turakhia’s fifth venture and his latest involving enterprise software. This time he's taking on Microsoft Office, Google Apps with AI.

TechCrunch AI·Jul 1funding

More details on Fable 5’s cyber safeguards and our jailbreak framework

Anthropic News·Jul 1research

Autoresearch: The feedback loop behind self-improving agents

Introspection co-founder Roland Gavrilescu explains autoresearch, agent “recipes,” self-improving loops, and why humans remain central to the software factory.

Latent Space·Jul 1research

T-Mobile moving tens of thousands of virtual machines off VMware amid lawsuit

Ars Technica·Jul 1opinion

How Cursor deploys AI inside the enterprise

Cursor's Pauline Brunet explains how her team of Forward Deployed Engineers help organizations implement agents — essentially setting up software factories.

Latent Space·Jul 1tutorial

SpaceX has an AI device prototype, and it sure sounds phone-ish

SpaceX reportedly showed investors a "handset-like" AI device before going public. It could be another signal SpaceX wants to expand into wireless.

TechCrunch AI·Jul 1research

Ashton Kutcher leaving Sound Ventures to launch new VC firm with Morgan Beller

The actor and investor is joining forces with Morgan Beller, who was previously a GP at NFX, to invest in early-stage startups.

TechCrunch AI·Jul 1funding

Search AI Workflow Pro

AI News

Earlier this week

Jersey Mike’s IPO illustrates how bad the AI hype has become

Newly discovered PamStealer isn't your typical macOS malware

llm-coding-agent 0.1a0

Meta quietly launches vibe-coded gaming app Pocket

Anthropic is discussing a new custom chip with Samsung

Using DSPy to evaluate and improve Datasette Agent's SQL system prompts

How GitHub used secret scanning to reach inbox zero

Achieving operational excellence with AI

OpenAI proposed donating 5% of its equity to a US sovereign wealth fund

Microsoft launches its own AI deployment company with $2.5 billion commitment

Teaching AI to run with the turbines

Yep, we’re using OpenClaw to date now

OpenAI floats giving Trump administration 5 percent cut of AI boom

Indian tech tycoon bets $30M of his own money to build AI alternative to Microsoft Office

More details on Fable 5’s cyber safeguards and our jailbreak framework

Autoresearch: The feedback loop behind self-improving agents

T-Mobile moving tens of thousands of virtual machines off VMware amid lawsuit

How Cursor deploys AI inside the enterprise

SpaceX has an AI device prototype, and it sure sounds phone-ish

Ashton Kutcher leaving Sound Ventures to launch new VC firm with Morgan Beller