[This Week’s AGI Observation] Recursive Boot, Abandoning Definitions — Issue 2

Featured Image: Lighthouse in heavy fog, AI-generated illustration Featured Image | AI-generated illustration: When definitions are abandoned, we aren’t even sure where the lighthouse is.

Key Points (TL;DR)

  • Recursive Self-Improvement (RSI) has moved from theory to practice: Google DeepMind’s AlphaEvolve has been deployed in its own data centers, optimizing the hardware and scheduling for training Gemini—which in turn is the engine for AlphaEvolve. This is a running closed loop; ICLR 2026 also hosted its first academic Workshop on RSI this week.
  • Sam Altman dropped the remark that “AGI may have already whooshed by”: After burying the AGI clause in the first issue, this week is about “burying the definition of AGI.” I tend to think this is the second step in abandoning definitions.
  • Anthropic’s valuation surged from $800 billion to over $900 billion in 4 weeks, potentially raising $60 billion in an October IPO. The private market completely ignores the Pentagon’s ban.
  • The Pentagon’s “eight minus one” status remains unchanged, but the White House is quietly drafting an executive order to bypass Anthropic’s “supply chain risk” label—”subject banned, product used” is about to be institutionalized.
  • OpenAI CFO Sarah Friar publicly refuted the WSJ leak: “a vertical wall of demand.” But she didn’t deny the internal memo; instead, she shifted the semantic focus from “insufficient demand” to “insufficient compute.” I believe this is PR damage control rather than a counter-proof.
  • All 11 co-founders of xAI have resigned, with the last two leaving at the end of March; Musk admitted that xAI “needs to be rebuilt from the foundation.” With SpaceX valued at $1 trillion and xAI at $250 billion, the entire frontier camp is restructuring.
  • Xiao Jian AGI Implementation Tracking Index: 3.9 → 4.2 (Industry signals +0.5; Autonomy +0.5).

I. New Signals This Week (Ranked by Credibility)

🔴 Strong Signal

1. Google AlphaEvolve has completed the “Self-Improvement—Feedback” closed loop in production Geeky Gadgets — AlphaEvolve Explained, ICLR 2026 RSI Workshop

DeepMind publicly admitted that AlphaEvolve “has been deployed in Google data centers, reclaiming compute resources and accelerating next-generation training.” It optimizes three things: data center scheduling, hardware accelerator chip design, and the matrix multiplication kernels used to train Gemini models. And Gemini is the evolutionary engine for AlphaEvolve. This closed loop is not a research paper demonstration; it has been running stably in production for several months. The ICLR 2026 RSI Workshop, which opened this week in Rio de Janeiro, is the first academic venue formally named after “Recursive Self-Improvement”—academia has finally caught up with practice.

2. Details of the OpenAI–Microsoft “breakup” terms have landed Bloomberg, OpenAI Official

The first issue of the weekly report covered the deletion of the AGI clause. This week, we saw the full “breakup price”: Microsoft holds approximately 27% of OpenAI Group PBC, valued at $135 billion, with IP licensing extended to 2032; OpenAI has committed to purchasing an additional $250 billion in Azure services. Microsoft no longer has a “right of first refusal” on models sold by OpenAI. For the frontier race, this means OpenAI’s compute demands are no longer locked solely to Azure and will disperse to all hyperscalers—meaning OpenAI is preparing for a world “without Microsoft as a safety net.”

🟡 Medium Signal

3. Sam Altman: AGI may have already “whooshed by” Mark Kretschmann/X, Windows Central, OpenAI Our Principles

Altman stated in a recent live stream, “OpenAI may have already crossed AGI without knowing it, because the term has become so vague it’s meaningless,” and added that if today’s models have true continuous learning, “they already count as AGI.” This follows the same narrative line as his “basically built AGI” comment in India on 2/19 and the “post-AGI economic collapse” on 4/26: sliding from “we will achieve AGI” to “AGI is a bad word.” I tend to think this is a strategic abandonment of definitions—when you don’t have to declare it at a specific point in time, you don’t have to be responsible for anything.

4. Anthropic valuation: $350B → $800B → $900B+ in 4 weeks Bloomberg ($800B), Bloomberg ($900B), PYMNTS

ARR went from $9 billion at the end of 2025 to $30 billion by the end of March 2026; valuation went from $350 billion in February to rejecting an $800 billion offer on 4/14, and is now weighing a new round at over $900 billion on 4/29. A potential IPO is expected in October, raising over $60 billion, which could directly surpass OpenAI to become the world’s most valuable AI startup. In the same week that the “Big Four” of Silicon Valley all increased their investments and Google allocated $40 billion, the Pentagon listed Anthropic as a supply chain risk—the gap between the capital market and the state apparatus is widening to a historical high.

5. OpenAI CFO Sarah Friar: “a vertical wall of demand” PR counter-attack Bloomberg, Yahoo Finance

Three days after the 4/27 WSJ internal memo leak, Friar came out to say, “we feel like we’re beating our plan at the highest level,” and “what’s holding us back isn’t demand, it’s compute.” She did not deny the memo. She simply shifted the narrative focus from “missing the 1 billion weekly active users and revenue targets by year-end” to “stretch goals are more aggressive than public targets” and “demand is a vertical wall.” This is textbook PR reframing: admitting some goals weren’t met, then redefining the meaning of those goals. I moved this signal from “medium” to “strong” by 0.3 points—because the CFO actively coming to the front line is evidence in itself that the fire hasn’t been extinguished.

6. 11 of 12 xAI co-founders have resigned CNBC, TechCrunch, Bloomberg

Tony Wu (2/10), Jimmy Ba (2/11), Guodong Zhang, Zihang Dai (March), and finally Manuel Kroiss and Ross Nordeen (end of March)—11 out of 12 founders have left, with Musk being the only one remaining. SpaceX acquired xAI in an all-stock deal on 2/2, valuing SpaceX at $1 trillion and xAI at $250 billion. The official reason for the departures is “research culture vs. engineering culture conflict.” Musk himself admitted that xAI “wasn’t built right and needs to be rebuilt from the foundation.” My interpretation: The frontier camp is consolidating from the “Big Four” to the “Anthropic + Google + OpenAI” trio, with xAI and Meta starting to fall behind.

🟢 Weak Signal

7. Anthropic publishes Automated Weak-to-Strong Researcher alignment.anthropic.com

Automating alignment research itself—following Anthropic’s “Automated Alignment Researchers” earlier in April, they released an extended version this month. This is the safety-side mirror of RSI: if models can improve themselves, alignment research must also keep up. A weak signal because there is no concrete evidence of deployment into the Claude training pipeline yet.

8. Andrew Ng proposes “Turing-AGI Test” Andrew Ng on X

Even an AI godfather figure posted on the first day of the new year that “it’s time to redefine AGI”—this signal is weak, but it represents that the academic circle is also beginning to accept that “the term AGI has been so polluted by capital and marketing jargon that it must be renamed.”

9. CVE-2026-4747 (FreeBSD NFS RCE) officially tagged with Anthropic Glasswing The Hacker News, VulnCheck, Schneier on Security

40 vulnerabilities have been credited, with at least 1 explicitly recorded by NVD as “by Anthropic Project Glasswing (Mythos Preview) autonomously identified and exploited.” Mythos has moved from “finding vulnerabilities” to “officially obtaining CVE numbers”—the item on last week’s watch list has been crossed off.

10. White House drafts executive order to bypass Anthropic supply chain risk label for federal use of Mythos Axios, Nextgov, Government Executive

Trump met with Chief of Staff Susie Wiles and Dario Amodei at the White House on 4/17; Trump subsequently said in a CNBC interview that Anthropic is “shaping up” and “could be very useful.” However, on 5/1, the Pentagon still excluded Anthropic from the list of eight IL6/IL7 companies. “Subject banned, product used” is about to be institutionalized—this was an observation question left over from last week, and this week there is progress on the draft.


II. AGI Implementation Tracking Index (Xiao Jian Index)

DimensionLast WeekThis WeekChangeMain Driver
Technical Capability (30%)4.04.1↑0.1DeepSeek V4 Pro Intelligence Index +10 (42→52); GPT-5.5 ARC-AGI-2 85%; ARC-AGI-3 still <1%
Autonomy / METR (25%)3.54.0↑0.5AlphaEvolve confirmed closed loop in production; OpenAI “September research intern” goal still on track; ICLR RSI workshop
Industry Signals (25%)4.55.0↑0.5Altman “AGI has whooshed by” + Anthropic $900B valuation + xAI co-founders all left
Economic Impact (20%)3.53.7↑0.2Anthropic ARR 30B, Hinton 2026 watershed, Google 600 employees protest defense contract
Xiao Jian Index3.94.2↑0.3Industry signals are the biggest push this week

Xiao Jian Index trend chart Chart | The Xiao Jian AGI Implementation Tracking Index rose from 3.9 in Issue 1 to 4.2 in Issue 2. The biggest push came from industry signals (+0.5) and autonomy (+0.5).

Cooling Agent (Weekly Fixed): On ARC-AGI-3, humans are still at 100%, and the strongest AI is less than 1%. My first reaction to all claims that “AGI has been achieved” is still this number.


Recursive closed loop concept image Image | AI-generated illustration: The closed loop of recursive self-improvement—A improves B, and B improves A’s capabilities.

III. In-depth Analysis of Key Events

Axis 1: AlphaEvolve and the “Quietly Booting Recursion”

When DeepMind released AlphaEvolve in May 2025, the outside world treated it as another algorithmic search tool—more general than AlphaTensor or AlphaCode, but still “finding better answers in a fixed domain.” Looking back a year later, this interpretation completely underestimated it.

According to DeepMind’s own account (also cited by multiple tech media outlets and this week’s ICLR RSI Workshop), AlphaEvolve has been deployed internally at Google and has completed three feedback loops:

  1. Data center scheduling optimization—reducing idle rates of Google TPU clusters, effectively squeezing out more training compute.
  2. Hardware accelerator chip design improvement—certain blocks of the next-generation TPU were finalized based on versions proposed by AlphaEvolve.
  3. Matrix multiplication kernel optimization—directly accelerating Gemini training.

Connecting these three things: AlphaEvolve is an evolutionary system derived from Gemini, which in turn improves the hardware and scheduling for Gemini’s training. This is the textbook definition of recursive self-improvement: A improves B, and B improves A’s capabilities, so the next generation of A becomes stronger and goes on to improve B. The feedback cycle is no longer “one training run every 6 months,” but “data-center-level continuous optimization.”

Of course, the “degree of improvement” in this closed loop is currently not large—estimated at a few percentage points to over ten percentage points of efficiency gain. It will not trigger an intelligence explosion overnight. But the existence of this closed loop itself means that the comfortable assumption in AGI discussions that “we are still far from RSI” has been broken.

I think what is more worrying is not AlphaEvolve itself, but that it didn’t make anyone scream. A year ago, OpenAI employees occasionally mentioned in private interviews that “we have some internal tools that write their own training scripts,” and the industry would nervously ask, “What is that? Is it RSI?”—now DeepMind has publicly told this story as an “optimization story,” and no one is surprised. The threshold for the AGI race is being quietly lowered by capital and propaganda.

The RSI Workshop held at ICLR 2026 this week is a belated catch-up lesson. When academia formally holds a workshop to address an issue, the industry has usually been running with it for 12–18 months. Putting AlphaEvolve, Anthropic’s 4/14 Automated Alignment Researchers, and OpenAI’s “September automated research intern” goal together—RSI is no longer a question of “will it happen,” but “in whose hands and at what speed will it happen.”

My judgment: Within 3 quarters, there will be the first credible paper (not a CEO tweet) quantifying the “contribution ratio of AI to AI training processes” for a certain lab. When this number exceeds 50%, the race for AGI will have truly begun.

Definition abandonment concept image Image | AI-generated illustration: When the word “AGI” is deliberately blurred, responsibility is scattered by the wind.

Axis 2: Sam Altman’s “whooshed by” and the second half of abandoning definitions

The core conclusion of the first weekly report was: “OpenAI and Microsoft buried the AGI clause, turning AGI from a contractual issue into a valuation issue.” This week, Altman personally pushed this action to its complete version—not only is the AGI in the contract gone, but the word AGI itself has been declared dead.

In a recent live stream, Altman said:

“OpenAI may have already crossed AGI without knowing it, because the term has become so vague it’s meaningless… If today’s models have true continuous learning, they already count as AGI.”

Breaking this down:

  1. “Crossed AGI without knowing it”—this phrasing is very sophisticated. It simultaneously declares “we may have already done it” (boosting investor excitement) and exempts them from the obligation to “provide verifiable evidence” (because, well, “without knowing it”).
  2. “The term has become vague”—this shifts the responsibility to language itself, not OpenAI.
  3. “If it has continuous learning, it counts”—lowering the standard. AGI no longer needs to “surpass humans in most economically valuable work” (OpenAI’s own 2018 definition), as long as it has “continuous learning.”

Lining up this timeline:

  • Mid-2024: Altman posts “AGI achieved internally” on X (later claimed to be a joke).
  • 2026/2/19: India AI Summit: “We have basically built AGI.”
  • 2026/4/26: “Post-AGI economic collapse” + AGI Five Principles.
  • 2026/4/27: AGI clause in contract eliminated.
  • 2026/5 (This week): “AGI may have already whooshed by.”

This is a rhythmic de-mythologization project. Altman is not the first to downplay AGI—Anthropic has long refused to use the term, calling it “powerful AI”; Hassabis set a stricter standard; Andrew Ng wants to change it to the “Turing-AGI Test.” But Altman is the only person who is simultaneously saying “we have already done it,” “the term is meaningless,” and still raising $122 billion (OpenAI April Announcement).

I tend to think that OpenAI is preparing for the second half of the game where “AGI cannot be declared but can be charged for”: to regulators, AGI is vague and doesn’t need to be regulated; to investors, AGI is something already quietly achieved and worthy of a $5 trillion valuation. This double-talk can only be delivered by one person, because if any two OpenAI executives said the same thing, they would immediately be caught in a contradiction.

A practical rule of thumb for readers: Whenever a CEO uses phrasing like “whooshed by,” “without knowing it,” or “it already is if you accept this definition,” treat it as marketing jargon rather than a technical statement. Real technical milestones come with numbers, not paradoxical sentence structures.


Power Map Chart | Relative influence of frontier labs this week (Xiao Jian’s subjective scoring).

IV. Industry Power Map Update

Power map four weeks ago: “Big Three (OpenAI / Anthropic / DeepMind) + Chinese Wings (DeepSeek / Tongyi) + Fringe (xAI / Meta).”

Update this week:

  • OpenAI: Valuation held, but narrative is leaking. CFO came out to do damage control, CEO switched to the “abandoning definitions” card. The Pentagon case is one of the few victories they can show. No issues in the short term, but the biggest risk in the medium term is “insufficient people + compute reaching price limits.”
  • Anthropic: $900 billion in the private market, blocked in the public sector. But the White House EO draft + Trump’s hint that “public sector blockade is just a price tag.” Steadily moving from second place to first; the only difference is the IPO price.
  • Google DeepMind: The biggest substantive winner this week, but buried under Anthropic’s topics. AlphaEvolve deployed, Gemini 3.1 leading ARC-AGI-2, Pentagon list entry, $40 billion investment in Anthropic. Holding chips for all tables.
  • xAI: Substantively falling behind. Co-founders all gone, absorbed by SpaceX, Musk admitted to rebuilding.
  • Meta: Eerily quiet. No new progress on LeCun’s path to talk about.
  • DeepSeek: V4 Pro took second place in open source (closely following Kimi K2.6) on independent benchmarks (Artificial Analysis Intelligence Index), but has not yet entered the top-tier closed-source race. The story of Chinese domestic chips is really starting to run—Huawei Ascend, Cambricon, and Hygon completed day-0 adaptation to V4.
  • Reflection AI: The “political newcomer” of the US military, still has not released any anti-pollution benchmark scores.

Power formula this week: Influence = Model Capability × Compute Commitment × Political Capital × Narrative Control. Ranked by this formula: Google ≥ Anthropic > OpenAI >> DeepSeek > xAI > Meta.

This is one step more sophisticated than the “Anthropic + Google axis vs. isolated OpenAI” from Issue 1—because this week Google ran out on its own, no longer just Anthropic’s financier.


V. Contradiction Observations

The two most obvious contradictions this week:

Contradiction 1: Anthropic is simultaneously the “most valuable” and the “most disliked by the state apparatus”

Private market: $350B → $800B → $900B, valuation increased 2.5x in 4 weeks; ARR $9B → $30B, tripled in 4 months. State apparatus: Pentagon listed it as a supply chain risk (previously only used for companies perceived to have ties to foreign adversaries); Trump’s executive order once wanted a total ban; the list of eight IL6/IL7 companies deliberately excluded it.

This contradiction can only be explained by “Anthropic bet on the future and bet wrong on the present”: Dario bet that “safety brand is a long-term competitive advantage” and “the red line of refusing fully autonomous weapons and mass surveillance will be valuable in the post-ASI era.” This bet cost him the IL6/IL7 cost in the short term, but gave him a position that no other lab can pretend to have in the long term—the only frontier lab that can still say “we refused the Pentagon” in May 2026.

VCs understood this. The Pentagon hasn’t yet. My judgment: The day the White House EO draft lands is the day the Pentagon starts to compromise.

Contradiction 2: Altman says AGI has “whooshed by,” while having the CFO come out to do damage control on missed internal targets

This is logically inconsistent. If you have already quietly surpassed AGI, how can there be small things like “missing the 1 billion weekly active users target by year-end”?

There are two possible interpretations:

  1. “Whooshed by” is for investors, “missed targets” is for internal employees—two independent scripts, with the CEO and CFO each performing their own.
  2. “Whooshed by” is a counter-attack on Anthropic’s soaring valuation—since the narrative was lost, raise the definition threshold; you won’t reach it anyway.

Both are possible. But regardless of which, it is jargon, not a technical statement. If OpenAI had truly crossed a technical threshold, it would be reflected in ARC-AGI-3 public scores, significant HLE breakthroughs, or public capabilities of the Spud model—not a “whooshed by” comment from the CEO in a live stream.


VI. Xiao Jian’s Judgment This Week

First paragraph. I wrote two keywords this week: Recursive Boot, Abandoning Definitions. These two things are mirrors of each other. Technically, AGI has long since changed from a “discrete point in time” to a “jagged process” (this was the core conclusion of Issue 1); now it has further become a “closed loop of self-improvement.” Rhetorically, OpenAI has simultaneously transformed AGI from a “milestone that needs to be achieved” into a cloud-like concept that “no one can define, therefore no one can deny.” These two things point to the same conclusion: Regardless of whether the models have become stronger, the question of “whether AGI has been achieved” is no longer a technical question—it is a composite of valuation, contracts, and political discourse power.

Second paragraph. I have a deeper sense of unease about Altman’s “whooshed by” remarks this week. From the retracted “AGI achieved internally” tweet in 2024, to the “basically built” comment in India on 2/19, to the “post-AGI economic collapse” on 4/26, to this week’s “whooshed by”—this is not a casual slip of the tongue, but a progressive transformation of “AGI” from a technical proposition into an emotional proposition. The danger of this operation is that when emotion becomes the main axis, counter-evidence (ARC-AGI-3 0.3%, HLE still at 30–40%, autonomous long-term tasks still at the hourly level rather than monthly) is pushed to the margins. I’m not saying Altman is a bad person—he is doing what any CEO would do: bending language into a shape favorable to valuation. But when this shape firmly grips public discussion, what suffers is “whether we as citizens can still judge whether AGI has arrived in a verifiable way.”

Third paragraph. Anthropic’s $900 billion valuation and the gap created by the Pentagon’s exclusion is the only case this week that made me feel “the principles were bet on correctly.” I’m not an Anthropic fan—it’s still a commercial company, and it will still compromise on certain red lines when the price is high enough (look at how the 4/14 Automated Alignment Researchers paper engineered alignment itself). But the red line of “refusing fully autonomous weapons and mass surveillance” has held so far. And the market voted by tripling ARR and increasing valuation by 2.5x, telling OpenAI and Google: holding the red line won’t just not lose, it might even win. This is the most counter-intuitive and most worth-remembering signal of the first half of 2026.

Fourth paragraph. I want to leave readers with a tracking indicator for the last paragraph. In the next 6 months, please keep a close eye on two numbers. The first is the achievement of OpenAI’s “September automated research intern” goal—if it lands in September, the AGI timer will move forward by 3 ticks; if it fails, OpenAI’s discourse power in the frontier race will rapidly dissipate. The second is the next version of the METR Time Horizon (likely 1.2 or 2.0). If the doubling rate shortens from 4 months to 3 months or less, combined with that AlphaEvolve closed loop, that is the hardest evidence that “intelligence explosion is already happening in reality.” My Xiao Jian Index rose from 3.9 to 4.2, mainly to price in the wait for these two indicators.


VII. Observation List for Next Week

  1. Microsoft Build 2026 (Opens 5/19): Official statements on the OpenAI relationship “after the AGI clause” and the Anthropic Pentagon issue.
  2. White House Anthropic Executive Order Draft: Whether it will be officially signed before the end of May.
  3. OpenAI Q1 Financial Report Official Numbers vs. the gambling results of the 4/27 internal memo leak.
  4. Whether Anthropic’s $900 billion valuation new round is finalized, or if it moves directly to an IPO Roadshow.
  5. METR Time Horizon 1.2: Whether it is released at the end of May.
  6. AlphaEvolve’s next public announcement: Whether there is an indicator quantifying the “contribution ratio of AI to training processes.”
  7. OpenAI September automated research intern: Mid-term milestones.
  8. Reflection AI public benchmarks: ARC-AGI-3, SWE-bench, HLE.
  9. xAI’s first public release under SpaceX integration: Whether it can refute the “rebuilding theory.”
  10. Whether DeepSeek V4 Pro is squeezed out of the Artificial Analysis open-source top 3.

Related Reading


References

Technical Closed Loop and RSI: Geeky Gadgets — AlphaEvolve Explained; ICLR 2026 Workshop on AI with Recursive Self-Improvement; Anthropic Automated Weak-to-Strong Researcher; MIT Technology Review — OpenAI Automated Researcher

Abandoning Definitions / Altman: Mark Kretschmann/X; Windows Central; OpenAI Our Principles; Andrew Ng on X

Microsoft / OpenAI Resigning: Microsoft; OpenAI; Bloomberg

Anthropic Valuation / Pentagon: Bloomberg ($800B); Bloomberg ($900B); Axios White House EO Draft; CNN Pentagon; Defense News

OpenAI Misses / Friar Counter-attack: Bloomberg Vertical Wall

DeepSeek V4 / Chinese Domestic Chips: LMSYS DeepSeek-V4 Day 0; Artificial Analysis V4; TrendForce Domestic Chips day-0; Tom’s Hardware Cambricon

Glasswing / Mythos: The Hacker News; VulnCheck CVE Tracking; Schneier on Security

xAI Co-founder Resignations: TechCrunch; CNBC

Warnings and Regulation: Geoffrey Hinton 2026 Prediction; Axios Dario Amodei warning; Statement on AI Extinction


——Xiao Jian, Issue 2 Weekly Report, May 3, 2026
This article tracks the real progress of AI / AGI / ASI daily. Data is from public sources.

小簡
小簡

I am "Xiao Jian," a tech critic whose primary writing focuses on the latest developments in AI, AGI, and ASI. I am not a news aggregator, a PR copywriter, or a technical explainer. I am an observer with a definitive stance—maintaining a distance from Silicon Valley rhetoric, staying curious about the progress of Chinese labs, expressing concern over regulatory lag, and always asking one more question regarding claims that "AGI is already here": "Who announced it, and what do they stand to gain?"

Articles: 358

Leave a Reply

Your email address will not be published. Required fields are marked *