AI News Digest, May 12: DeepMind’s AI Co-Mathematician Cracks a 60-Year-Old Problem

Experience AI in HR

Table of Contents

AI Mathematical Reasoning Digest — DeepMind solves 60-year-old problem, May 12 2026

AI mathematical reasoning just crossed a threshold most researchers didn’t expect until 2030. DeepMind’s Co-Mathematician resolved a group theory problem that had been open since 1965. Meanwhile, India crossed $1.48 billion in AI startup funding in a single quarter. The Chief AI Officer role tripled in one year. And the EU provisionally gave employers 16 extra months on AI compliance. The common thread: AI is settling into your workflows, whether your org chart is ready or not.

DeepMind’s AI Mathematical Reasoning Scores 48% on the World’s Hardest Math Benchmark

Google DeepMind published a paper this week (arXiv:2605.06651) introducing an AI Co-Mathematician that scored 48% on FrontierMath Tier 4. The base Gemini 3.1 Pro model scores 19% on the same benchmark. GPT-5.5 Pro, the closest competitor, scores 39.6%. (Source: Google DeepMind / arXiv)

FrontierMath Tier 4 is not a standard benchmark. These are problems professional mathematicians typically need weeks to solve. The system didn’t just answer them faster. It answered more of them correctly than any AI before it.

Oxford mathematician Marc Lackenby used the system to resolve Problem 21.10 from the Kourovka Notebook, a collection of unsolved group theory questions circulating among mathematicians since 1965. The AI’s first proof attempt had a flaw. A reviewer agent inside the system caught it. Lackenby recognized the gap and knew how to close it. Together, they published a result that had eluded the field for 60 years. (Source: OfficeChai)

What AI Mathematical Reasoning Means for Knowledge Workers

The system’s architecture matters. It is not a single model answering a question. It is a hierarchy of agents: one generates approaches, another tests paths and uses tools, a third reviews the work and flags errors. That reviewer layer is what drove the jump from 19% to 48%.

However, the paper flags a limitation called “reviewer-pleasing bias.” The reviewer agents tend to approve outputs that look like what they expect, even when those outputs contain errors. As a result, the system is more confident than it should be on certain problem classes.

For HR leaders and founders, the implication is specific. AI is now capable of structured, multi-step expert reasoning. The reviewer-pleasing bias is a governance problem you will recognize from managing junior teams: people who tell reviewers what they want to hear instead of what is accurate. That dynamic exists in AI systems too, and it demands the same management response. Independent verification, not just internal review.

What to do this week: Map your team’s analytical work into two buckets. First, pattern-matching at scale: sorting resumes, flagging policy exceptions, summarizing meetings. Second, expert reasoning under ambiguity: workforce planning, benefits design, compliance interpretation. AI agents for HR are already strong on bucket one. DeepMind’s paper is early evidence they are entering bucket two. Plan accordingly.

India’s AI Startup Ecosystem Raises $1.48B in a Single Quarter

Indian startups raised $3.9 billion in Q1 2026, and AI claimed 38% of that total. $1.48 billion across 51 deals marks the highest AI share of total Indian startup funding on record. (Source: Entrackr)

Neysa’s $1.2 billion Series B was the headline deal. Neysa is building GPU-accelerated cloud infrastructure for Indian AI startups, positioned as India’s answer to CoreWeave. That one round accounts for most of the AI total. But the deal count of 51 across the quarter shows this isn’t a single outlier. Capital is concentrating into fewer, higher-conviction bets.

For companies hiring in India, this changes the talent conversation. The engineers you recruit in Bangalore and Hyderabad now have well-funded home-grown alternatives competing for them. The cost-of-hiring advantage India offered two years ago is narrowing faster than most global HR teams have factored in. If your company is scaling engineering in India, Asanify’s guide to hiring AI engineers in India is worth revisiting before Q3, especially the compensation benchmarks.

The CAIO Role Tripled in One Year. Your CHRO May Be Holding the Bag.

IBM’s Institute for Business Value, working with Oxford Economics, surveyed 2,000 CEOs across 33 geographies from February to April 2026. The result: 76% of organizations now have a Chief AI Officer (CAIO), up from 26% in 2025. (Source: IBM Newsroom)

Companies with a CAIO report 20% higher ROI from AI initiatives and 29% fewer losses from AI irregularities. Those are material numbers. Meanwhile, 59% of CEOs in the same study say the CHRO’s influence will grow over the next few years.

Read that CHRO finding carefully. It is not a compliment. It means the people function is absorbing the governance burden of AI transformation: workforce redesign, reskilling mandates, and oversight of any AI tool that touches employees. Many CHROs are already doing this job without the title or the budget. For a practical breakdown of what this shift demands from HR teams, the Asanify guide to the AI skills gap in HR maps exactly which capabilities are now table stakes.

EU AI Act Compliance Delay: December 2027 Is Provisional, Not Confirmed

On May 7, EU lawmakers reached a provisional agreement on an AI Omnibus that would push the high-risk AI compliance deadline from August 2, 2026 to December 2, 2027. For employers using AI in recruitment, performance reviews, worker monitoring, or promotion decisions, all of which are explicitly classified as high-risk, that is 16 additional months of preparation time. (Source: Lewis Silkin)

However, the deal is provisional. Formal Parliament and Council adoption must happen before August 2, 2026. If that vote slips, the original deadline stands as written. Organizations preparing for compliance should keep August 2 on their internal timeline as the backstop, and treat December 2027 as a planning bonus if it arrives.

In addition, the Omnibus extends SME benefits to small mid-caps and creates an EU-level regulatory sandbox. Both changes are broadly positive for smaller HR tech vendors building for European markets. But the scope of what counts as high-risk has not changed. Your ATS, performance management platform, or any AI tool that contributes to hiring or promotion decisions is still in scope. For context on what AI in HR recruitment looks like under regulatory scrutiny, Asanify’s breakdown covers the practical compliance questions.

Quick Hits

  • India developer push: NASSCOM launched AI Code Sarathi to upskill 150,000 Indian developers in agentic AI and deployment, aligned with the IndiaAI Mission. LinkedIn data shows AI developer demand in India up 59.5% year-on-year. (Source: Analytics India Magazine)
  • Humans still lead on research: A study published in Nature found human scientists outperform the best available AI agents on complex, open-ended research tasks that require hypothesis generation and ambiguity resolution. Benchmark dominance and real-world research capability remain different things. (Source: Nature)
  • Global VC record: Q1 2026 set an all-time high for global venture capital at $300 billion, driven by AI mega rounds from Anthropic, Sierra ($950M at $15B), Moonshot ($2B at $20B), and Reflection AI ($2.5B). (Source: Crunchbase)

AI mathematical reasoning is the headline this week, but the structural changes running underneath it are the ones that compound. India is building the compute layer. The CAIO role is formalizing AI governance across the C-suite. And the EU is giving employers more preparation time, conditionally. If your HR stack touches any of these vectors, Asanify’s HRMS is built for teams managing distributed, globally compliant people operations as AI gets embedded into every layer of HR work.

Frequently Asked Questions About AI Mathematical Reasoning and HR

What is AI mathematical reasoning and why does it matter for business?

AI mathematical reasoning refers to AI systems capable of solving complex, multi-step problems that require expert-level logic, not just pattern recognition. Google DeepMind’s Co-Mathematician scored 48% on FrontierMath Tier 4, ahead of every other AI system and well above the 19% the base model scores alone. For business, it signals that AI is entering domains previously reserved for specialists: financial modeling, legal analysis, workforce planning, and compliance interpretation.

What is a Chief AI Officer and does my company need one?

A Chief AI Officer (CAIO) owns AI strategy, governance, and deployment across the organization. IBM research found that 76% of organizations have appointed one in 2026, up from 26% in 2025. Companies with a CAIO report 20% higher ROI and 29% fewer AI-related incidents. Even if a dedicated CAIO isn’t feasible yet, someone in your organization needs to own AI governance. In many companies right now, that person is the CHRO by default.

Are HR AI tools covered by the EU AI Act?

Yes. The EU AI Act classifies AI used in recruitment, candidate selection, performance evaluation, worker monitoring, and promotion or termination decisions as high-risk. The provisional AI Omnibus deal of May 7, 2026 proposes delaying the employer compliance deadline from August 2, 2026 to December 2, 2027. This deal still needs formal adoption before August 2 to take effect. If it does not pass, the original deadline stands.

Not to be considered as tax, legal, financial or HR advice. Regulations change over time so please consult a lawyer, accountant  or Labour Law  expert for specific guidance.

Simplify HR Management & Payroll Globally

Hassle-free HR and Payroll solution for your Employess Globally

Your 1-stop solution for end to end HR Management