AI Agents & Vertical Intelligence Platforms (Q2
2025)
Key
Takeaways: The "agentic" AI wave builds on GPT-era LLMs by adding tools, memory, and
autonomy. New models like Claude 3.5 (Haiku/Sonnet), GPT-4 Turbo/Vision and Google's Gemini 1.5
excel at different agentic tasks (tool use, long-context reasoning)anthropic.comblog.google.
Architectures combine LLMs with plug-ins/APIs, retrieval (RAG) and multi-agent orchestration.
Startups are proliferating: e.g. Dust (multi-agent assistants), Hippocratic AI (healthcare LLM
agents), Apriora (AI interviewers)techcrunch.comfiercehealthcare.com.
Enterprise use cases span healthcare (AI
intake, documentation), finance (fraud,
compliance), legal/tax (automated drafting),
devtools (code assistants), HR (AI interviewers) and moreanthropic.comturing.com.
Market forecasts are staggering (e.g. $5.3B→$216.8B by 2035globenewswire.com,
~$8B→$52.6B by 2030marketsandmarkets.com)
as agents tap the ~$11T U.S. labor spendnea.com.
Business models range from Agent-as-a-Service (AaaS) subscriptions to fine-tuning/licensing
APIs, and even SaaS replacement by agent-driven workflows. Investors are active: Anthropic
raised $3.5B (May 2025)anthropic.com,
Perplexity is targeting $500Mreuters.com,
Hippocratic AI hit unicorn status ($1.64B valuation)fiercehealthcare.com,
and Dust has raised ~$21Mtechcrunch.com.
Big tech is moving too: Salesforce is acquiring Convergence.ai (May 2025) to power
"Agentforce"salesforce.com;
IBM and ServiceNow have launched agent featuressiliconangle.com.
Regulation & Risk: Experts warn agents
must "follow the law" (despite autonomy)lawfaremedia.org.
In practice this means strict oversight in sensitive domains: e.g. EU's AI Act will impose
transparency (labeling AI-generated content)digital-strategy.ec.europa.eu
and human-in-the-loop for high-risk systems; U.S. state laws now require formal AI risk
management (Montana's "Right to Compute" law mandates NIST-based risk policiesncsl.org).
Start low-risk pilots and bake in governancewww2.deloitte.com.
Evolution of AI Agents
Whereas 2022–24's LLM boom produced chatbots (ChatGPT, Claude, Bard), the next wave focuses
on agents: LLMs that act on data and tools. Starting from GPT with plug-ins, to systems
like AutoGPT or LangChain-powered chains, agents can now automate workflows rather than just
answer questions. Today's "agent" products often come pre-connected to databases, APIs and
company knowledge. For example, Dust lets teams spin up multiple specialized assistants linked
to internal docs (e.g. a @supportExpert aware of a company's Slack, knowledge base, and
CRM)techcrunch.com.
The shift is towards verticalized agents: narrow,
goal-driven AIs for specific industries or functions, rather than one-size-fits-all botsblog.dust.ttnea.com.
This specialization promises higher accuracy and efficiency on domain tasks (e.g. legal research
or patient interviews) because agents can be tailored with domain knowledge and compliance
rules.
Model Comparison: GPT-4 Turbo, Gemini 1.5,
Claude 3.5, etc.
Latest large models differ in agentic strengths:
-
GPT-4 Turbo & Omni (GPT-4o): Offers huge context (128K tokens
in Turbo) and multimodal input (text, vision, voice)en.wikipedia.orgen.wikipedia.org.
It supports tool use: e.g. tagged prompts like <search>…</search>
let GPT-4 call web search or APIs on
demanden.wikipedia.org.
Turbo's version is cheaper/"lighter" and adds vision, enabling agents to interpret images or
live audio.
-
Gemini
1.5 (Pro/Flash): Google's MoE-based successor to Gemini 1.0. Gemini 1.5 Pro
matches prior "Ultra" performance with much less compute; it natively handles 128K-token
context (with private preview up to 1 million tokens)blog.google.
Google reports it still finds "needle-in-a-haystack" facts (99% accuracy) even in very long
contextblog.google.
The architecture (mixture-of-experts) is designed for rapid in-context learning and
efficient scalingblog.googleblog.google.
-
Claude
3.5 (Haiku, Sonnet): Anthropic's latest Claude variants are tuned for agents.
"Sonnet" improves reasoning and tool use, boosting performance on agent benchmarks
(TAU-bench tool tasks) from 62.6% to 69.2%anthropic.com.
"Haiku" is optimized for speed/cost and still leads on coding/agent metrics: it scored 40.6%
on SWE-bench Verified, outperforming GPT-4oanthropic.com.
Claude 3.5 retains Anthropic's safety guardrails while being fully multi-turn
conversational.
-
Others: Smaller/mid-sized models (Mistral 1.0, Llama3, Qwen,
etc.) also integrate into agent platforms via open APIs or custom fine-tuning. Specialty
"assistant" models (e.g. Amazon's internally developed Nova) have appeared as well.
Each model now comes with native plugin interfaces or SDKs. For instance, GPT-4's developer API
supports function-calling to arbitrary web APIs; Google's Gemini API and Anthropic's Claude API
likewise allow custom tool calls. Competitive benchmarks show Claude 3.5 can even exceed GPT-4
on some tasksanthropic.com,
but OpenAI and Google lead on multimodality and scale. In practice, companies often choose based
on compliance (e.g. HIPAA-ready deployments), cost, and ecosystem (e.g. integrating Google
Cloud's tools vs Microsoft's Azure AI).
Agent Architectures: Tools, Memory, Coordination
Modern agents are not single LLM calls but systems. Key components include:
-
Tool
Integration: Agents invoke external tools (search engines, calculators, CRM APIs,
code execution environments). For example, GPT-4 can be instructed to wrap a query in <search></search>
to call a web
search API and incorporate the resultsen.wikipedia.org.
Many platforms now support "plugins" or function-calling. This lets agents answer questions
using up-to-date data or perform actions (e.g. booking travel via a web API).
-
Retrieval-Augmented Generation (RAG) and Memory: Agents often use
vector databases or knowledge graphs to retrieve domain-specific knowledge before or during
generation. This extends their context far beyond token limits. Some systems maintain a
long-term "memory" (user preferences, project notes) to personalize interactions. For
instance, enterprise agents pull from a company's document corpus (like Dust connecting to
Google Drive/Notion) so they "know" company policy and historytechcrunch.com.
-
Multi-Agent Coordination: Complex workflows may be split among
specialized sub-agents (planner, verifier, executor). Frameworks like LangChain or
Microsoft's Semantic Kernel orchestrate multiple LLM calls: one agent plans steps, others
execute API calls or parse results. Constellation Research identifies criteria for agent
platforms that include agent orchestration, failover handling, and marketplaces of reusable
skillsinfosys.com.
In practice, startups like AutoGPT or Agents.guru demonstrate chains of LLM-LLM
collaboration.
-
Validation & Safety: Agents incorporate checks to avoid
"hallucinations" or harmful actions. Some use self-critique (LLM checks its own answers) or
run parallel decider agents. Anthropically, Claude's model is advised by lower-level
"supervisor" models for safety and accuracy. In regulated fields (finance, healthcare),
agent outputs are often passed through rule-based filters and human review loops.
Enterprise Use Cases
AI agents are already transforming business
workflows. Examples include:
-
Healthcare: Patient triage and documentation are heavily
agentified. Hippocratic AI's LLM (4.2T parameters) powers phone-based "care agents" for
chronic disease follow-up and post-discharge support, helping address nursing shortagesfiercehealthcare.comglobenewswire.com.
Novo Nordisk reported using Claude agents reduced a 12-week clinical report to 10 minutesanthropic.com.
Agents also draft notes from doctor–patient conversations (e.g. Abridge) and assist
radiologists with image analysis by retrieving patient history.
-
Finance & Risk: Agents automate fraud detection, compliance
reporting and customer service. Feedzai uses AI
agents to monitor $8 trillion in payments annually for fraud/AMLturing.com.
Salient’s agents handle loan servicing (calls,
emails, voice) end-to-end, cutting handle times by ~60% and processing hundreds of millions
in loansturing.comaimresearch.co.
Regulatory reporting (Basel, KYC checks) and real-time risk monitoring are prime targets for
AI workflows.
-
Legal
& Tax: Document drafting and review can be delegated to agents. Thomson
Reuters’ CoCounsel uses Claude to
summarize complex tax regulations for CPAs, and tools like EvenUp generate demand letters and pleadings in personal injury
cases. These agents leverage legal databases (e.g. Westlaw) via RAG to ensure compliance.
-
DevTools & IT: Developer productivity is boosted by agentic
code assistants. Replit integrated Claude into
its “Agent” feature to translate natural-language requests into code (unit tests, bug
fixes), driving ~10× growth in revenueanthropic.com.
GitHub Copilot (OpenAI Codex) and tools like Cursor offer a conversational IDE. Site reliability and ops teams
also deploy agents to triage alerts and run diagnostic scripts via chat.
-
HR
& Recruitment: Platforms like Apriora use AI interviewers (“Alex”) to conduct live video
interviews and analyze candidates, automating resume screening and generating predictive
hiring scoresaimresearch.co.
This reduces recruiter workload and aims to reduce bias by applying consistent criteria.
-
Sales & Customer Support: Agents automate proposal writing,
email drafting, and customer Q&A. For example, Sweetspot automates government contract discovery and proposal
writingaimresearch.co.
Internal assistants (e.g. @SalesAI) pull CRM data to draft personalized outreach. Service
teams deploy intelligent chatbots that can escalate to humans, blending 24/7 coverage with
agency.
-
Other Industries: Niche sectors are adopting agents too. In
private aviation, Avitor.ai leverages
agents on 1.5M flight records to instantly generate charters and quotes, boosting booking
conversions by 50%. In manufacturing, agents coordinate supply chain updates from IoT feeds.
The common theme is: agents ingest domain data (EHRs, legal codes, financial records, etc.),
automate end-to-end workflows, and surface actionable insights.
Market Opportunity & Landscape
The potential market is enormous. NEA estimates
the addressable market by replacing labor with AI at ~$11 trillion in U.S. annual
wagesnea.com
(vs ~$450 billion enterprise software market). Research forecasts similarly sky-high
growth: global AI agent revenues are projected at ~$7.8B in 2025 to $52.6B by 2030marketsandmarkets.com
(46% CAGR), or even $216B by 2035globenewswire.com.
Sectors like customer service, healthcare,
finance and multi-agent orchestration
tools lead adoptionglobenewswire.comturing.com.
Competitive landscape: Horizontal platforms and startups abound.
OpenAI (GPT-4/Turbo with Plugins), Google (Gemini/Bard), Anthropic (Claude), and major cloud ML
services (AWS Bedrock, Azure AI) provide the base models and toolkits. Atop these, dozens of
startups build vertical agents or orchestration platforms. For example, Hippocratic AI (healthcare), Harvey AI (legal), Alan AI
(enterprise search), Sana AI (satellite data). CRM
and enterprise software giants are also racing in: Salesforce is building “Agentforce” and just
agreed to acquire Convergence.ai
(May 2025) for its UI-navigating agentssalesforce.com.
IBM and ServiceNow have announced new agent capabilities for automation and securitysiliconangle.com.
At the same time, incumbents like Microsoft integrate GPT models into Office and Azure, blurring
lines between SaaS and agentic services.
In summary, there is a shift from one-off AI
services to full-stack Agent Platforms.
Firms seek unified solutions that manage multiple specialized agents (e.g. Dust’s platform)
rather than point tools. Venture investors are betting heavily: early 2025 saw European VCs
alone commit ~$548M to AI agent startupsnews.crunchbase.com.
Established companies are quick to partner or acquire (e.g. Cohere’s purchase of
workflow-automation startup Ottogridtechcrunch.com,
Salesforce/Convergence). The race is on to dominate both the foundational model layer and the
industry-specific agent layer.
Business Models: AaaS, Fine-Tuning, SaaS
Disruption
Several monetization strategies are emerging.
Agents-as-a-Service (AaaS) is a popular
model: firms offer managed, subscription-based agents tailored to business tasks. For example, a
company might license an “AI support agent” that ties into its helpdesk and data. Conversely,
API/fine-tuning models let customers pay to
customize a core LLM with their data (e.g. fine-tuned GPT-4 or Claude for internal use). Many
startups also follow traditional SaaS
pricing (monthly/seat) for their agentic apps (e.g. an AI-driven CRM tool). Notably, some
analysts argue vertical AI agents could replace
multiple fragmented SaaS tools by automating whole workflows end-to-endnea.comturing.com.
In practice, vendors mix approaches: Anthropic and OpenAI offer usage-based APIs plus
consulting; startups like Dust sell enterprise licenses; others (e.g. Amelia or Spate) plan to
embed agents into clients’ apps with revenue tied to outcomes. Underpinning all models is the
“AI flywheel”: agents improve with more usage/data, driving lock-in (see “data flywheels” in
Nvidia’s Neo agent toolsperplexity.ai).
Recent Funding & M&A Trends
Investor and corporate activity has surged around
agent AI:
-
Anthropic – May 2025: $3.5B Series E, ~$61.5B post-money
valuationanthropic.com.
Anthropic plans to accelerate Claude R&D (e.g. Sonnet/Haiku) and expand deployments
(e.g. Amazon Bedrock, Google Vertex).
-
Perplexity AI – May 2025: in talks for ~$500M round at $14B
valuationreuters.com
(Accel-led). Its generative search/agent platform is aimed at enterprise search and commerce
(even partnering with PayPal for agentic check-out). Perplexity was ~$9B in 2024, so this
up-round reflects strong growth.
-
Hippocratic AI – Jan 2025: $141M Series B led by Kleiner
Perkins, valuation ~$1.64B (unicorn)fiercehealthcare.com.
The funds fuel its specialized healthcare agents (and new “app store” for clinician-designed
agents). To date Hippocratic has ~$278M total funding.
-
Dust – Jun 2024: $16M Series A (Sequoia-led)techcrunch.com;
Jan 2025: +$16M (separate blog). Dust’s platform lets companies build multiple custom
assistants on their data. Its founders emphasize that multiple focused agents outperform one general agent, and report
high engagement (70% of users active weekly)blog.dust.tt.
-
Cohere – May 2025: Acquired Ottogrid (AI workflow automation,
Vancouver)techcrunch.com
as it pivots to enterprise usage. Cohere (a16z-backed) had missed growth targets and is
focusing on healthcare/government clients.
-
Convergence.ai → Salesforce – May 2025: Salesforce agreed to buy
this London-based agent startup (teams with Rulex AI)salesforce.com.
Convergence’s agents can navigate web apps and UIs, so Salesforce will embed its tech in
“Agentforce” for end-to-end digital workflows.
-
Other notable deals: Numerous startups have raised funds or been
acquired: e.g. Mistral (€1.8B valuation, 2024),
Hugging Face ($200M in 2023), open-source LLM
companies, AI agent platform firms (Jaive, LangChain backing), etc. Large tech giants remain
aggressive acquirers of AI talent and tools (Microsoft, Google, Amazon have collectively
spent billions on AI M&A).
Regulatory & Governance Considerations
Obedience & Liability: Experts stress that autonomy does not absolve legal responsibility. As
Lawfare notes: “Before entrusting AI agents with government power, it’s essential to verify that
they’ll obey the law — even when instructed not to”lawfaremedia.org.
In practice, this means firms must embed guardrails and human-in-the-loop controls, especially
in regulated industries.
Data
Privacy and Safety: Agents handling personal data must comply with privacy laws (e.g.
HIPAA in healthcare, GDPR in EU). Training data and retrieval sources must be vetted for bias
and consent. For instance, a healthcare agent must only use HIPAA-compliant datasets; an HR
agent must ignore protected attributes. Models and agents should undergo safety audits (as
Hippocratic does: its LLM outperformed GPT-4 on medical safety benchmarksglobenewswire.com)
and maintain data logs for accountability.
Emerging Laws: In the US, regulation is fragmented. States are
active: e.g., Montana’s new “Right to Compute” law (2025) requires AI deployers to adopt
risk-management policies aligned with the NIST AI Risk Management Frameworkncsl.org.
New York mandates public inventories of automated decision tools in agencies. Industry
guidelines (FDA for AI medical devices, EEOC on employment) also apply. Federally, the AI Bill
of Rights and FTC guidance encourage transparency and accountability in automated decisions.
Internationally, the EU AI Act (expected to take effect 2026) will impose strict rules on
“high-risk” systems (those affecting health, legal status, safety, employment, etc.)digital-strategy.ec.europa.eu.
High-risk agents must pass conformity assessments, have human oversight, and maintain logsdigital-strategy.ec.europa.eu.
Critically, providers of generative AI will be obliged to mark AI-generated content: e.g.
chatbots must disclose they’re AI, and “deep fakes” or news content must be clearly labelled as
syntheticdigital-strategy.ec.europa.eu.
(This directly impacts agentic outputs – training an agent on EU data or deploying in Europe
will require compliance.)
Practical Advice: Many experts (e.g. Deloitte) advise a phased
rollout. Start with low-risk pilots under
human supervisionwww2.deloitte.com.
Establish robust data pipelines: agents thrive on clean,
structured domain dataturing.com.
Implement continuous monitoring and feedback loops: any agent mistake can have “real world”
consequences that erode trustwww2.deloitte.com.
Finally, document everything: clear audit trails for decisions, and user disclosures that AI is
involved.
Conclusion: The agent era is here. For investors and founders, the
key is building task-focused, integrated AI
solutions with clear ROI. The technology is advancing rapidly, but so are expectations for
safety and value. Organizations that can harness agents to cut costs and open new capabilities
(while managing their risks) stand to capture a huge slice of the next software revolutionnea.comwww2.deloitte.com.
Sources: Research and market reportsglobenewswire.commarketsandmarkets.comnea.comwww2.deloitte.com;
company announcementsanthropic.comfiercehealthcare.comsalesforce.com;
industry newsreuters.comtechcrunch.com;
technical documentationen.wikipedia.orgblog.google;
and expert analyseslawfaremedia.orgwww2.deloitte.com.