the master
Posts
How Does AI Agent Work?

How Does AI Agent Work?

AI Agents by Google, Anthropic, OpenAI, MIT No Longer Stand Behind Students Research

Himanshu Ramchandani
May 17, 2025 • Estimated Reading Time: 9 minutes

You have already heard a lot of noise about AI agents. Today’s newsletter breaks down how AI agents work and includes my video on building your first AI agent.

I added some content, news, and resources about AI agents by Google, Anthropic, and OpenAI.

I was engaged with AI Engineer HQ and delayed the newsletter edition. The FREE resources to learn AI Engineering will be available in the roadmap itself soon.

In today’s edition:

AI Roundup— guides on AI Agents by Google, Anthropic, OpenAI
Dive Deep Drill— How Does AI Agent Work?
Build Together— Here’s How I Can Help You

The Elite —Build your second brain for Leading AI Products/Projects.

1:1 Coaching Program

AI Roundup

I found these resources, content, and news.

— [news] MIT Says It No Longer Stands Behind Student’s AI Research Paper
— [content] build your first AI agent by using n8n an automation software tool.
— [resource] 9 Guides on AI Agents.

— [news] Amazon and Nvidia are going all in on healthcare AI. Here's why.
— [resource] white paper by Google on Agents.

Start learning AI in 2025

Keeping up with AI is hard – we get it!

That’s why over 1M professionals read Superhuman AI to stay ahead.

Get daily AI news, tools, and tutorials
Learn new AI skills you can use at work in 3 mins a day
Become 10X more productive

Dive Deep Drill

How AI Agents Work?

Imagine if your smart fridge had an AI agent that not only ordered milk when you were out but also debated whether almond milk is better for you based on your browsing history.

Scary or helpful? You decide!

This is the most simple I can define an AI agent:

AI agents have the power to understand our language(because of LLMs), reason, plan, and also execute the tasks given to them without human intervention.

AI Agents can handle complex challenges, making them far more dynamic than basic automation tools.

They are designed as part of software, not just a script, which allows them to have complex interactions with their environment.

How AI Agents Are Different from Simple Automation?

You must be having the same question.

Well, they are different because of 2 major capabilities:

tools
planning

You have seen ChatGPT making mistakes in basic math problems. That is because it only responds based on the data it was trained on.

In the same way, if I ask you to multiply 85 and 65, as a human you can directly answer this if you already know the answer or by using a tool called calculator, correct?

You are doing the same with agents, giving them access to tools.

The second thing is planning.

Take the same math calculation, you can only solve this problem if you know multiplication or you know what parameters to pass to a calculator that is 85 and 65 along with a multiply.

That’s what planning and reason is.

Here is the flow of what happens when you query an AI agent.

The architecture of an AI Agent

3 major components of AI agents:

Orchestration layer
Models
Tools

Let us understand each component individually.

1. Orchestration layer (The Control Center)

Let’s say I want to create an AI agent meet scheduler, I query the scheduler, “I want to host a webinar for all my students”.

This will be considered as a trigger to the AI agent.

orchestration layer

The query can be text, audio, video, or image. (You already know that whatever the type of data is, it will always be converted into numerical values for the machine)

The query will be handled by the orchestration layer aka the control center of an AI agent.

There are 4 major works of the orchestration layer:

Memory: maintaining the memory of your whole interaction.
State: storing the current state of the whole process.
Reasoning: guiding the agent’s reasoning.
Planning: what are the steps and what will be the next step?

It will interact with the model(LLM).

2. Models (The Brain)

The model is the centralized decision-maker for the whole agent.

It is typically an AI model like the Large Language Model.

models in AI agents

To understand the query, formulate a plan, and determine the next action, the model uses reasoning and logic frameworks like:

ReAct
(Reason + Act) ensures thoughtful and deliberate actions
Chain-of-Thought
reason through intermediate steps.
Tree-of-Thoughts
explores multiple paths to find the best solution

The model determines what actions to take, and performs those actions using specific tools.

3. Tools (The Hands)

Using tools the agent can interact with the external world.

Like I told you, a calculator, APIs, web search, external databases, etc.

Tools enable agents to perform actions beyond the model's capabilities, access real-time information, or complete real-world tasks.

There are 3 types of tools:

Extensions: when the agent needs external live API calls.
Functions: similar to programming functions for client-side code execution.
Data Stores: vector databases, RAG, structured and unstructured data.

The model outputs a function and its arguments but doesn’t make a live API call.

The whole process will iterate until the goal is reached.

If you want to build your first AI Agent, here is my video on [YouTube]:

Want to work together? Here’s How I Can Help You

AI Engineering & Consulting (B2B) at Dextar—[Request a Brainstorm]
You are a leader?—Join [The Elite]
Become an AI Engineer in 2025—[AI Engineer HQ]
AI Training for Enterprise Team—[MasterDexter]
Get in front of 5000+ AI leaders & professionals—[Sponsor this Newsletter]

I use BeeHiiv to send this newsletter.

Paper Unfold

A series in which you will get a breakdown of complex research papers into easy-to-understand pointers. If you missed the previous ones:

AI Engineer HQ

PS: Reply to this email if you want me to write on the topic you are interested in.

Him

Reply

or to participate.