Secure Agent Augmentation

@knail2on 9 months ago

0 MIT

FreeCommunity

AI Systems

#ai-actions#ai-functions#ai-helper#generative-ai#model-context-protocol#multi-agent-systems

a scalable, secure framework for AI agents to interact with external systems, perform complex multi-agent workflows, and manage asynchronous operations.

What is Secure Agent Augmentation

Secure Agent Augmentation is a scalable and secure framework designed for AI agents to interact with external systems, perform complex workflows involving multiple agents, and manage asynchronous operations. It utilizes production-grade technologies like FastAPI and OAuth 2.0 for security, aiming to transition AI deployments from proof of concepts to reliable, production-ready environments.

Use cases

The framework supports various use cases such as executing authenticated AI actions via REST endpoints, controlling access to these actions through fine-grained OAuth 2.0 scopes, and managing complex asynchronous workflows that involve multiple AI agents working together. It is particularly beneficial for tasks that require safe interactions with external APIs and data sources.

How to use

To use the framework, developers must first set up the environment, including installing dependencies and configuring database access. After creating JWT keys for secure authentication, they can define API endpoints for agent actions, control access with OAuth scopes, and implement multi-agent workflows, utilizing asynchronous programming capabilities.

Key features

Key features include a robust authentication and authorization mechanism with OAuth 2.0, support for secure external actions through contained APIs, a multi-agent framework for managing complex workflows, and LLM abstraction for flexible interactions with external language models. The architecture is designed for container-first deployment, ensuring scalability and portability.

Where to use

Secure Agent Augmentation is suitable for hosted cloud environments such as AWS, Heroku, and Snowflake, where security, scalability, and accessibility are critical. It is designed to enhance AI capabilities in production settings, particularly in scenarios requiring secure API integrations and multi-agent collaboration.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Secure Agent Augmentation

Use cases

How to use

Key features

Where to use

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

Secure Agent Augmentation

Secure Agent Augmentation is a scalable, secure framework for AI agents to interact with external systems, perform complex multi-agent workflows, and manage asynchronous operations. Built with production-grade technologies like FastAPI, and using OAuth 2.0 for robust security, the goal for this framework is to provide developers intending to enhance their AI agent capabilities beyond POCs and laptops and into productions. The goal to to make AI deployments safer, more capable, and fully accessible in hosted environments.

Architecture
SAAF Architecture

The framework is expected to be deployed in a run-time environment, and includes

AuthN/AuthZ Implements OAuth 2.0 for user authentication and authorization, with granular scopes for controlling access to specific actions and data for calling agents
Secure External Actions wrapped up in fully contained APIs, which double as libraries for agents executing locally in the runtime.
- RAG: To support API data which can bust the input context of LLMs, chunking and vectorization for that data will be supported
- Semantic Layer: For external actions which call data warehouses, a semantic layer (in the form of locally stored yaml/json) for each useful field within database table is stored
  - Along with helpful metadata on each useful field in a table, some hints on what type of data can be served from them along with helpful SQL examples also stored in this semantic layer.
  - this metadata will be available through an API fetch (remotely) or an SQL query (locally).
  - This is super helpful in enriching the prompts for locally running agents or agents which are calling these APIs from outside.
  - By this enrichment, the agents can craft smart SQL queries to extract the requested data.
Multi-Agent-Framework which supports complex workflows involving multiple AI agents that can operate asynchronously, leveraging callbacks to notify of completed tasks.
- a popular multi-agent framework (tbd) will also be deployed as part of this solution.
- The beauty of this approach is that the code powering the external actions (tools) APIs are also locally powering these local agents as imported function calls.
LLM abstraction: LLMs are not hosted as part of this framework, and are called by the MAF through API calls. This allows for

Infrastructure tenets

Container-First: Consistent, scalable, and portable application deployment across diverse environments, enhancing agility and resource efficiency.
Scalable Production-Grade API: Utilizes FastAPI for rapid, yet scalable, deployment. Easily integrates into containerized environments (e.g., Docker, Kubernetes).
Hosted Cloud-Ready Architecture: Built to be deployed on popular hosting solutions instead of running on your laptop like most open-source solutions.
OpenAPI support: Supports the OpenAPI spec out of the box which allows both humans and computers to understand and interact with the API’s functionalities without having access to the underlying codebase.

Why Secure Agent Augmentation?

I see the need for this framework because all common and new agent framework project are written without consideration of security and scalability mind. These frameworks work great on laptops or as POCs, but can’t be easily enhanced for production-grade deployment.

Secure Agent Augmentation represents a secure, versatile framework for enhancing the capabilities of AI agents through providing a safe, authenticated and scalable runtime execution for external actions, and in the future, executing multi-agent framework tasks.

Directory Structure

project_root/
├─ .env                             # Env vars ; in .gitignore
├─ requirements.txt                 # Python dependencies
├─ docker/                          # Docker related configs and Compose
│  ├─ Dockerfile.api_server         # Dockerfile for API server container
│  ├─ Dockerfile.admin_app          # Dockerfile for Admin app container
│  ├─ docker-compose.yml            # Orchestration file
├─ src/
│  ├─ api_server/                   # Flask-based OAuth2 server + resource endpoints
│  │  ├─ __init__.py                # Makes api_server a package
│  │  ├─ main.py                   # Entry point for Flask app
│  │  ├─ config.py                 # Env detection, DB URLs, token expiry
│  │  ├─ database.py               # Sync SQLAlchemy setup
│  │  ├─ models.py                 # DB models
│  │  ├─ schemas.py                # Pydantic-like schemas (optional) or dataclasses
│  │  ├─ security.py               # Authlib OAuth2 integration, JWT logic
│  │  ├─ utils.py                  # Utilities
│  │  ├─ constants.py              # Available scopes
│  │  ├─ oauth2_implementation.py  # Flask blueprint with /token, /introspect, /revoke, /jwks
│  │  ├─ logging_conf.py           # Logging configuration
│  │  ├─ templates/
│  │  │  ├─ admin_gui.html
│  │  ├─ admin/
│  │  │  ├─ api/
│  │  |  │  ├─ endpoints.py     # Flask-based admin API (password protected)
│  │  |  └─ gui/
│  │  |     ├─ endpoints.py     # Flask-based GUI (password protected)
│  │  └─ routers/                  # Resource endpoints as separate blueprints
│  │     ├─ auth_testing/
│  │     │  ├─ endpoints.py
│  │     ├─ release_oversight/          # wip
│  │     │  ├─ endpoints.py
│  │     ├─ service_ticket_analysis/    # wip
│  │     │  ├─ endpoints.py
│  │     ├─ tableau_dashboard_usage/    # wip
│  │     │  ├─ endpoints.py
│  │     ├─ internal_audit_analysis/    # wip
│  │     │  ├─ endpoints.py
│  │     ├─ slack_analysis/             # wip
│  │     │  ├─ endpoints.py
│  │     ├─ common_actions/             # wip
│  │     │  ├─ endpoints.py
│  ├─ oauth_client/
│  │  ├─ client.py
│  ├─ common/
│  │  ├─ utils.py
│  ├─ python_notebooks/
│  │  ├─ admin_and_client_demo.ipynb    # Notebook demonstrating usage
└─ README.md

Setup

Install Python on your computer. Recommended versions are 3.9.x through 3.11.x, Some snowflake libraries balk with 3.12
Install Docker on your computer, and postgresDB as well.
Install uv:

curl -LsSf https://astral.sh/uv/install.sh | sh

Clone the Repository:

git clone https://github.com/knail2/secure-agent-augmentation.git

Switch into the directory and create virtual environment:

cd secure-agent-augmentation
uv venv

Activate the virtual environment:

source .venv/bin/activate  # On Unix/macOS
# or
.venv\Scripts\activate  # On Windows

Install dependencies:

uv pip install -e ".[dev]"  # Install with development dependencies
# or
uv pip install -e .  # Install only runtime dependencies

Create Public and Private keys
We will upload the private key into the auth server, and that dude will be using the private key to sign the (JWT) tokens that it will be issuing to the clients, which the clients could use to verify the authenticity of the tokens. (this is an added layer of security provided by the oauth2.0 RFC that we are implementing here)
- For LOCAL:
  - Create a private/public key. See creating keys for JWT signing
  - Place those (JWT) private/public keys in:
- ~/.ssh/jwt_private_key.pem
- ~/.ssh/jwt_public_key.pem
- For hosted environments like AWS, HEROKU or SNOWFLAKE:
  - Put these keys in environment variables:
    - JWT_PRIVATE_KEY
    - JWT_PUBLIC_KEY
      directly in AWS, Heroku or Snowflake.
Set up local environments using this .env example:
(I don’t upload .env to github obviously! see my .gitignore)
ENVIRONMENT=local
POSTGRES_USER=local_user
POSTGRES_PASSWORD=local_password
POSTGRES_DB=local_db
POSTGRES_HOST=localhost
POSTGRES_PORT=5432
TOKEN_EXPIRY_SECONDS=3600
Run the server (locally)
uvicorn src.api_server.main:app –reload
Set up database and local user, just run run.sh
the script attempts to log into the local postgres db (you may need to provide admin creds), and checks to see if user/password and db exist, and if not, then it creates them, and binds the user to the db, and starts the app.
(for localhost testing only) Run Jupyter Lab: jupyter lab and fire up notebooks/admin_and_client_demo.ipynb to test the various APIs in this run-time framework.

:construction: Running the server on Docker

:construction: Deploying on Heroku

the main branch has Procfile which deploys the api_server code to the heroku repo

:construction: Security Guidelines

I need to flesh this out with a standard approach on how scopes can be used for external AI actions. There are no standards defined on the internet, so I’ll have to come up with some abstractions myself. Stay tuned

OAuth 2.0 Scopes: Fine-tune access control by assigning different scopes to each API endpoint.
Secure Defaults: All actions require an OAuth token for access.

:construction: Example Usage (WIP)

All the stuff in this section is in construction, the info below is random GPT generated stuff, it doesn’t work.
I’ll come back and fix it up

Register an Action: Create AI agent actions accessible via REST endpoints.
Control Access with OAuth 2.0 Scopes: Limit who can call actions, and what data they have access to, using fine-grained scopes.
Multi-Agent Asynchronous Workflow: Trigger a complex action that involves multiple agents working together, asynchronously.

:construction: Creating authenticated API Endpoints (WIP)

To create a simple authenticated AI action, add a new endpoint:

from fastapi import APIRouter, Depends
from app.security import oauth2_scheme

router = APIRouter()

@router.get("/run-action", tags=["actions"])
async def run_action(token: str = Depends(oauth2_scheme)):
    # Logic for the AI agent action
    return {"message": "Action executed successfully."}

:construction: Multi-Agent Asynchronous Workflow Example (WIP)

Here is an example of triggering an asynchronous multi-agent workflow:

from fastapi import APIRouter, Depends
from app.security import oauth2_scheme
import asyncio

router = APIRouter()

@router.post("/trigger-async-workflow", tags=["workflows"])
async def trigger_async_workflow(token: str = Depends(oauth2_scheme)):
    # Start an async workflow involving multiple agents
    task = asyncio.create_task(async_workflow())
    return {"message": "Workflow triggered successfully, you will be notified upon completion."}

async def async_workflow():
    # Simulate a complex, multi-agent workflow
    await asyncio.sleep(5)  # Simulate work being done asynchronously
    # Logic to post results to a callback endpoint or update status
    print("Workflow completed, posting results.")

Contributing

Fork the repository.
Create a new feature branch (git checkout -b feature-name).
Commit your changes (git commit -m 'Add feature').
Push to the branch (git push origin feature-name).
Create a Pull Request.

Create clients using the admin UI or admin API. The client_id and client_secret generated can be used by another system (the OAuth2 client) to request tokens.

JWT Keys and Deployment

Local: Keys in ~/.ssh/
Heroku/Snowflake: Keys in environment variables JWT_PRIVATE_KEY and JWT_PUBLIC_KEY.

Ensure keys are properly generated as shown above and stored securely. On production environments like Heroku or Snowflake, never commit keys to source control; only set them via environment variables.

Additional Notes

The code is environment-agnostic: it detects ENVIRONMENT and configures database and JWT keys accordingly.
DATABASE_URL is used for Heroku Postgres.
For Snowflake, all required Snowflake credentials must be in environment variables.
The resource endpoints (/protected, /protected_post, /highly_confidential, etc.) rely on scopes included in the JWT scope claim. The token issuing process (via /token endpoint) must include appropriate scopes. The admin can configure clients with the desired scopes.

License

MIT

Contact

Feel free to open an issue or pull request if you encounter any problems or have a feature request.

Thanks!
Omer.

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers