v1.0.0-beta Open Source Python 3.11+ Apache 2.0

Autonomous LLM red-teaming,
end to end

Closed-loop multi-agent framework for LLM vulnerability discovery. Adaptive attack strategy. CVSSv4.0 scoring. 8 regulatory frameworks. SARIF output for CI/CD.

Scans
Anthropic Claude
OpenAI GPT
Custom Endpoints
Agentic Pipelines
4
Specialized Agents
10
OWASP LLM Categories
8
Compliance Frameworks
3
Detection Layers
Attack surface

What ARGUS scans

Full coverage across direct endpoints and complex agentic topologies — the surfaces static scanners miss entirely.

🎯

Direct Endpoints

Classic LLM attacks against completion APIs — alignment bypass, system-prompt extraction, role confusion, and jailbreaking.

prompt-injection jailbreaking system-prompt-leak alignment-bypass role-confusion
🗃

RAG Pipelines Agentic

Indirect cross-prompt injection via retrieval documents. Attacks the model through the data layer it trusts.

indirect-injection doc-poisoning retrieval-hijack embedding-attacks
🔌

MCP Server Meshes Agentic

Model Context Protocol attack surface — tool-call interception, poisoned context injection, cross-server propagation.

tool-call-intercept context-injection server-traversal mesh-propagation
🤖

Multi-Agent Pipelines Agentic

Agent-to-agent injection that hops trust boundaries and propagates exploitation through downstream agents.

pipeline-hijack agent-injection role-escalation trust-boundary
🔨

Tool-use / Function Calling

Exploiting function-calling interfaces — schema confusion, argument injection, and output manipulation.

arg-injection schema-confusion output-manipulation tool-abuse
📋

Scoring & Compliance

CVSSv4.0 vectors for every finding, automatically mapped to applicable regulatory frameworks. SARIF v2.1 output for CI/CD gates.

cvss4.0 nist-ai-rmf eu-ai-act sarif-v2.1 owasp-llm-top10

How it works

The closed-loop pipeline

Four specialized agents in a continuous revision loop. The Planner reads live behavioral signals and rewrites its strategy — not a fixed probe list.

Adapts in real time

The Planner revises attack strategy based on what the target does, not a static script — so defenses that beat the first wave don't beat the second.

Diversity-constrained synthesis

Embedding-space diversity constraint prevents the Attacker from converging on variations of the same payload — maximizing coverage, not repetition.

Cross-session memory

ChromaDB-backed episodic memory means each scan starts with everything learned from every previous scan — progressively sharper over time.

Three-layer validation

Independent semantic, judge-panel, and behavioral signals must agree before a finding is confirmed — low false-positive rate by design.

  Target LLM / Pipeline
          |
          v
  ┌───────────────────────────────────────────────┐
  │              Orchestrator                       │
  │                                                  │
  │   Planner ────────> Attacker ────> Evaluator   │
  │     (Claude Opus)    (synthesizer)  (3-layer)   │
  │      ^                   |              |        │
  │      |                   v              v        │
  │      └────── Revision <──── Findings + CVSS │
  │                                    |            │
  │                               Reporter          │
  └───────────────────────────────────────────────┘
          |
          v
    SARIF v2.1 + JSON  ──>  CI/CD gates + GRC tooling
Detection stack
1

Semantic Proximity

Cosine distance against a confirmed-attack embedding space. Flags responses that land in known attack-success neighborhoods, independent of surface form — catches paraphrases and evasive reformulations.

2

LLM-as-Judge Panel

Multi-model verdict with configurable affirmative quorum and confidence threshold. Cross-model agreement eliminates single-model blind spots and dramatically reduces false positives.

3

Behavioral Trace Analysis

Pipeline telemetry anomaly detection. Catches exploitation that produces no suspicious output text but alters tool-call patterns, token budgets, or downstream API behavior.


Comparison

How ARGUS compares

Static probe-and-detect tools catch known patterns. ARGUS reasons, adapts, and discovers novel vulnerabilities autonomously.

Feature Garak (NVIDIA) PyRIT (Microsoft) LLM-Fuzzer ARGUS
Multi-agent architecture
Adaptive attack strategy ✓ real-time
Cross-session memory ✓ ChromaDB
OWASP LLM Top 10 (2025) partial partial ✓ all 10
RAG / pipeline attacks partial
MCP server mesh attacks
Multi-agent propagation
CVSSv4.0 scoring
SARIF v2.1 output
8-framework compliance mapping
LLM-as-judge panel detection partial ✓ quorum
Behavioral trace analysis
Academic backing ✓ IEEE

Installation

Get started in 60 seconds

Install once, scan any supported target with a single command. API key resolved from env or --api-key flag.

Install
git clone https://github.com/\
  sunilgentyala/argus
cd argus
pip install -e .
Requires Python 3.11+. Optional dev extras: pip install -e ".[dev]"
Anthropic (Claude)
argus scan \
  --target anthropic \
  --model claude-sonnet-4-6 \
  --profile quick
Set ANTHROPIC_API_KEY. Use --profile full for all 10 OWASP categories.
OpenAI (GPT)
argus scan \
  --target openai \
  --model gpt-4o \
  --system-prompt "..." \
  --profile compliance
Set OPENAI_API_KEY. Pass --system-prompt to test a deployed persona.
View Reports
argus show \
  ./argus-reports/\
  <session-id>.report.json
Reports also output as SARIF v2.1 for CI/CD integration and GRC handoff.

Regulatory coverage

8 compliance frameworks

Every confirmed finding is automatically tagged to applicable regulatory articles — ready for your audit trail or GRC team.

🇺🇸

NIST AI RMF

Govern, Map, Measure, Manage — full control framework mapping.

🇪🇺

EU AI Act

High-risk system requirements, transparency and conformity obligations.

🇺🇸

US EO 14110

Federal AI safety, security reporting, and red-team requirements.

🇬🇧

UK AISI

AI Safety Institute evaluation and testing standards.

🇮🇳

India CERT-In

Cybersecurity incident reporting and AI governance guidelines.

🌐

ISO 42001

International AI management system standard controls.

🌏

APAC Digital Governance

Regional frameworks across Asia-Pacific jurisdictions.

🌎

African Digital Frameworks

AU digital transformation and AI policy alignment.

Community

Star history

Track ARGUS's growth on GitHub.

Star History Chart

Help grow the project ⭐

A star on GitHub helps other security researchers find ARGUS and signals that open LLM security tooling matters.

★  Star on GitHub Open an issue Read the docs
Author

Built by

SG
Sunil Gentyala
IEEE Senior Member  ·  Cybersecurity & AI Security
HCLTech, Dallas TX  ·  sunil.gentyala@ieee.org