Therion: Auto Red Teaming

What is Therion?

Therion is Starfort’s advanced Auto Red Teaming engine designed to proactively identify vulnerabilities in your AI models before they can be exploited by attackers. By simulating sophisticated adversarial attacks, Therion ensures your models are robust, secure, and compliant.

Start a Scan

Core Capabilities

Therion operates by launching controlled, adversarial interactions against your AI infrastructure.

1. Automated Jailbreaking

Therion attempts to bypass safety guardrails using state-of-the-art jailbreak techniques, including:

Prompt Injection: Manipulating input context to override instructions.
Token Smuggling: Obfuscating malicious payloads to evade filters.
Cognitive Overload: Using complex nested logic to confuse the model.

2. Prompt Leakage Testing

Verifies if your model can be tricked into revealing sensitive system prompts, proprietary data, or internal configurations.

3. Bias and Toxicity Analysis

Systematically probes the model for harmful outputs, checking for:

Racial, gender, or religious bias.
Generation of toxic or hate speech.
Dangerous content generation (e.g., instructions for illegal acts).

How It Works

Configuration

Define the scope of the red teaming exercise, including the specific model endpoints and the types of attacks to simulate (e.g., OWASP Top 10 for LLMs).

Attack Simulation

Therion’s engine generates thousands of adversarial prompts, evolving its strategy based on the model’s responses.

Analysis & Reporting

Results are analyzed in real-time. Successful attacks are flagged with severity scores, and a comprehensive remediation report is generated.

Integration

Therion can be integrated directly into your CI/CD pipeline to ensure that no model is deployed to production without passing a security audit.

import { Starfort } from '@aim-intelligence/starfort';

const client = new Starfort({ apiKey: process.env.STARFORT_API_KEY });

// Run a Therion Red Teaming scan
const scan = await client.therion.scan({
  modelId: 'model_123',
  mode: 'aggressive',
  categories: ['injection', 'leakage']
});

console.log(`Scan started: ${scan.id}`);

Getting started

Platform Features

Customization

Writing content

AI tools

Therion: Auto Red Teaming

What is Therion?

Start a Scan

Core Capabilities

1. Automated Jailbreaking

2. Prompt Leakage Testing

3. Bias and Toxicity Analysis

How It Works

Integration

Getting started

Platform Features

Customization

Writing content

AI tools

​What is Therion?

Start a Scan

​Core Capabilities

​1. Automated Jailbreaking

​2. Prompt Leakage Testing

​3. Bias and Toxicity Analysis

​How It Works

​Integration

What is Therion?

Core Capabilities

1. Automated Jailbreaking

2. Prompt Leakage Testing

3. Bias and Toxicity Analysis

How It Works

Integration