farhoodlabs/trebuchet

Chris Farhood 47a6e4933a

feat: backport auth-validation preflight + email_login credentials

Backport upstream Shannon PR #335:
- Add credential validation activity that drives a real browser login
  before the full pipeline, catching bad credentials early
- New email_login credentials type for magic-link and email-OTP flows
- Make credentials.password optional for passwordless flows
- Playwright stealth config (chrome.runtime, plugin simulation, UA)
- Centralize prompt directory resolution into resolvePromptDir helper
- New AUTH_LOGIN_FAILED error code with non-retryable classification
- Remove dangerous-pattern validation on credentials.password
- Pipeline-testing stub for auth validation (returns success)
- Auth validation timeout of 10 minutes for browser-based login
- .playwright directory workspace overlay for CLI/Docker

Co-Authored-By: Paperclip <noreply@paperclip.ing>

2026-05-20 00:59:27 +00:00

.claude

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

.gitea

chore: move .github folder to .gitea for Gitea compatibility

2026-05-18 15:56:05 +00:00

apps

feat: backport auth-validation preflight + email_login credentials

2026-05-20 00:59:27 +00:00

assets

Add files via upload

2026-04-16 12:54:16 -07:00

charts/hightower

ci: migrate from GitHub Actions to Gitea Actions

2026-05-16 18:55:32 -04:00

infra

feat(infra): replace Temporal dev server with production deployment

2026-04-21 06:36:40 -04:00

repos

Initial commit

2025-10-03 19:35:08 -07:00

sample-reports

Initial commit

2025-10-03 19:35:08 -07:00

skills/hightower

feat: add hightower skill for Paperclip agents

2026-04-23 14:00:35 +00:00

workspaces

feat: add npx CLI with monorepo, CI/CD, and ephemeral worker architecture (#256)

2026-03-27 02:34:29 +05:30

.dockerignore

feat: mount user repo as read-only with writable shannon overlay (#273)

2026-04-03 23:46:28 +05:30

.env.example

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

.gitattributes

feat: mount user repo as read-only with writable shannon overlay (#273)

2026-04-03 23:46:28 +05:30

.gitignore

ci: migrate from GitHub Actions to Gitea Actions

2026-05-16 18:55:32 -04:00

.npmrc

chore: enforce pnpm minimum release age and upgrade to v10.33.0 (#266)

2026-04-02 01:22:24 +05:30

.releaserc.json

ci: migrate from GitHub Actions to Gitea Actions

2026-05-16 18:55:32 -04:00

biome.json

feat: add npx CLI with monorepo, CI/CD, and ephemeral worker architecture (#256)

2026-03-27 02:34:29 +05:30

CLAUDE.md

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

COVERAGE.md

Initial commit

2025-10-03 19:35:08 -07:00

Dockerfile

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

entrypoint.sh

feat: extract pipeline core for library consumption (#282)

2026-04-10 04:53:36 +05:30

LICENSE

chore: change license to AGPL-3.0

2025-11-26 18:45:36 -08:00

package.json

chore: enforce pnpm minimum release age and upgrade to v10.33.0 (#266)

2026-04-02 01:22:24 +05:30

pnpm-lock.yaml

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

pnpm-workspace.yaml

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

README.md

Update GitHub URLs from hightower to trebuchet repos

2026-05-06 23:56:51 +00:00

SHANNON-PRO.md

feat: backport Opus 4.7 + adaptive thinking, remove scan tools, add --help to scripts

2026-05-20 00:26:25 +00:00

tsconfig.base.json

feat: add npx CLI with monorepo, CI/CD, and ephemeral worker architecture (#256)

2026-03-27 02:34:29 +05:30

tsconfig.json

feat: add npx CLI with monorepo, CI/CD, and ephemeral worker architecture (#256)

2026-03-27 02:34:29 +05:30

turbo.json

feat: add npx CLI with monorepo, CI/CD, and ephemeral worker architecture (#256)

2026-03-27 02:34:29 +05:30

README.md

Trebuchet — AI Pentester

Trebuchet is a fork of Shannon by Keygraph, wrapped with a REST API and Kubernetes tooling for cluster-based deployments.

What is Trebuchet?

Trebuchet is an API-driven AI pentester built on top of Shannon's autonomous penetration testing engine. It performs white-box security testing of web applications and APIs by combining source code analysis with live exploitation.

Unlike the upstream Shannon CLI, Trebuchet is designed to run as a service on Kubernetes — scans are triggered via REST API, orchestrated by Temporal, and executed in ephemeral worker pods.
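As a rough illustration of what triggering a scan over REST could look like, the sketch below only builds the HTTP request. The `/scans` path, the bearer-token header, and the `repoUrl` / `targetUrl` fields are assumptions made for this example, not Trebuchet's documented API; check the API source under `apps` for the real routes and payload.

```typescript
// Hypothetical request body: these field names are assumptions for illustration.
interface ScanRequest {
  repoUrl: string; // repository to analyze (white-box input)
  targetUrl: string; // running application to test
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Build (but do not send) the request that would start a scan.
// "/scans" is an assumed path, not a confirmed Trebuchet endpoint.
function buildScanRequest(baseUrl: string, token: string, scan: ScanRequest): PreparedRequest {
  const url = new URL("/scans", baseUrl).toString();
  return {
    url,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${token}`,
      },
      body: JSON.stringify(scan),
    },
  };
}

// Placeholder hosts and token, for illustration only.
const prepared = buildScanRequest("https://trebuchet.example.internal", "example-token", {
  repoUrl: "https://git.example.internal/acme/shop.git",
  targetUrl: "https://staging.shop.example.internal",
});
console.log(prepared.url);
```

The prepared request can be passed to `fetch(prepared.url, prepared.init)`. Keeping request construction separate from sending makes it easy to inspect exactly what would go to the service.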

Important

White-box only. Trebuchet expects access to your application's source code and repository layout.

Features

Fully Autonomous Operation: A single API call launches the full pentest. Handles 2FA/TOTP logins (including SSO), browser navigation, exploitation, and report generation without manual intervention.
Reproducible Proof-of-Concept Exploits: The final report contains only proven, exploitable findings with copy-and-paste PoCs. Vulnerabilities that cannot be exploited are not reported.
OWASP Vulnerability Coverage: Identifies and validates Injection, XSS, SSRF, and Broken Authentication/Authorization.
Code-Aware Dynamic Testing: Analyzes source code to guide attack strategy, then validates findings with live browser and CLI-based exploits against the running application.
Integrated Security Tooling: Leverages Nmap, Subfinder, WhatWeb, and Schemathesis during reconnaissance and discovery phases.
Parallel Processing: Vulnerability analysis and exploitation phases run concurrently across all attack categories.
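The parallel processing and proof-only reporting described above can be pictured with a small sketch. It is not Trebuchet's pipeline code: `analyze`, `exploit`, and the `Finding` shape are hypothetical stand-ins for the real agents, stubbed here so the example runs on its own.

```typescript
type Category = "injection" | "xss" | "ssrf" | "auth";

interface Finding {
  category: Category;
  proven: boolean; // true only if exploitation succeeded
}

// Stub analysis agent (assumption): returns candidate issues for a category.
async function analyze(category: Category): Promise<string[]> {
  return category === "ssrf" ? [] : [`${category}-candidate`];
}

// Stub exploitation agent (assumption): a finding is proven if any candidate exists.
async function exploit(category: Category, candidates: string[]): Promise<Finding> {
  return { category, proven: candidates.length > 0 };
}

// Each category runs analysis then exploitation; the categories run concurrently.
// Promise.all rejects if any single pipeline throws, which is fine for a sketch.
async function runCategories(categories: Category[]): Promise<Finding[]> {
  const outcomes = await Promise.all(
    categories.map(async (category) => exploit(category, await analyze(category))),
  );
  // Report only findings that were actually exploited.
  return outcomes.filter((finding) => finding.proven);
}

runCategories(["injection", "xss", "ssrf"]).then((findings) => {
  console.log(findings.map((f) => f.category).join(", "));
});
```

In this stub the SSRF pipeline produces no candidates, so it is dropped from the result, mirroring the rule that unexploitable vulnerabilities are not reported.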

Architecture

Trebuchet uses a multi-agent architecture that combines white-box source code analysis with dynamic exploitation across five phases:

        +----------------------+
        |   Pre-Reconnaissance |
        |  (nmap, subfinder,   |
        |  whatweb, code scan) |
        +----------+-----------+
                   |
                   v
        +----------------------+
        |   Reconnaissance     |
        |  (attack surface     |
        |   mapping)           |
        +----------+-----------+
                   |
                   v
        +----------+----------+
        |          |          |
        v          v          v
  +-----------+ +---------+ +---------+
  | Vuln      | | Vuln    | |   ...   |
  |(Injection)| |  (XSS)  | |         |
  +-----+-----+ +----+----+ +----+----+
        |             |           |
        v             v           v
  +-----------+ +---------+ +---------+
  | Exploit   | | Exploit | |   ...   |
  |(Injection)| |  (XSS)  | |         |
  +-----+-----+ +----+----+ +----+----+
        |             |           |
        +------+------+-----------+
               |
               v
        +----------------------+
        |      Reporting       |
        +----------------------+

Each scan runs as an ephemeral Kubernetes Job with a per-invocation Temporal task queue, enabling concurrent scans with different target repositories.
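One way to read "per-invocation task queue" is that every scan gets a unique queue name, shared between the workflow that schedules the work and the worker Job that polls for it. The sketch below shows that idea only. The `scan-` and `trebuchet-worker-` prefixes, the `JobSpec` shape, and the `TASK_QUEUE` / `REPO_URL` variable names are hypothetical, not taken from the Trebuchet codebase.

```typescript
import { randomUUID } from "node:crypto";

// Hypothetical description of one worker Job; not a real Kubernetes manifest.
interface JobSpec {
  jobName: string;
  taskQueue: string;
  env: Record<string, string>;
}

// Derive a queue name unique to one scan (naming scheme is an assumption).
function taskQueueFor(scanId: string): string {
  return `scan-${scanId}`;
}

// Plan the worker Job for a scan. A fresh UUID per call keeps concurrent
// scans on separate queues, so workers never pick up each other's tasks.
function planScanJob(repoUrl: string, scanId: string = randomUUID()): JobSpec {
  const taskQueue = taskQueueFor(scanId);
  return {
    jobName: `trebuchet-worker-${scanId}`,
    taskQueue,
    env: { TASK_QUEUE: taskQueue, REPO_URL: repoUrl },
  };
}

const first = planScanJob("https://git.example.internal/acme/shop.git");
const second = planScanJob("https://git.example.internal/acme/billing.git");
console.log(first.taskQueue !== second.taskQueue);
```

Because each worker polls only its own queue, two scans against different repositories can run side by side without sharing state.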

Deployment

Kubernetes manifests live in a separate repository: farhoodlabs/trebuchet-infra.

Sample Reports

Sample penetration test reports from industry-standard vulnerable applications:

OWASP Juice Shop — 20+ vulnerabilities including auth bypass and database exfiltration. View Report
c{api}tal API — ~15 critical/high vulnerabilities including command injection and auth bypass. View Report
OWASP crAPI — 15+ critical/high vulnerabilities including JWT attacks and database compromise. View Report

Benchmark

Shannon Lite scored 96.15% (100/104 exploits) on a hint-free, source-aware variant of the XBOW security benchmark.

Full results with detailed agent logs and per-challenge pentest reports

Disclaimers

Warning

DO NOT run Trebuchet against production environments. It actively executes attacks to confirm vulnerabilities. Use it only against sandboxed, staging, or local development environments.

Caution

You must have explicit, written authorization from the owner of the target system before running Trebuchet. Unauthorized scanning is illegal.

Verification is Required: Human oversight is essential to validate all reported findings. LLMs can still generate hallucinated content.
Targeted Vulnerabilities: Broken Authentication & Authorization, Injection, XSS, SSRF.
Cost: A full test run typically takes 1 to 1.5 hours and may cost about $50 USD when using Claude Sonnet.

License

Released under the GNU Affero General Public License v3.0 (AGPL-3.0).

Support

Report bugs: GitHub Issues
Discussions: GitHub Discussions

Based on Shannon by Keygraph

Languages

TypeScript 91.7%

JavaScript 6.3%

Dockerfile 1.2%

Go Template 0.7%

Shell 0.1%