← All projects
Active · v5.0.0-beta.1

Blackreach

Autonomous web research agent. Runs overnight. Handles Cloudflare, dynamic JS, login walls, and rate limits. Built because every other option failed the moment it touched anything real.

2,904
test assertions
v5.0
current version
27
stars
MIT
license
live session

What it is

Blackreach is an autonomous browser agent built for research tasks that actually need to finish. ReAct loop, custom DOM walker, stealth Playwright. Give it a goal and it runs until done. Overnight, unsupervised, on real sites that push back.

Want the full story on how it works and why I built it the way I did? Read the writeup →

How a task looks

Thought: I need the inscription table on this page
Action: navigate("https://sigla.phis.me/")
Observation: Page loaded. Nav: [About, Database, Signs].
  Main: table, 847 rows, columns [ID, Site, Text, Image].
  Interactive: pagination controls, export button.
Thought: extract all rows and handle pagination
Action: extract_table(selector=".inscription-table", paginate=True)
...

The DOM walker compresses a 200k token page down to a 2k token observation. The model sees structure, not noise. Every interactive element gets a numeric ID so the agent clicks [15], not a CSS selector it has to guess.

Architecture

Task input
    |
    v
[ ReAct Loop                              ]
[   Think (LLM) <---> DOM Walker         ]
[         |                              ]
[         v                              ]
[   Act ------> Stealth Playwright       ]
    |
    v
Output + task log

Capabilities

DOM Semantic Walker
Reduces 200k token pages to 2k structured observations. The model sees what matters, not the markup.
Stealth Playwright
Patches webdriver flags, viewport signatures, CDP artifacts, and input timing before any page loads.
Session Resume
Tasks auto-save on interrupt. Pick up exactly where you left off with full context intact.
Stuck Detection
Loop detection with automatic strategy switching and source failover. It doesn't just spin.
Cross-Session Memory
SQLite-backed. Remembers what worked per domain so it gets better over time.
Loud Failure Mode
2,904 test assertions covering real failure modes. Silent success is not acceptable.

Install

pip install blackreach
playwright install chromium
blackreach run "your task here"

Open source at gitlab.com/null.phnix/blackreach. Issues and PRs welcome.