Test the Tester
Built for Revyl
Paste a test. Find the holes before the bugs do.
About Revyl
Revyl is a YC Fall 2024 AI-native mobile testing platform. The founders built DragonCrawl, an LLM-based mobile testing framework, at Uber, where it saved $25M in four months. The premise: replace brittle script-based E2E tests, which break on every UI change, with natural-language tests that run on real cloud iOS and Android devices and are wired to OpenTelemetry traces, so bugs get caught before production.
Today vs. With This Demo
| Capability | Script-based E2E | Revyl natural-language | This audit layer |
|---|---|---|---|
| Survives UI changes | No | Yes | N/A |
| Catches missing assertions | Never | Partial | Yes |
| Explains why a test is weak | No | No | Yes |
| Requires writing test code | Yes | No | No |
| Root cause analysis | Manual | Via OTel traces | Pre-run, structural |
| Runs on real devices | Emulators | Yes | N/A |
What the audit covers
Adversarial analysis
Claude approaches your test as an attacker, not a reviewer. The goal is to find ways the test passes while the feature is broken, not ways to make the test more verbose.
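To make "passes while the feature is broken" concrete, here is a minimal sketch in Python. Everything in it is hypothetical and for illustration only: `checkout_total`, the deliberate bug, and both tests are assumptions, not part of Revyl or of the audit itself.

```python
# Hypothetical feature under test: apply a percentage discount to a cart total.
# The bug is deliberate: the discount is never applied.
def checkout_total(prices, discount_pct):
    subtotal = sum(prices)
    return subtotal  # BUG: should be subtotal * (1 - discount_pct / 100)


def weak_test():
    # Passes whenever the call returns a positive number; it never checks the discount.
    total = checkout_total([40.0, 60.0], discount_pct=10)
    assert total > 0


def strict_test():
    # Pins the expected value, so the missing discount is caught.
    total = checkout_total([40.0, 60.0], discount_pct=10)
    assert abs(total - 90.0) < 1e-9


def passes(test):
    """Return True if the test function finishes without an AssertionError."""
    try:
        test()
        return True
    except AssertionError:
        return False
```

With the bug in place, `passes(weak_test)` is true and `passes(strict_test)` is false. The weak test is the kind of hole an adversarial read is meant to surface: it stays green while the feature is broken.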
Severity-scored weaknesses
Every flaw is categorized and rated high, medium, or low. Hard-coded waits and loose selectors are different problems with different blast radii.
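One way to picture a categorized, severity-rated finding is as a small record. The sketch below is an assumption about shape only: the `Weakness` and `Severity` names, the field names, and the severity given to each example category are illustrative, not the audit's actual output schema.

```python
from dataclasses import dataclass
from enum import IntEnum


class Severity(IntEnum):
    LOW = 1
    MEDIUM = 2
    HIGH = 3


@dataclass(frozen=True)
class Weakness:
    category: str       # illustrative label, e.g. "hard-coded wait"
    severity: Severity  # high, medium, or low
    explanation: str    # why this makes the test weak


def sort_by_severity(weaknesses):
    """Return weaknesses ordered from most to least severe."""
    return sorted(weaknesses, key=lambda w: w.severity, reverse=True)


# Example findings; the severity assigned to each category is made up for the demo.
findings = [
    Weakness("hard-coded wait", Severity.MEDIUM, "Sleeps 5s instead of waiting on state."),
    Weakness("missing assertion", Severity.HIGH, "Never checks the final total."),
    Weakness("loose selector", Severity.LOW, "Matches any button on the page."),
]
ordered = sort_by_severity(findings)
```

Sorting by severity puts the widest blast radius first, so `ordered[0]` here is the missing assertion.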
Confidence score
A 0-100 number representing how likely this test is to catch a real bug. Most tests score 40-70. A test that scores 90 is genuinely rare and earns it.
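The page does not say how the score is computed, so the sketch below covers only how a consumer might handle it: clamping a raw value into the 0-100 range and labeling it against the ranges quoted above. The function names and band labels are hypothetical.

```python
def clamp_score(raw):
    """Coerce a raw value into an integer in the 0-100 range."""
    return max(0, min(100, int(round(raw))))


def describe_score(score):
    """Map a 0-100 score to an illustrative band based on the ranges quoted above."""
    if score >= 90:
        return "rare"
    if score > 70:
        return "above typical"
    if score >= 40:
        return "typical"
    return "below typical"
```

For example, `describe_score(clamp_score(55.4))` falls in the "typical" band, and any raw value above 100 is clamped to 100 before it is labeled.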
Rewrite in Revyl style
The audit closes with a rewritten version of the test in Revyl's natural-language format: numbered steps, explicit assertions, no ambiguous verbs.
Run an audit
Paste any test below, or load one of the examples. Works with Cypress, Playwright, Appium, pytest, Revyl natural-language, or plain prose.