Open-source scraper pipeline for finding robotaxi license plates, uploading evidence, and creating moderation submissions in a Supabase-backed tracker.
More providers can be added by dropping a new scraper into src/scrapers/ and registering it in src/index.ts.
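As a rough illustration of that drop-in pattern, the sketch below shows one way a scraper and its registration could look. The `Scraper` and `ScrapedPost` shapes, the `exampleScraper` object, and `runAllScrapers` are hypothetical names chosen for this example; the actual interface in src/scrapers/ and the registration code in src/index.ts may differ.

```typescript
// Hypothetical shapes for illustration; the project's real types may differ.
interface ScrapedPost {
  provider: string;
  url: string;
  imageUrls: string[];
}

interface Scraper {
  name: string;
  scrape(searchTerms: string[]): Promise<ScrapedPost[]>;
}

// A new provider implements the interface...
const exampleScraper: Scraper = {
  name: "example-provider",
  async scrape(searchTerms: string[]): Promise<ScrapedPost[]> {
    // A real scraper would call the provider's API or fetch pages here.
    return searchTerms.map((term) => ({
      provider: "example-provider",
      url: `https://example.com/search?q=${encodeURIComponent(term)}`,
      imageUrls: [],
    }));
  },
};

// ...and is added to a list that the orchestrator iterates over.
const scrapers: Scraper[] = [exampleScraper];

async function runAllScrapers(searchTerms: string[]): Promise<ScrapedPost[]> {
  const all: ScrapedPost[] = [];
  for (const scraper of scrapers) {
    const posts = await scraper.scrape(searchTerms);
    all.push(...posts);
  }
  return all;
}
```

Keeping every provider behind one small interface means the orchestrator does not need to change when a provider is added; only the registration list grows.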
This project is designed to run via GitHub Actions once per hour.
Workflow: .github/workflows/scrape.yml
- Install dependencies
  npm install
- Create .env
  cp .env.example .env
  npm run setup
- Run the scraper once
  npm run scrape:once
Add the following repo secrets:
- GEMINI_API_KEY
- SUPABASE_URL
- SUPABASE_SERVICE_ROLE_KEY
- BOT_USER_ID
The scheduled workflow runs hourly and can be triggered manually.
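Because a missing secret would otherwise only surface partway through a scheduled run, it can help to check the four variables up front. The following is a minimal sketch, assuming the secrets reach the process as environment variables with the same names; `loadEnv` is a hypothetical helper, not a function from this repository.

```typescript
// Variable names taken from the repo secrets listed above.
const REQUIRED_ENV = [
  "GEMINI_API_KEY",
  "SUPABASE_URL",
  "SUPABASE_SERVICE_ROLE_KEY",
  "BOT_USER_ID",
] as const;

type EnvName = (typeof REQUIRED_ENV)[number];
type Env = Record<EnvName, string>;

// Hypothetical helper: throws one error that lists every missing variable.
function loadEnv(source: Record<string, string | undefined>): Env {
  const missing = REQUIRED_ENV.filter((name) => !source[name]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  const env = {} as Env;
  for (const name of REQUIRED_ENV) {
    env[name] = source[name] as string;
  }
  return env;
}
```

In a Node.js entry point this would typically be called as `loadEnv(process.env)` before any scraping starts.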
- src/index.ts: pipeline orchestration
- src/scrapers/: provider scrapers
- src/config/: env + search terms
- src/vision/: robotaxi detection + plate extraction
- src/database/: Supabase reads/writes
- src/storage/: image upload helpers
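To show how these directories might fit together, here is a sketch of an orchestration loop with each stage passed in as a function. Every type and function name in it (`Candidate`, `Detection`, `PipelineStages`, `runPipeline`) is a hypothetical placeholder, and the real flow in src/index.ts may order or batch the steps differently.

```typescript
// All names below are illustrative assumptions, not the project's actual API.
interface Candidate {
  imageUrl: string;
  sourceUrl: string;
}

interface Detection {
  isRobotaxi: boolean;
  plate: string | null;
}

interface PipelineStages {
  scrape(): Promise<Candidate[]>; // role of src/scrapers/
  detect(candidate: Candidate): Promise<Detection>; // role of src/vision/
  upload(candidate: Candidate): Promise<string>; // role of src/storage/, returns stored URL
  submit(plate: string, evidenceUrl: string): Promise<void>; // role of src/database/
}

// Runs each candidate through detection, upload, and submission in order.
// Returns the number of submissions created.
async function runPipeline(stages: PipelineStages): Promise<number> {
  let submitted = 0;
  const candidates = await stages.scrape();
  for (const candidate of candidates) {
    const detection = await stages.detect(candidate);
    // Skip images that are not robotaxis or have no readable plate.
    if (!detection.isRobotaxi || detection.plate === null) continue;
    const evidenceUrl = await stages.upload(candidate);
    await stages.submit(detection.plate, evidenceUrl);
    submitted += 1;
  }
  return submitted;
}
```

Passing the stages in as an object keeps the loop testable with stubs, so no network, vision, or Supabase calls are needed to exercise it.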
To add a provider, start here: docs/ADDING_PROVIDER.md.
See CONTRIBUTING.md for the fastest path to a clean PR.