Portuguese corpus growth control

Seed Management

Review the sources that feed the background corpus crawler, keep trusted domains active, and collect new source suggestions without touching the database.

What this page controls
Crawler seeds
Moderation mode
Pending review
Preferred sources
Trusted PT domains
Action flow
Suggest, approve, crawl

Current Seeds

Each card represents a source the background crawler may use to grow the Portuguese corpus.

Loading seeds...
Loading seeds...

Add Seed

Add a missing source as a direct URL or search query. New seeds start as pending review, and the domain can be auto-detected.