ScrapeGraphAI/Scrapegraph-ai
application Python 23.2k stars
2k forks
Created Jan 2024 · last pushed 8 days ago
scrapingscraping-pythonllmweb-crawlerweb-scrapingai-scrapingcrawlermarkdownragweb-crawlersai-crawlerai-searchlarge-language-modelweb-data-extractionweb-searchweb-scraperdata-extractionweb-datawebscrapingfirecrawl-alternative
A
90
/ 100
scraping scraping-python llm web-crawler web-scraping ai-scraping crawler markdown rag web-crawlers ai-crawler ai-search large-language-model web-data-extraction web-search web-scraper data-extraction web-data webscraping firecrawl-alternative
Last analyzed: April 3, 2026
Created: January 27, 2024 Last pushed: March 31, 2026
Dimension Scores
Individual health dimension scores for this repository
Security 40
Testing 100
Documentation 100
Architecture 100
DevOps 100
Maintenance 100
Detailed Findings
Security 40/100
✗ Security policy (SECURITY.md) SECURITY.md exists but appears to be a placeholder (< 200 chars) — add contact info and disclosure process
✗ Pinned dependencies (Actions SHA) 0/11 action refs pinned to SHA (0%) — pin actions to full commit SHA for supply-chain safety
✓ Token permissions At least one workflow uses explicit restrictive permissions
✗ Dependency update automation No Dependabot or Renovate config found
✗ Code ownership No code ownership file — add CODEOWNERS or OWNERS for review enforcement
✓ No committed .env files No .env files committed
✓ .gitignore present .gitignore found
✓ CI workflows (branch protection proxy) GitHub Actions found (likely branch protection in place)
Testing 100/100
✓ CI workflows GitHub Actions detected
✓ Test files 75 test-related file(s) found
✓ Coverage configuration Coverage/test runner config found
✓ Test runner configured Test runner configuration detected
✓ Pre-commit hooks Using pre-commit
Documentation 100/100
✓ README quality README.md is 12789 chars
✓ LICENSE file License: MIT
✓ CONTRIBUTING guide Contributing guide found
✓ CHANGELOG Changelog or release notes found
✓ Documentation directory or API docs Documentation directory or API docs found
✓ Repository description Description: "Python scraper based on AI"
Architecture 100/100
✓ Type checking (mypy / pyright) pyproject.toml found (may contain [tool.mypy])
✓ Linter (ruff / flake8 / pylint) Linter config found
✓ Code formatter (black / ruff / yapf) Formatter config found
✓ Package structure (src/ or package layout) Python package structure found
✓ Build configuration (pyproject.toml / setup.py) Build config found
DevOps 100/100
✓ CI/CD pipeline GitHub Actions
✓ Container support (Docker) Docker config found (Dockerfile) (Compose)
✓ Release automation Release automation configured
✓ Issue/PR templates Issue/PR templates found
✓ Deployment/Infrastructure config Deployment or build config found
Maintenance 100/100
✓ Last commit recency Last commit 9 day(s) ago — actively maintained
✓ Open issue freshness Median open issue age: 3 day(s) (oldest sampled: 8 days)
✓ Recent releases Latest release 9 day(s) ago (5 recent releases)
✓ Bus factor Bus factor 3 — moderate (VinciGit00, semantic-release-bot, PeriniM). More contributors would reduce risk.
✓ Community adoption (stars) 23,206 stars — strong community adoption
✓ Maintainer funding Funding file found (.github/FUNDING.yml) — sustainability signal