ScrapeGraphAI/Scrapegraph-ai

application · Python · 23.2k stars · 2k forks
Created Jan 2024 · last pushed 8 days ago
Overall: A (90 / 100)
scraping scraping-python llm web-crawler web-scraping ai-scraping crawler markdown rag web-crawlers ai-crawler ai-search large-language-model web-data-extraction web-search web-scraper data-extraction web-data webscraping firecrawl-alternative
Last analyzed: April 3, 2026
Created: January 27, 2024 · Last pushed: March 31, 2026

Dimension Scores

Individual health dimension scores for this repository

Security 40/100
Testing 100/100
Documentation 100/100
Architecture 100/100
DevOps 100/100
Maintenance 100/100
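The overall 90 / 100 is consistent with an unweighted mean of the six dimension scores listed above. The sketch below checks that arithmetic. The equal weighting is an assumption inferred from the numbers, not a documented formula, and the mapping from score to letter grade is not modeled.

```python
# Dimension scores as listed in this report.
dimension_scores = {
    "Security": 40,
    "Testing": 100,
    "Documentation": 100,
    "Architecture": 100,
    "DevOps": 100,
    "Maintenance": 100,
}


def overall_score(scores: dict[str, int]) -> float:
    """Unweighted mean of the dimension scores (assumed aggregation)."""
    return sum(scores.values()) / len(scores)


print(overall_score(dimension_scores))  # 90.0
```

Under this assumption, the single weak dimension (Security) accounts for the entire 10-point gap from a perfect score.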

Detailed Findings

Security 40/100
Security policy (SECURITY.md): SECURITY.md exists but appears to be a placeholder (< 200 chars). Add contact info and a disclosure process.
Pinned dependencies (Actions SHA): 0/11 action refs pinned to SHA (0%). Pin actions to a full commit SHA for supply-chain safety.
Token permissions: At least one workflow uses explicit restrictive permissions.
Dependency update automation: No Dependabot or Renovate config found.
Code ownership: No code ownership file. Add CODEOWNERS or OWNERS for review enforcement.
No committed .env files: No .env files committed.
.gitignore present: .gitignore found.
CI workflows (branch protection proxy): GitHub Actions found (likely branch protection in place).
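The "0/11 action refs pinned to SHA" finding can be reproduced locally with a rough count of `uses:` references in workflow files. The sketch below is one way to do that, not the scorecard's actual check: the regular expressions, the function names (`action_refs`, `pinned_ratio`, `scan_repo`) and the sample workflow are all illustrative assumptions, and the 40-character SHA in the sample is a placeholder rather than a real commit.

```python
import re
from pathlib import Path

# Matches `uses: owner/repo[/path]@ref` at the start of a YAML line.
# Local actions (`uses: ./path`) have no `@ref` and are skipped.
USES_RE = re.compile(r"^\s*-?\s*uses:\s*([^\s@]+)@(\S+)", re.MULTILINE)
# A full commit SHA is exactly 40 hexadecimal characters.
FULL_SHA_RE = re.compile(r"[0-9a-f]{40}")


def action_refs(workflow_text: str) -> list[tuple[str, str]]:
    """Return (action, ref) pairs found in one workflow file's text."""
    return USES_RE.findall(workflow_text)


def pinned_ratio(workflow_texts: list[str]) -> tuple[int, int]:
    """Return (pinned, total) counts across all given workflow texts."""
    refs = [pair for text in workflow_texts for pair in action_refs(text)]
    pinned = sum(1 for _, ref in refs if FULL_SHA_RE.fullmatch(ref))
    return pinned, len(refs)


def scan_repo(repo_root: str) -> tuple[int, int]:
    """Count pinned refs in <repo_root>/.github/workflows/*.yml and *.yaml."""
    workflows = Path(repo_root, ".github", "workflows")
    texts = [p.read_text(encoding="utf-8") for p in sorted(workflows.glob("*.y*ml"))]
    return pinned_ratio(texts)


# Hypothetical workflow: one tag reference, one SHA-pinned reference.
sample = """
jobs:
  test:
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@0123456789abcdef0123456789abcdef01234567  # placeholder SHA
"""

pinned, total = pinned_ratio([sample])
print(f"{pinned}/{total} action refs pinned to SHA")  # 1/2 action refs pinned to SHA
```

Running `scan_repo(".")` from a checkout would give a comparable count for the real workflows, though the result may differ from the report if the scorecard counts references differently.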
Testing 100/100
CI workflows: GitHub Actions detected.
Test files: 75 test-related file(s) found.
Coverage configuration: Coverage/test runner config found.
Test runner configured: Test runner configuration detected.
Pre-commit hooks: Using pre-commit.
Documentation 100/100
README quality: README.md is 12,789 chars.
LICENSE file: License: MIT.
CONTRIBUTING guide: Contributing guide found.
CHANGELOG: Changelog or release notes found.
Documentation directory or API docs: Documentation directory or API docs found.
Repository description: "Python scraper based on AI".
Architecture 100/100
Type checking (mypy / pyright): pyproject.toml found (may contain [tool.mypy]).
Linter (ruff / flake8 / pylint): Linter config found.
Code formatter (black / ruff / yapf): Formatter config found.
Package structure (src/ or package layout): Python package structure found.
Build configuration (pyproject.toml / setup.py): Build config found.
DevOps 100/100
CI/CD pipeline: GitHub Actions.
Container support (Docker): Docker config found (Dockerfile, Compose).
Release automation: Release automation configured.
Issue/PR templates: Issue/PR templates found.
Deployment/Infrastructure config: Deployment or build config found.
Maintenance 100/100
Last commit recency: Last commit 9 days ago; actively maintained.
Open issue freshness: Median open issue age 3 days (oldest sampled: 8 days).
Recent releases: Latest release 9 days ago (5 recent releases).
Bus factor: 3, moderate (VinciGit00, semantic-release-bot, PeriniM). More contributors would reduce risk.
Community adoption (stars): 23,206 stars; strong community adoption.
Maintainer funding: Funding file found (.github/FUNDING.yml); sustainability signal.