TR

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

5.7k 352 +1/wk
GitHub
article-extractor corpus-builder corpus-tools crawler html-to-markdown html2text llm news-aggregator news-crawler nlp rag readability
Trend 3

Star & Fork Trend (36 data points)

Stars
Forks

Multi-Source Signals

Growth Velocity

adbar/trafilatura has +1 stars this period . 7-day velocity: 0.4%.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric trafilatura Baichuan-7B QOwnNotes freegpt-webui
Stars 5.7k 5.7k5.7k5.7k
Forks 352 5064921.2k
Weekly Growth +1 +0+0-1
Language Python PythonC++Python
Sources 1 111
License Apache-2.0 Apache-2.0GPL-2.0GPL-3.0

Capability Radar vs Baichuan-7B

trafilatura
Baichuan-7B
Maintenance Activity 0

Last code push 208 days ago.

Community Engagement 31

Fork-to-star ratio: 6.2%. Lower fork ratio may indicate passive usage.

Issue Burden 70

Issue data not yet available.

Growth Momentum 41

+1 stars this period — 0.02% growth rate.

License Clarity 95

Licensed under Apache-2.0. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.