TR
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
5.7k 352 +1/wk
GitHub
article-extractor corpus-builder corpus-tools crawler html-to-markdown html2text llm news-aggregator news-crawler nlp rag readability
Trend
3
Star & Fork Trend (36 data points)
Stars
Forks
Multi-Source Signals
Growth Velocity
adbar/trafilatura has +1 stars this period . 7-day velocity: 0.4%.
Deep analysis is being generated for this repository.
Signal-backed technical analysis will be available soon.
| Metric | trafilatura | Baichuan-7B | QOwnNotes | freegpt-webui |
|---|---|---|---|---|
| Stars | 5.7k | 5.7k | 5.7k | 5.7k |
| Forks | 352 | 506 | 492 | 1.2k |
| Weekly Growth | +1 | +0 | +0 | -1 |
| Language | Python | Python | C++ | Python |
| Sources | 1 | 1 | 1 | 1 |
| License | Apache-2.0 | Apache-2.0 | GPL-2.0 | GPL-3.0 |
Capability Radar vs Baichuan-7B
trafilatura
Baichuan-7B
Maintenance Activity 0
Last code push 208 days ago.
Community Engagement 31
Fork-to-star ratio: 6.2%. Lower fork ratio may indicate passive usage.
Issue Burden 70
Issue data not yet available.
Growth Momentum 41
+1 stars this period — 0.02% growth rate.
License Clarity 95
Licensed under Apache-2.0. Permissive — safe for commercial use.
Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.