UN

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

14.4k 1.2k +7/wk
GitHub
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning
Trend 3

Star & Fork Trend (36 data points)

Stars
Forks

Multi-Source Signals

Growth Velocity

Unstructured-IO/unstructured has +7 stars this period . 7-day velocity: 0.1%.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric unstructured mlops-zoomcamp trigger.dev opencli
Stars 14.4k 14.4k14.4k14.4k
Forks 1.2k 2.9k1.1k1.3k
Weekly Growth +7 +4+15+337
Language HTML Jupyter NotebookTypeScriptTypeScript
Sources 1 111
License Apache-2.0 N/AApache-2.0Apache-2.0

Capability Radar vs mlops-zoomcamp

unstructured
mlops-zoomcamp
Maintenance Activity 100

Last code push 0 days ago.

Community Engagement 42

Fork-to-star ratio: 8.4%. Lower fork ratio may indicate passive usage.

Issue Burden 70

Issue data not yet available.

Growth Momentum 43

+7 stars this period — 0.05% growth rate.

License Clarity 95

Licensed under Apache-2.0. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.