DJ

datajuicer/data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

6.2k 356 +8/wk
GitHub
data data-analysis data-pipeline data-processing data-science data-visualization foundation-models instruction-tuning large-language-models llm llms multi-modal
Trend 3

Star & Fork Trend (22 data points)

Stars
Forks

Multi-Source Signals

Growth Velocity

datajuicer/data-juicer has +8 stars this period . 7-day velocity: 0.3%.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric data-juicer fragments plano tensorflow_cookbook
Stars 6.2k 6.2k6.2k6.2k
Forks 356 8673822.4k
Weekly Growth +8 +3+11-1
Language Python TypeScriptRustJupyter Notebook
Sources 1 111
License Apache-2.0 Apache-2.0Apache-2.0MIT

Capability Radar vs fragments

data-juicer
fragments
Maintenance Activity 100

Last code push 0 days ago.

Community Engagement 29

Fork-to-star ratio: 5.7%. Lower fork ratio may indicate passive usage.

Issue Burden 70

Issue data not yet available.

Growth Momentum 48

+8 stars this period — 0.13% growth rate.

License Clarity 95

Licensed under Apache-2.0. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.