NVIDIA-NeMo/Curator
Scalable data preprocessing and curation toolkit for LLMs
Stars: 1.5k | Forks: 252 | Weekly growth: +2 stars
data data-curation data-prep data-preparation data-processing data-processing-pipelines data-quality datacuration datarecipes deduplication fast-data-processing fine-tuning
Star & Fork Trend (39 data points; series: Stars, Forks)
Multi-Source Signals
Growth Velocity
NVIDIA-NeMo/Curator gained 2 stars this period. 7-day velocity: 0.5%.
| Metric | Curator | Lumos | nlp-lang | coffee |
|---|---|---|---|---|
| Stars | 1.5k | 1.5k | 1.5k | 1.5k |
| Forks | 252 | 111 | 495 | 75 |
| Weekly Growth | +2 | -1 | +0 | +0 |
| Language | Python | TypeScript | Java | Python |
| Sources | 1 | 1 | 1 | 1 |
| License | Apache-2.0 | MIT | Apache-2.0 | Apache-2.0 |
Capability Radar: Curator vs Lumos
Maintenance Activity 100
Last code push 0 days ago.
Community Engagement 83
Fork-to-star ratio: 16.7%. Active community forking and contributing.
Issue Burden 70
Issue data not yet available.
Growth Momentum 48
+2 stars this period (0.13% growth rate).
License Clarity 95
Licensed under Apache-2.0, a permissive license that allows commercial use.
Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.
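The two percentages quoted above (fork-to-star ratio and growth rate) appear to be simple ratios of the displayed counts. The following is a minimal sketch of that arithmetic, not the site's actual scoring code. The helper `percent` is hypothetical, and the star count of 1,500 is an assumption, since the page only shows the rounded value "1.5k".

```python
def percent(part: float, whole: float) -> float:
    """Return part / whole expressed as a percentage."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return 100.0 * part / whole


# Assumed inputs: the page rounds stars to "1.5k", so 1,500 is an approximation.
stars = 1500
forks = 252
weekly_star_gain = 2

fork_to_star = percent(forks, stars)            # forks relative to stars
growth_rate = percent(weekly_star_gain, stars)  # new stars relative to total stars

print(f"Fork-to-star ratio: {fork_to_star:.1f}%")
print(f"Growth rate: {growth_rate:.2f}%")
```

With the assumed 1,500 stars this gives a fork-to-star ratio of 16.8% and a growth rate of 0.13%. The small gap from the reported 16.7% is consistent with the true star count being slightly above 1,500.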