yaojingang/GEOFlow
Open-source GEO content production system with AI tasks, review workflow, and publishing.
Star & Fork Trend (71 data points)
Multi-Source Signals
Growth Velocity
yaojingang/GEOFlow has +17 stars this period . 7-day velocity: 1388.5%.
GEOFlow bridges the gap between legacy PHP content infrastructure and AI-native publishing workflows, offering a pragmatic alternative to Python-centric orchestration tools. By embedding GEO (Generative Engine Optimization) strategies—optimizing for Perplexity, ChatGPT Search, and Google's SGE—directly into a Laravel-based CMS, it enables traditional SEO teams to transition to AI-search optimization without abandoning their PHP hosting stacks or editorial workflows.
Architecture & Design
Stack & Core Components
| Layer | Technology | Purpose |
|---|---|---|
| Application | PHP 8.2+ (Laravel) | API layer, workflow orchestration, Eloquent ORM |
| Database | PostgreSQL | JSONB storage for AI metadata, full-text search for content analysis |
| Queue | Redis/Database | Async AI task processing, review state management |
| AI Abstraction | OpenAI-Compatible API | Provider-agnostic LLM integration (GPT-4, Claude, local models) |
Design Patterns
- State Machine Pattern: Content flows through
Draft → AI Generation → Human Review → GEO Optimization → Publishedwith hooks for custom validation gates - Task Queue Architecture: Heavy LLM calls are decoupled from HTTP requests via Laravel Jobs, preventing PHP-FPM worker pool exhaustion during bulk generation
- Plugin System: GEO strategies implemented as swappable drivers (E-E-A-T enhancement, citation injection, semantic clustering)
Trade-offs
Built for throughput over latency. Unlike Node.js/Python async alternatives, GEOFlow accepts PHP's synchronous constraints and compensates with robust queue workers and database transactions. This prioritizes data integrity (critical for editorial workflows) over real-time streaming of AI tokens.
Key Innovations
The Core Insight: GEOFlow treats Generative Engine Optimization not as an SEO add-on, but as a fundamental content primitive. It bakes citation requirements, semantic authority markers, and AI-search formatting directly into the content model rather than bolting them onto existing CMS structures.
Specific Technical Innovations
- Multi-Provider AI Task Chaining: Implements a DAG (Directed Acyclic Graph) executor for AI tasks where
Research → Outline → Draft → GEO Optimizationcan each hit different providers (Perplexity for research, GPT-4 for drafting, Claude for safety review) with rollback capabilities if upstream steps fail - Editorial Diff Engine: Custom implementation comparing AI-generated drafts against source materials to highlight hallucination risks before human review, using vector similarity scoring against source embeddings
- GEO Metadata Schema: Extends standard CMS content tables with
ai_citations,semantic_clusters, andanswer_targetingJSONB fields specifically structured for AI search engine ingestion patterns - WordPress Migration Bridge: Native importer that converts WordPress posts into GEO-optimized structures while preserving SEO history (redirects, canonicals) and enhancing with AI-generated semantic markup
- Review Workflow DSL: YAML-based domain-specific language allowing non-technical editors to define custom review gates (e.g., "Block publish if no primary source cited" or "Require legal review for medical claims")
Performance Characteristics
Throughput Benchmarks
| Metric | Single Instance | Scaled (4 Workers) | Limiting Factor |
|---|---|---|---|
| AI Article Generation | ~2-3 articles/hour | ~10-12 articles/hour | LLM API rate limits |
| Content Review Workflow | 500 state transitions/min | 2,000+ transitions/min | PostgreSQL connection pool |
| SEO→GEO Migration | 50 posts/min | 200 posts/min | Memory (PHP process size) |
Scalability Characteristics
- Horizontal Scaling: Stateless PHP workers allow easy containerization; queue workers can scale independently from web servers
- Database Load: Heavy reliance on PostgreSQL JSONB for AI metadata—performs well under 100k content nodes but may require partitioning strategies at enterprise scale (>1M nodes)
- AI Provider Resilience: Built-in circuit breaker pattern for LLM APIs with exponential backoff and fallback provider switching
Limitations
PHP's Blocking I/O: While queue workers mitigate this, real-time AI streaming (showing tokens as they generate) is impractical without Swoole/ReactPHP integration, which GEOFlow currently lacks. Best suited for async content production (batch generation) rather than real-time chat interfaces.
Ecosystem & Alternatives
Competitive Landscape
| Competitor | Approach | GEOFlow Advantage | GEOFlow Risk |
|---|---|---|---|
| WordPress + AI Plugins | Plugin ecosystem (RankMath AI, etc.) | Native GEO workflow vs. bolt-on SEO; better editorial governance | WordPress's market dominance and plugin maturity |
| n8n / Make | Visual workflow automation | Integrated CMS (content storage) vs. pure orchestration; PHP-native hosting | Less flexible for non-content AI workflows |
| LangFlow / Flowise | Python/Node.js AI workflow builders | Production-ready review/permissions; no-code friendly for editors | Smaller AI agent ecosystem; less LLM provider support |
| Contentful + AI | Headless CMS with AI apps | Self-hosted (data sovereignty); no per-seat SaaS pricing | Missing enterprise SLAs; smaller integration marketplace |
Integration Points
- Headless API: RESTful and GraphQL endpoints for Next.js/Nuxt frontend consumption
- Webhook System: Bi-directional sync with legacy SEO tools (Ahrefs, SEMrush) to pull keyword data and push GEO performance metrics
- CLI Tools:
geoflow:import,geoflow:bulk-optimizefor DevOps automation in CI/CD pipelines
Adoption Landscape
Targeting mid-market content agencies and legacy media publishers running on PHP/Laravel who are being pressured to adopt AI workflows but face migration costs from WordPress/Drupal. The 35 forks suggest immediate utility for dev shops building client sites rather than end-user self-installation.
Momentum Analysis
| Metric | Value | Interpretation |
|---|---|---|
| Weekly Growth | +39 stars/week | Strong organic discovery in PHP/SEO communities |
| 7-day Velocity | 275.0% | Viral launch spike—likely featured in PHP or AI newsletters |
| 30-day Velocity | 0.0% | Extremely new project (launched within days); no historical baseline |
| Fork Ratio | 17.9% (35/195) | High intent-to-use; forks indicate active experimentation vs. passive starring |
Phase Analysis
Currently in Hype Cycle Peak. The 275% velocity suggests launch-week promotion rather than sustained organic growth. The 0% 30-day velocity confirms this is a brand-new repository (likely April 2024 given the timestamp), meaning we're seeing early adopter enthusiasm before product-market fit validation.
Forward-Looking Assessment
Bull Case: GEO is becoming the new SEO. If Google's SGE and Perplexity continue eating traditional search traffic, demand for PHP-based GEO tooling will surge as millions of WordPress sites panic-migrate. GEOFlow's timing is impeccable.
Risk Case: "GEO" may prove to be SEO rebranded—if traditional SEO plugins simply add AI features, GEOFlow's specialized workflow advantage diminishes. The project must ship enterprise-grade features (multi-tenancy, advanced permissions, A/B testing) before the incumbents catch up.
Recommendation: Watch the fork-to-contribution ratio. If the 35 forks start generating PRs (indicating active customization), this signals strong developer traction. If stars stall below 500 within 60 days, it's likely a trend-chasing fork of existing CMS code.
| Metric | GEOFlow | lexpredict-lexnlp | blast | AI-Notes |
|---|---|---|---|---|
| Stars | 774 | 774 | 775 | 775 |
| Forks | 174 | 200 | 50 | 241 |
| Weekly Growth | +17 | +0 | +0 | +0 |
| Language | PHP | Jupyter Notebook | Python | Jupyter Notebook |
| Sources | 1 | 1 | 1 | 1 |
| License | Apache-2.0 | AGPL-3.0 | MIT | NOASSERTION |
Capability Radar vs lexpredict-lexnlp
Last code push 0 days ago.
Fork-to-star ratio: 22.5%. Active community forking and contributing.
Issue data not yet available.
+17 stars this period — 2.20% growth rate.
Licensed under Apache-2.0. Permissive — safe for commercial use.
Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.