garywelz commited on
Commit
123cf13
·
verified ·
1 Parent(s): dfdfc02

Upload 2 files

Browse files
Files changed (2) hide show
  1. README.md +482 -104
  2. index.html +570 -417
README.md CHANGED
@@ -1,157 +1,535 @@
1
  ---
2
- title: The Programming Framework
3
- emoji: 🛠️
4
- colorFrom: yellow
5
- colorTo: red
6
  sdk: static
7
- pinned: true
8
  license: mit
9
  ---
10
 
11
- # 🛠️ The Programming Framework
12
 
13
- A Universal Method for Process Analysis
14
 
15
- ## 📚 Prior Work & Research Contributions
16
 
17
- ### Overview
18
- The Programming Framework represents **prior work** that demonstrates a novel methodology for analyzing complex processes by combining Large Language Models (LLMs) with visual flowchart representation. This research establishes a universal, domain-agnostic approach to process analysis that transforms textual descriptions into structured, interactive visualizations.
 
19
 
20
- ### 🔬 Research Contributions
21
- - **Universal Process Analysis:** Domain-agnostic methodology applicable across biology, chemistry, software engineering, business processes, and more
22
- - **LLM-Powered Extraction:** Automated extraction of process steps, decision points, and logic flows using Google Gemini 2.0 Flash
23
- - **Structured Visualization:** Mermaid.js-based flowchart generation encoded as JSON for programmatic access and integration
24
- - **Iterative Refinement:** Systematic approach enabling continuous improvement through visualization and LLM-assisted refinement
25
 
26
- ### ⚙️ Technical Achievements
27
- - **Meta-Tool Architecture:** Framework for creating specialized process analysis tools (demonstrated by GLMP)
28
- - **JSON-Based Storage:** Structured data format enabling version control, cross-referencing, and API integration
29
- - **Multi-Domain Application:** Successfully applied to biological processes (GLMP), with extensions planned for software, business, and engineering domains
30
- - **Integration Framework:** Designed for integration with knowledge engines, research databases, and collaborative platforms
31
 
32
- ### 🎯 Position Within CopernicusAI Knowledge Engine
33
- The Programming Framework serves as the **foundational meta-tool** of the CopernicusAI Knowledge Engine, providing the underlying methodology that enables specialized applications:
34
 
35
- - **GLMP (Genome Logic Modeling Project)** - First specialized application demonstrating biological process visualization
36
- - **Copernicus AI** - Main knowledge engine integrating Framework outputs with AI podcasts and research synthesis
37
- - **Research Papers Metadata Database** - Planned integration for linking processes to source literature
38
- - **Science Video Database** - Potential integration for multi-modal process explanations
39
 
40
- This work establishes a proof-of-concept for AI-assisted process analysis, demonstrating how LLMs can systematically extract and visualize complex logic from textual sources across diverse domains.
 
 
41
 
42
- ## 🎯 Overview
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
43
 
44
- The Programming Framework is a **meta-tool**—a tool for creating tools. It provides a systematic method for analyzing any complex process by combining the analytical power of Large Language Models with the clarity of visual flowcharts.
45
 
46
- ## 💡 The Core Idea
47
 
48
- **Problem:** Complex processes are difficult to understand because they involve many steps, decision points, and interactions. Traditional text descriptions are hard to follow.
49
 
50
- **Solution:** Use LLMs to extract process logic from literature, then encode it as Mermaid flowcharts stored in JSON. Result: Clear, interactive visualizations that reveal hidden patterns and enable systematic analysis.
 
 
 
 
51
 
52
- ## ⚙️ How It Works
53
 
54
- 1. **Input Process** - Provide scientific papers, documentation, or process descriptions
55
- 2. **LLM Analysis** - AI extracts steps, decisions, branches, and logic flow
56
- 3. **Generate Flowchart** - Create Mermaid diagram encoded as JSON structure
57
- 4. **Visualize & Iterate** - Interactive flowchart reveals insights and enables refinement
 
 
 
58
 
59
- ## 🌍 Core Principles
60
 
61
- ### Domain Agnostic
62
- Works across any field: biology, chemistry, software engineering, business processes, legal workflows, manufacturing, and beyond.
 
 
 
 
63
 
64
- ### Iterative Refinement
65
- Start with rough analysis, visualize, identify gaps, refine with LLM, repeat until the process logic is crystal clear.
66
 
67
- ### Structured Data
68
- JSON storage enables programmatic access, version control, cross-referencing, and integration with other tools and databases.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
69
 
70
- ## 🚀 Applications
71
 
72
- ### 🧬 GLMP - Genome Logic Modeling (Live)
73
- First specialized application: visualizing biochemical processes like DNA replication, metabolic pathways, and cell signaling.
74
- - [Explore GLMP →](https://huggingface.co/spaces/garywelz/glmp)
75
 
76
- ### 💻 Software Architecture Analysis (Coming Soon)
77
- Map complex software systems, API flows, and data pipelines as flowcharts for better documentation and understanding.
 
 
 
78
 
79
- ### 🏢 Business Process Modeling (In Development)
80
- Analyze organizational workflows, decision trees, and operational procedures for optimization and automation.
 
 
81
 
82
- ### ⚛️ Physics & Engineering (Planned)
83
- Model physical processes, engineering systems, and experimental protocols with step-by-step visual clarity.
84
 
85
- ## 🔧 Technical Architecture
86
 
87
- ### LLM Integration
88
- - Google Gemini 2.0 Flash for analysis
89
- - Vertex AI for enterprise deployment
90
- - Custom prompts for process extraction
91
- - Structured JSON output formatting
92
 
93
- ### Visualization Stack
94
- - Mermaid.js for flowchart rendering
95
- - JSON schema for data validation
96
- - Interactive SVG output
97
- - Export to PNG/PDF supported
98
 
99
- ### Data Storage
100
- - Google Cloud Storage for JSON files
101
- - Firestore for metadata indexing
102
- - Version control with Git
103
- - Cross-referencing with papers database
104
 
105
- ### Integration Points
106
- - GLMP specialized collections
107
- - Copernicus AI knowledge graph
108
- - Research papers database
109
- - API endpoints for programmatic access
110
 
111
- ### How to Cite This Work
 
 
 
 
112
 
113
- Welz, G. (2024–2025). *The Programming Framework: A Universal Method for Process Analysis*.
114
- Hugging Face Spaces. https://huggingface.co/spaces/garywelz/programming_framework
 
 
 
115
 
116
- This project serves as a foundational meta-tool for AI-assisted process analysis, enabling systematic extraction and visualization of complex logic from textual sources across diverse scientific and technical domains.
 
 
 
 
117
 
118
- The Programming Framework is designed as infrastructure for AI-assisted science, providing a universal methodology that can be specialized for domain-specific applications.
 
 
 
 
 
119
 
120
- ## 🔗 Related Projects
 
 
 
 
 
121
 
122
- ### 🧬 GLMP - Genome Logic Modeling
123
- First specialized application of the Programming Framework to biochemical processes. 50+ biological pathways visualized.
124
- - [Visit GLMP →](https://huggingface.co/spaces/garywelz/glmp)
125
 
126
- ### 🔬 Copernicus AI
127
- Knowledge engine integrating the Programming Framework with AI podcasts, research papers, and knowledge graph for scientific discovery.
128
- - [Visit Copernicus AI →](https://huggingface.co/spaces/garywelz/copernicusai)
129
 
130
- ## 🎨 Interactive Demo
131
 
132
- The space includes interactive examples showing the framework applied to:
133
- - Scientific Method
134
- - Software Deployment Pipeline
135
- - Customer Support Workflow
136
- - Research Paper Publication
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
137
 
138
- Each example demonstrates how LLMs extract process logic and encode it as visual flowcharts.
139
 
140
- ## 💻 Technology Stack
141
 
142
- - **LLM**: Google Gemini 2.0 Flash, Vertex AI
143
- - **Visualization**: Mermaid.js
144
- - **Storage**: Google Cloud Storage, Firestore
145
- - **Format**: JSON with Mermaid syntax
146
- - **Frontend**: Static HTML + Tailwind CSS
147
 
148
- ## 🌟 Vision
 
 
 
 
 
 
149
 
150
- As AI systems become more capable of understanding complex processes, the Programming Framework provides the bridge between human comprehension and machine analysis. It's a tool for truth-seeking—transforming complexity into clarity.
151
 
152
  ---
153
 
154
- **A Universal Method for Process Analysis**
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
155
 
156
- © 2025 Gary Welz. All rights reserved.
157
 
 
 
1
  ---
2
+ title: Copernicus AI - Research-Driven Podcast Generation Platform
3
+ emoji: 🔬
4
+ colorFrom: purple
5
+ colorTo: blue
6
  sdk: static
7
+ pinned: false
8
  license: mit
9
  ---
10
 
11
+ # 🔬 Copernicus AI - Knowledge Engine for Scientific Discovery
12
 
13
+ A collaborative research platform that transforms cutting-edge scientific research into accessible, multi-format tools for collective knowledge exploration. These are research instruments—like microscopes for observing the collective knowledge of humanity—enabling hypothesis formation, testing, and discovery across scientific disciplines.
14
 
15
+ ## Prior Work: CopernicusAI Research Interface
16
 
17
+ CopernicusAI is an active research prototype exploring AI-generated audio briefings as an interface for assisted scientific research.
18
+
19
+ The system allows any user to generate, refine, and share AI-generated science podcasts based on structured prompts, enabling rapid orientation to a topic, iterative deepening, and personalized research briefings.
20
 
21
+ Rather than functioning as a static content platform, CopernicusAI supports collectively generated and shared research artifacts, analogous to community-driven knowledge platforms (e.g., discussion forums), but grounded in scientific sources and metadata-aware workflows.
 
 
 
 
22
 
23
+ This work demonstrates technical feasibility for:
24
+ - AI-assisted research briefing and orientation
25
+ - Iterative question refinement via conversational interfaces
26
+ - Integration of text, audio, and metadata in research workflows
 
27
 
28
+ ## 🎯 Mission & Vision
 
29
 
30
+ Inspired by Nicolaus Copernicus who challenged accepted knowledge with evidence and rigorous analysis, **Copernicus AI** creates collaborative research tools that enable collective participation in scientific discovery. These platforms are instruments for exploring humanity's collective knowledge—tools for hypothesis formation, testing, and collaborative research, not just educational content.
 
 
 
31
 
32
+ Just as a microscope enables observation of the microscopic world, Copernicus AI tools enable observation and exploration of humanity's collective knowledge. Subscribers collaborate to prompt, generate, and refine research content—sharing discoveries publicly or keeping them private. As large language models (LLMs) and AI systems gain unprecedented knowledge, Copernicus AI provides the infrastructure for human-AI collaborative knowledge exploration, with evidence-based truth-seeking as our guiding principle.
33
+
34
+ ---
35
 
36
+ ## 🌟 Core Platform Capabilities
37
+
38
+ ### 🎙️ AI-Powered Podcast Generation
39
+
40
+ **Production-Ready System:**
41
+ - Collaborative platform where subscribers prompt and generate multi-voice AI podcasts (5-10 minutes) synthesizing research from multiple academic sources
42
+ - Subscribers can share their podcasts publicly or keep them private
43
+ - Evidence-based content generation requiring minimum 3 research sources per episode
44
+ - Comprehensive research integration across 8+ academic databases
45
+ - **64 episodes** generated across Biology, Chemistry, Computer Science, Mathematics, and Physics
46
+ - Automated audio synthesis with professional multi-speaker dialogue
47
+ - AI-generated episode thumbnails with scientific visualizations
48
+ - RSS feed distribution compatible with Spotify, Apple Podcasts, Google Podcasts
49
+
50
+ **Research Integration:**
51
+ - Real-time discovery from PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, Google Scholar, and News APIs
52
+ - Parallel search across multiple databases for comprehensive coverage
53
+ - Quality scoring and relevance ranking of research sources
54
+ - Paradigm shift identification and interdisciplinary connection analysis
55
+ - Automatic citation extraction and formatting
56
+ - Source validation and authenticity verification
57
+
58
+ ### 🤖 Advanced LLM Integration
59
+
60
+ **Multi-Model Architecture:**
61
+ - **Google Gemini 3** - Latest research analysis and content generation
62
+ - **OpenAI GPT-4/GPT-3.5** - Content synthesis and quality validation
63
+ - **Anthropic Claude 3** (Sonnet, Haiku via OpenRouter) - Alternative reasoning paths
64
+ - **ElevenLabs TTS** - Multi-voice text-to-speech synthesis
65
+ - Model selection based on task complexity and expertise level
66
+ - Fallback chains for reliability and cost optimization
67
+
68
+ **Capabilities:**
69
+ - Multi-paper analysis and synthesis
70
+ - Paradigm shift detection in research domains
71
+ - Interdisciplinary connection identification
72
+ - Entity extraction (genes, proteins, chemical compounds, mathematical concepts)
73
+ - Citation tracking and cross-reference analysis
74
+ - Content quality scoring and validation
75
+
76
+ ### 📊 Research Resource Access
77
+
78
+ **Comprehensive Academic Database Coverage:**
79
+
80
+ Our research pipeline integrates with **8+ major academic databases**, providing access to:
81
+
82
+ - **PubMed/NCBI** (~30+ million biomedical papers)
83
+ - **arXiv** (~2+ million preprints in physics, mathematics, CS, quantitative biology)
84
+ - **NASA ADS** (~15+ million astronomy/astrophysics papers)
85
+ - **Zenodo** (100K+ open science datasets and publications)
86
+ - **bioRxiv/medRxiv** (preprints in life sciences)
87
+ - **CORE** (~200+ million open access papers)
88
+ - **Google Scholar** (comprehensive academic search)
89
+ - **News API** (current events and trending research topics)
90
+ - **YouTube Data API** (academic videos, conference talks, lectures)
91
+
92
+ **Total Access:** **250+ million research papers and academic resources** across all major scientific disciplines.
93
+
94
+ ### 🎙️ Audio and Video Podcast Production
95
+
96
+ **Operating Audio Podcast System:**
97
+ Full production and distribution platform for subscriber-generated podcasts. Users can prompt, generate, publish, and distribute audio podcasts with RSS feed support for Spotify, Apple Podcasts, and Google Podcasts.
98
+
99
+ - Multi-voice AI podcast generation
100
+ - Research-driven content creation
101
+ - RSS feed distribution
102
+ - Public and private podcast options
103
+ - Professional audio quality
104
+
105
+ **Video Production (Future - Phase 2+):**
106
+
107
+ Advanced video features planned for future development:
108
+
109
+ **Planned Advanced Features (Phase 2-4):**
110
+ - **Visual Content Integration:**
111
+ - Automated extraction of figures and diagrams from research papers
112
+ - Screen capture and processing of academic illustrations
113
+ - Web scraping from scientific journal websites and preprint servers
114
+ - JSON database integration for structured visual data
115
+
116
+ - **Dynamic Visualization Generation:**
117
+ - On-the-fly scientific animations (molecular structures, data flows, algorithms)
118
+ - Real-time chart and graph generation from research data
119
+ - Python-based animations using matplotlib, plotly, mayavi
120
+ - Mathematical formula rendering (LaTeX → video)
121
+
122
+ - **External Video Quoting:**
123
+ - YouTube video segment extraction and integration
124
+ - Time-stamped video quoting with proper attribution
125
+ - Educational fair use compliance
126
+ - Source video discovery during research phase
127
+
128
+ - **Advanced Composition:**
129
+ - Multi-layer video composition (background, content, overlays, effects)
130
+ - Automatic subtitle generation from transcripts
131
+ - Text overlay system (key concepts, citations, speaker identification)
132
+ - Professional transitions and effects
133
+ - Audio-visual synchronization
134
+
135
+ **See:** [Science Video Database](https://huggingface.co/spaces/garywelz/sciencevideodb) - Companion project for research video content management.
136
+
137
+ ### 📚 Research Papers Metadata Database (Phase 2)
138
+
139
+ **Planned Implementation:**
140
+ A centralized **metadata repository** (not a file archive) that provides:
141
+
142
+ - **Structured JSON Objects:** Research paper metadata including:
143
+ - DOI, arXiv ID, publication information
144
+ - Abstracts and key findings
145
+ - Extracted entities (genes, proteins, chemical compounds, equations)
146
+ - Citation networks and cross-references
147
+ - Paradigm shift indicators
148
+ - Interdisciplinary connections
149
+ - Quality scores and relevance metrics
150
+
151
+ - **AI-Powered Preprocessing:**
152
+ - LLM-based entity extraction and annotation
153
+ - Automatic categorization by discipline and subdomain
154
+ - Keyword extraction and semantic tagging
155
+ - Citation tracking and relationship mapping
156
+ - Quality assessment and validation
157
+
158
+ - **Integration Features:**
159
+ - DOI/arXiv ID resolution and metadata enrichment
160
+ - Cross-reference linking between papers
161
+ - Podcast-to-paper relationship tracking
162
+ - Search and query capabilities
163
+ - API access for programmatic retrieval
164
+
165
+ **Technical Architecture:**
166
+ - Firestore NoSQL database for flexible JSON storage
167
+ - Google Cloud Functions for automated metadata processing
168
+ - Vertex AI for entity extraction and analysis
169
+ - RESTful API for external access
170
+
171
+ **Benefits:**
172
+ - Enables rapid research discovery across podcasts
173
+ - Supports knowledge graph construction
174
+ - Facilitates cross-disciplinary pattern recognition
175
+ - Provides foundation for semantic search capabilities
176
 
177
+ ---
178
 
179
+ ## 🗄️ System Architecture
180
 
181
+ ### Database Structure (Firestore)
182
 
183
+ **Collections:**
184
+ - **`subscribers`** - User accounts, preferences, subscription tiers, usage analytics
185
+ - **`podcast_jobs`** - Generated podcasts with full metadata, source papers, engagement metrics
186
+ - **`episodes`** - Published episodes with RSS distribution status
187
+ - **`research_papers`** (Phase 2) - Paper metadata database with AI-extracted entities
188
 
189
+ ### Storage Structure (Google Cloud Storage)
190
 
191
+ - **`audio/`** - MP3 podcast files (multi-voice ElevenLabs synthesis)
192
+ - **`videos/`** - MP4 video podcasts (current and future)
193
+ - **`transcripts/`** - Full text transcripts with speaker markers
194
+ - **`descriptions/`** - Markdown descriptions with academic references
195
+ - **`thumbnails/`** - AI-generated episode artwork (DALL-E 3)
196
+ - **`video-assets/`** - Extracted figures, animations, visual content
197
+ - **`glmp-v2/`** - Genome Logic Modeling Project flowcharts (JSON)
198
 
199
+ ### Backend Services (Google Cloud Run)
200
 
201
+ **Microservices Architecture:**
202
+ - **Podcast Generation Service** - Orchestrates research, content generation, and media production
203
+ - **Research Pipeline Service** - Multi-API academic search and analysis
204
+ - **Video Generation Service** - Video composition and encoding (Phase 1 complete)
205
+ - **RSS Service** - Feed generation and distribution
206
+ - **Episode Service** - Catalog management and metadata
207
 
208
+ ---
 
209
 
210
+ ## ⚙️ Technology Stack
211
+
212
+ ### AI & Machine Learning
213
+ - **Google Gemini 3** - Latest LLM for research analysis
214
+ - **Google Vertex AI** - Enterprise-scale model deployment and orchestration (used throughout platform)
215
+ - **OpenAI GPT-4/GPT-3.5** - Content synthesis and validation
216
+ - **Anthropic Claude 3** - Alternative reasoning via OpenRouter
217
+ - **ElevenLabs TTS** - Multi-voice text-to-speech synthesis
218
+ - **DALL-E 3** - AI-generated scientific visualizations
219
+ - **Google Cloud Vision API** - Image analysis and quality assessment
220
+ - **Video Intelligence API** - Scene detection and content analysis
221
+
222
+ ### Backend Infrastructure
223
+ - **FastAPI** (Python) - RESTful API framework
224
+ - **Google Cloud Run** - Serverless container deployment
225
+ - **Firestore** - NoSQL document database
226
+ - **Cloud Storage** - Media file storage and CDN
227
+ - **Cloud Functions** - Event-driven processing
228
+ - **Cloud Tasks** - Background job queuing
229
+ - **Secret Manager** - API key and credential management
230
+
231
+ ### Media Processing
232
+ - **FFmpeg** - Video encoding and composition
233
+ - **MoviePy** - Python video editing (planned)
234
+ - **Matplotlib/Plotly** - Scientific visualization (planned)
235
+ - **PyPDF2/pdfplumber** - PDF processing (planned)
236
+
237
+ ### Frontend
238
+ - **Next.js 15.5.7** - React framework
239
+ - **Alpine.js** - Lightweight reactive UI
240
+ - **Tailwind CSS** - Utility-first styling
241
+ - **Vercel** - Frontend hosting and deployment
242
 
243
+ ---
244
 
245
+ ## 📈 Platform Capabilities
 
 
246
 
247
+ ### Research Coverage
248
+ - **250+ million research papers** accessible through integrated APIs
249
+ - **8+ academic databases** integrated with parallel search
250
+ - **Minimum 3 sources** required per episode for quality assurance
251
+ - **Multi-paper analysis** for comprehensive coverage
252
 
253
+ ### Platform Features
254
+ - **Subscriber-driven content generation** - Users prompt and create podcasts
255
+ - **RSS feed distribution** to major podcast platforms
256
+ - **Public and private podcast options** - Share discoveries or keep them private
257
 
258
+ ---
 
259
 
260
+ ## 🔗 Live Platform & Resources
261
 
262
+ ### Production Deployment
263
+ - 🏠 **[Homepage - Browse Podcasts](https://www.copernicusai.fyi)** - Public podcast catalog
264
+ - 📊 **[Creator Dashboard](https://www.copernicusai.fyi/subscriber-dashboard.html)** - Subscriber interface
265
+ - 📡 **[RSS Feed](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml)** - Podcast distribution feed
 
266
 
267
+ ## 🧩 CopernicusAI Knowledge Engine Components
 
 
 
 
268
 
269
+ The CopernicusAI Knowledge Engine is an integrated ecosystem of research and collaboration tools. Currently, the platform includes five core components, with additional tools, databases, and collaboration features planned for future development:
 
 
 
 
270
 
271
+ ### Core Components
 
 
 
 
272
 
273
+ 1. **🔬 Copernicus AI (This Platform)** - Core synthesis and distribution component
274
+ - AI-powered research synthesis and podcast generation
275
+ - Multi-API research integration (250+ million papers)
276
+ - Subscriber-driven content creation and sharing
277
+ - RSS feed distribution and platform management
278
 
279
+ 2. **🛠️ Programming Framework** - Foundational meta-tool
280
+ - Universal method for process analysis across any discipline
281
+ - LLM-powered extraction and Mermaid visualization
282
+ - Domain-agnostic methodology for complex process analysis
283
+ - [Explore Framework →](https://huggingface.co/spaces/garywelz/programming_framework)
284
 
285
+ 3. **🧬 GLMP - Genome Logic Modeling Project** - Specialized biological application
286
+ - First application of Programming Framework to biology
287
+ - 50+ biological processes visualized as interactive flowcharts
288
+ - JSON-based structured data in Google Cloud Storage
289
+ - [Explore GLMP →](https://huggingface.co/spaces/garywelz/glmp)
290
 
291
+ 4. **📚 Research Paper Metadata Database** - Core data infrastructure
292
+ - Centralized metadata repository for scientific research papers
293
+ - AI-powered preprocessing and entity extraction
294
+ - Citation network analysis and relationship mapping
295
+ - Foundation for knowledge graph construction
296
+ - [Explore Metadata Database →](https://huggingface.co/spaces/garywelz/metadata_database)
297
 
298
+ 5. **🎬 Science Video Database** - Multi-modal content component
299
+ - Curated searchable database of scientific video content
300
+ - Transcript-based search across multiple disciplines
301
+ - Integration with YouTube and other video sources
302
+ - [Explore Video Database →](https://huggingface.co/spaces/garywelz/sciencevideodb)
303
+ - [Live Demo →](https://scienceviddb-web-204731194849.us-central1.run.app/)
304
 
305
+ ### Future Components
 
 
306
 
307
+ The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
 
 
308
 
309
+ ---
310
 
311
+ ## 🔌 API Documentation
312
+
313
+ **Base URL:** `https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app`
314
+
315
+ ### Podcast Generation Endpoints
316
+ - `POST /generate-podcast-with-subscriber` - Generate new podcast from research topic
317
+ - `GET /api/subscribers/podcasts/{id}` - Retrieve podcast details
318
+ - `POST /api/subscribers/podcasts/submit-to-rss` - Publish to RSS feed
319
+
320
+ ### Research Endpoints
321
+ - `POST /api/papers/upload` - Upload paper metadata (Phase 2)
322
+ - `GET /api/papers/{paper_id}` - Retrieve paper metadata
323
+ - `POST /api/papers/query` - Query papers by discipline, keywords
324
+ - `POST /api/papers/{id}/link-podcast/{id}` - Link paper to podcast
325
+
326
+ ### Admin Endpoints
327
+ - `GET /api/admin/subscribers` - List all subscribers and statistics
328
+ - `POST /api/admin/podcasts/fix-missing-titles` - Content maintenance
329
+ - `GET /api/admin/podcasts/catalog` - Full podcast catalog
330
+
331
+ ---
332
+
333
+ ## 🚀 Development Roadmap
334
+
335
+ ### ✅ Phase 1: Core Platform (Complete)
336
+ - Multi-API research integration
337
+ - AI podcast generation with multi-voice synthesis
338
+ - RSS feed distribution
339
+ - Subscriber platform
340
+ - Basic video generation (static)
341
+
342
+ ### 🔄 Phase 2: Content Enhancement (In Progress)
343
+ - **Research Papers Metadata Database** - JSON-based metadata repository
344
+ - **Visual Content Extraction** - Figures from papers, web scraping
345
+ - **YouTube Video Quoting** - External video integration with attribution
346
+ - **Advanced Video Features** - Multi-layer composition, animations
347
+
348
+ ### 📋 Phase 3: Advanced Visualizations (Planned)
349
+ - Scientific animation generation (matplotlib, plotly)
350
+ - Real-time data visualization
351
+ - Mathematical formula rendering
352
+ - Dynamic graph and network visualizations
353
+
354
+ ### 🎯 Phase 4: Knowledge Integration (Planned)
355
+ - Knowledge graph construction
356
+ - Semantic search capabilities
357
+ - Cross-disciplinary pattern discovery
358
+ - AI-powered content recommendations
359
+
360
+ ---
361
 
362
+ ## 🔬 Collaborative Research Tools
363
 
364
+ **These platforms enable collective participation and collaboration across diverse user communities:**
365
 
366
+ - **Researchers** - Tools for hypothesis formation and testing, rapid synthesis of cross-disciplinary findings
367
+ - **Collaborators** - Collective knowledge exploration and refinement
368
+ - **Subscribers** - Prompt, generate, and share podcasts (public or private)
369
+ - **Community** - User suggestions, comments, and collaborative flowchart improvement (GLMP)
 
370
 
371
+ **Key Innovations:**
372
+ - **Multi-Source Validation** - Requires minimum 3 research sources per episode
373
+ - **Evidence-Based Generation** - No content generated without research backing
374
+ - **Paradigm Shift Detection** - Identifies revolutionary vs. incremental research
375
+ - **Interdisciplinary Connections** - Reveals cross-domain insights
376
+ - **Collaborative Participation** - User-driven content generation and sharing
377
+ - **Reproducibility** - Full citation tracking and source attribution
378
 
379
+ > *Like a microscope enables observation of the microscopic world, these tools enable observation and exploration of humanity's collective knowledge.*
380
 
381
  ---
382
 
383
+ ## 📚 Prior Work & Research Contributions
384
+
385
+ ### Overview
386
+ This platform represents **prior work** that demonstrates foundational research and development achievements in AI-powered scientific knowledge synthesis, collaborative research tools, and multi-modal content generation. These contributions establish the technical foundation and proof-of-concept for the broader **CopernicusAI Knowledge Engine** initiative.
387
+
388
+ ### Research Contributions
389
+
390
+ **1. AI-Powered Research Synthesis System**
391
+ - Developed and deployed a production-ready system for multi-source research synthesis using LLMs
392
+ - Demonstrated integration of 8+ academic databases (250+ million papers) with parallel search capabilities
393
+ - Implemented evidence-based content generation requiring minimum 3 research sources per output
394
+ - Achieved operational deployment with 64+ generated podcast episodes across 5 scientific disciplines
395
+
396
+ **2. Multi-Model LLM Architecture**
397
+ - Designed and implemented intelligent model selection framework using Google Gemini 3, OpenAI GPT-4, and Anthropic Claude 3
398
+ - Developed fallback chains for reliability and cost optimization
399
+ - Demonstrated paradigm shift detection and interdisciplinary connection identification in research domains
400
+ - Implemented entity extraction (genes, proteins, chemical compounds, mathematical concepts) from research literature
401
+
402
+ **3. Collaborative Research Platform Infrastructure**
403
+ - Built subscriber-driven content generation system enabling public/private research sharing
404
+ - Implemented RSS feed distribution compatible with major podcast platforms
405
+ - Developed microservices architecture on Google Cloud Run with Firestore and Cloud Storage
406
+ - Created RESTful API framework for programmatic access to research synthesis capabilities
407
+
408
+ **4. Integration with Knowledge Engine Components**
409
+ - Established integration pathways with GLMP (Genome Logic Modeling Project) for biological process visualization
410
+ - Designed architecture for Research Papers Metadata Database (Phase 2)
411
+ - Planned integration with Science Video Database for multi-modal content
412
+ - Created framework for Programming Framework integration across disciplines
413
+
414
+ ### Technical Achievements
415
+
416
+ **Production Deployment:**
417
+ - Live platform: https://www.copernicusai.fyi
418
+ - Operational API: https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app
419
+ - RSS feed distribution: Active and functional
420
+ - Multi-voice audio synthesis: ElevenLabs TTS integration operational
421
+
422
+ **Research Infrastructure:**
423
+ - 250+ million research papers accessible via integrated APIs
424
+ - 8+ academic database integrations (PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, Google Scholar, News API)
425
+ - Automated citation extraction and formatting
426
+ - Quality scoring and relevance ranking systems
427
+
428
+ **Scalability & Architecture:**
429
+ - Serverless microservices architecture (Google Cloud Run)
430
+ - NoSQL database (Firestore) for flexible metadata storage
431
+ - Cloud Storage for media files and structured data
432
+ - Event-driven processing with Cloud Functions and Cloud Tasks
433
+
434
+ ### Position Within CopernicusAI Knowledge Engine
435
+
436
+ This platform serves as the **core synthesis and distribution component** of the CopernicusAI Knowledge Engine. The Knowledge Engine is an integrated ecosystem of research and collaboration tools that work together to assist scientists in their workflow, from research discovery through knowledge synthesis to multi-format content generation.
437
+
438
+ **Current Components:**
439
+ 1. **Copernicus AI** (This platform) - Core synthesis and distribution component for AI-powered research synthesis and podcast generation
440
+ 2. **Programming Framework** - Foundational meta-tool providing universal process analysis methodology
441
+ 3. **GLMP (Genome Logic Modeling Project)** - Specialized biological application demonstrating domain-specific use of the Programming Framework
442
+ 4. **Research Paper Metadata Database** - Core data infrastructure providing structured metadata and citation networks
443
+ 5. **Science Video Database** - Multi-modal content component enabling video-based learning and research discovery
444
+
445
+ **Future Development:**
446
+ The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
447
+
448
+ ### Academic & Research Impact
449
+
450
+ **Publications & Presentations:**
451
+ - Platform architecture and methodology suitable for academic publication
452
+ - Open-source components available for research community use
453
+ - Publicly accessible research tools demonstrating AI-human collaboration in scientific knowledge synthesis
454
+
455
+ **Research Applications:**
456
+ - Supports hypothesis formation and testing through rapid multi-source synthesis
457
+ - Enables cross-disciplinary pattern recognition and connection identification
458
+ - Facilitates reproducible research communication with full citation tracking
459
+ - Provides infrastructure for collaborative knowledge exploration
460
+
461
+ **Educational Contributions:**
462
+ - 64+ research-driven podcast episodes across Biology, Chemistry, Computer Science, Mathematics, and Physics
463
+ - Evidence-based content requiring minimum 3 academic sources
464
+ - Public and private sharing options for research dissemination
465
+ - Integration with major podcast platforms for broad accessibility
466
+
467
+ ### Citation Information
468
+
469
+ **For Grant Proposals:**
470
+ When citing this work as prior research, please reference:
471
+
472
+ - **Platform Name:** Copernicus AI - Knowledge Engine for Scientific Discovery
473
+ - **URL:** https://huggingface.co/spaces/garywelz/copernicusai
474
+ - **Live Platform:** https://www.copernicusai.fyi
475
+ - **Primary Developer:** Gary Welz
476
+ - **Year:** 2024-2025
477
+ - **License:** MIT
478
+
479
+ **Suggested Citation Format:**
480
+ ```
481
+ Welz, G. (2025). Copernicus AI: Knowledge Engine for Scientific Discovery.
482
+ Hugging Face Space. https://huggingface.co/spaces/garywelz/copernicusai
483
+ ```
484
+
485
+ ## 🌐 Grant Support & Collaboration
486
+
487
+ **Grant Applications Supported:**
488
+ This platform is designed to support grant applications to:
489
+ - **NSF (National Science Foundation)** - Science education and research infrastructure
490
+ - **DOE (Department of Energy)** - Scientific computing and data science
491
+ - **SAIR Foundation** - AI research and development initiatives
492
+
493
+ **Research Contributions:**
494
+ - Open-source components and methodologies
495
+ - Publicly accessible research tools
496
+ - Educational content for broader scientific literacy
497
+ - Infrastructure for reproducible research communication
498
+
499
+ **Collaboration Opportunities:**
500
+ - Integration with academic institutions
501
+ - Partnership with research organizations
502
+ - Open data initiatives
503
+ - Educational program development
504
+
505
+ ---
506
+
507
+ ## How to Cite This Work
508
+
509
+ Welz, G. (2024–2025). *CopernicusAI: AI-Generated Audio Briefings as a Research Interface*.
510
+ Hugging Face Spaces. https://huggingface.co/spaces/garywelz/copernicusai
511
+
512
+ ---
513
+
514
+ ## 📄 License & Attribution
515
+
516
+ **License:** MIT
517
+
518
+ **Attributions:**
519
+ - Built with Google Cloud Platform, Gemini AI, OpenAI, Anthropic Claude, and ElevenLabs
520
+ - Research data from PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, and Google Scholar
521
+ - Academic paper metadata from respective publishers
522
+
523
+ ---
524
+
525
+ ## 📧 Contact & Support
526
+
527
+ For questions, collaboration inquiries, or grant application support:
528
+ - **Hugging Face Space:** [https://huggingface.co/spaces/garywelz/copernicusai](https://huggingface.co/spaces/garywelz/copernicusai)
529
+ - **Platform:** [https://www.copernicusai.fyi](https://www.copernicusai.fyi)
530
+
531
+ ---
532
 
533
+ **© 2025 Copernicus AI. All rights reserved.**
534
 
535
+ *Advancing scientific knowledge through AI-powered research communication and discovery.*
index.html CHANGED
@@ -3,12 +3,11 @@
3
  <head>
4
  <meta charset="UTF-8">
5
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
6
- <title>The Programming Framework - Universal Process Analysis</title>
7
  <script src="https://cdn.tailwindcss.com"></script>
8
- <script src="https://cdn.jsdelivr.net/npm/mermaid/dist/mermaid.min.js"></script>
9
  <style>
10
  .gradient-bg {
11
- background: linear-gradient(135deg, #f59e0b 0%, #d97706 100%);
12
  }
13
  .card-hover {
14
  transition: transform 0.3s ease, box-shadow 0.3s ease;
@@ -17,6 +16,13 @@
17
  transform: translateY(-4px);
18
  box-shadow: 0 20px 40px rgba(0,0,0,0.15);
19
  }
 
 
 
 
 
 
 
20
  </style>
21
  </head>
22
  <body class="bg-gray-50">
@@ -24,512 +30,659 @@
24
  <header class="gradient-bg text-white">
25
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
26
  <div class="text-center">
27
- <div class="text-6xl mb-4">🛠️</div>
28
- <h1 class="text-5xl font-bold mb-4">The Programming Framework</h1>
29
- <p class="text-xl opacity-90 mb-6">A Universal Method for Process Analysis</p>
30
- <p class="text-lg opacity-75 max-w-3xl mx-auto">
31
- Combining Large Language Models with Mermaid visualization to dissect and understand
32
- complex processes across any discipline—from biology to business, physics to psychology.
 
 
33
  </p>
34
  </div>
35
  </div>
36
  </header>
37
 
38
- <!-- Prior Work & Research Contributions -->
39
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
40
- <div class="bg-gradient-to-r from-orange-50 to-yellow-50 rounded-xl shadow-lg p-8 mb-8">
41
- <h2 class="text-3xl font-bold text-gray-900 mb-6">📚 Prior Work & Research Contributions</h2>
42
-
43
- <div class="bg-white rounded-lg p-6 mb-6">
44
- <h3 class="text-xl font-semibold text-gray-900 mb-4">Overview</h3>
45
- <p class="text-gray-700 mb-4">
46
- The Programming Framework represents <strong>prior work</strong> that demonstrates a novel methodology for analyzing complex processes by combining Large Language Models (LLMs) with visual flowchart representation. This research establishes a universal, domain-agnostic approach to process analysis that transforms textual descriptions into structured, interactive visualizations.
47
- </p>
 
 
 
 
 
 
 
 
 
 
48
  </div>
 
 
49
 
50
- <div class="grid md:grid-cols-2 gap-6 mb-6">
51
- <div class="bg-white rounded-lg p-6">
52
- <h3 class="text-lg font-semibold text-gray-900 mb-3">🔬 Research Contributions</h3>
53
- <ul class="text-sm text-gray-700 space-y-2">
54
- <li>• <strong>Universal Process Analysis:</strong> Domain-agnostic methodology applicable across multiple fields</li>
55
- <li>• <strong>LLM-Powered Extraction:</strong> Automated extraction using Google Gemini 2.0 Flash</li>
56
- <li>• <strong>Structured Visualization:</strong> Mermaid.js-based flowchart generation encoded as JSON</li>
57
- <li>• <strong>Iterative Refinement:</strong> Systematic approach enabling continuous improvement</li>
58
- </ul>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
59
  </div>
60
 
61
- <div class="bg-white rounded-lg p-6">
62
- <h3 class="text-lg font-semibold text-gray-900 mb-3">⚙️ Technical Achievements</h3>
63
- <ul class="text-sm text-gray-700 space-y-2">
64
- <li>• <strong>Meta-Tool Architecture:</strong> Framework for creating specialized analysis tools</li>
65
- <li>• <strong>JSON-Based Storage:</strong> Structured format enabling version control and API integration</li>
66
- <li>• <strong>Multi-Domain Application:</strong> Successfully applied to biological processes (GLMP)</li>
67
- <li>• <strong>Integration Framework:</strong> Designed for knowledge engines and collaborative platforms</li>
68
- </ul>
69
  </div>
70
- </div>
71
-
72
- <div class="bg-white rounded-lg p-6">
73
- <h3 class="text-lg font-semibold text-gray-900 mb-3">🎯 Position Within CopernicusAI Knowledge Engine</h3>
74
- <p class="text-gray-700 mb-3">
75
- The Programming Framework serves as the <strong>foundational meta-tool</strong> of the CopernicusAI Knowledge Engine, providing the underlying methodology that enables specialized applications:
76
- </p>
77
- <div class="grid md:grid-cols-2 gap-4 text-sm mb-3">
78
- <ul class="text-gray-700 space-y-1">
79
- <li>• GLMP (Genome Logic Modeling Project)</li>
80
- <li>• Copernicus AI (main knowledge engine)</li>
81
- <li>• Research Papers Metadata Database</li>
82
- </ul>
83
- <ul class="text-gray-700 space-y-1">
84
- <li>• Science Video Database</li>
85
- <li>• Multi-domain process analysis</li>
86
- </ul>
 
 
 
 
 
 
 
 
 
87
  </div>
88
- <p class="text-gray-600 text-sm italic">
89
- This work establishes a proof-of-concept for AI-assisted process analysis, demonstrating how LLMs can systematically extract and visualize complex logic from textual sources across diverse domains.
90
- </p>
91
  </div>
92
  </div>
93
  </section>
94
 
95
- <!-- Key Stats -->
96
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 -mt-8">
97
- <div class="grid md:grid-cols-4 gap-4">
98
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
99
- <div class="text-3xl font-bold text-orange-600">Any</div>
100
- <div class="text-sm text-gray-600">Discipline</div>
101
- </div>
102
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
103
- <div class="text-3xl font-bold text-blue-600">LLM</div>
104
- <div class="text-sm text-gray-600">Powered</div>
105
- </div>
106
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
107
- <div class="text-3xl font-bold text-green-600">Visual</div>
108
- <div class="text-sm text-gray-600">Flowcharts</div>
109
  </div>
110
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
111
- <div class="text-3xl font-bold text-purple-600">JSON</div>
112
- <div class="text-sm text-gray-600">Structured Data</div>
 
113
  </div>
114
  </div>
115
  </section>
116
 
117
- <!-- What is the Framework -->
118
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
119
- <div class="bg-white rounded-xl shadow-lg p-8">
120
- <h2 class="text-3xl font-bold text-gray-900 mb-6">🎯 What is the Programming Framework?</h2>
121
- <div class="prose max-w-none">
122
- <p class="text-lg text-gray-700 mb-4">
123
- The Programming Framework is a <strong>meta-tool</strong>—a tool for creating tools. It provides a
124
- systematic method for analyzing any complex process by combining the analytical power of Large Language
125
- Models with the clarity of visual flowcharts.
126
- </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
127
 
128
- <div class="grid md:grid-cols-2 gap-6 mt-6">
129
- <div class="bg-orange-50 rounded-lg p-6">
130
- <h3 class="text-xl font-semibold text-gray-900 mb-3">🔍 The Problem</h3>
131
- <p class="text-gray-700">
132
- Complex processes—whether biological, computational, or organizational—are difficult to
133
- understand because they involve many steps, decision points, and interactions. Traditional
134
- descriptions in text are hard to follow.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
135
  </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
136
  </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
137
 
138
- <div class="bg-blue-50 rounded-lg p-6">
139
- <h3 class="text-xl font-semibold text-gray-900 mb-3">✨ The Solution</h3>
140
- <p class="text-gray-700">
141
- Use LLMs to extract process logic from literature, then encode it as Mermaid flowcharts
142
- stored in JSON. Result: Clear, interactive visualizations that reveal hidden patterns and
143
- enable systematic analysis.
 
 
 
144
  </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
145
  </div>
146
  </div>
147
  </div>
148
  </div>
149
  </section>
150
 
151
- <!-- How It Works -->
152
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
153
- <div class="bg-gradient-to-r from-orange-50 to-yellow-50 rounded-xl p-8">
154
- <h2 class="text-3xl font-bold text-gray-900 mb-8 text-center">⚙️ How It Works</h2>
155
 
156
- <div class="grid md:grid-cols-4 gap-6">
157
- <div class="bg-white rounded-lg p-6 text-center">
158
- <div class="text-4xl mb-3">1️⃣</div>
159
- <h3 class="font-semibold text-gray-900 mb-2">Input Process</h3>
160
- <p class="text-sm text-gray-600">Provide scientific papers, documentation, or process descriptions</p>
161
- </div>
162
-
163
- <div class="bg-white rounded-lg p-6 text-center">
164
- <div class="text-4xl mb-3">2️⃣</div>
165
- <h3 class="font-semibold text-gray-900 mb-2">LLM Analysis</h3>
166
- <p class="text-sm text-gray-600">AI extracts steps, decisions, branches, and logic flow</p>
 
 
167
  </div>
168
-
169
- <div class="bg-white rounded-lg p-6 text-center">
170
- <div class="text-4xl mb-3">3️⃣</div>
171
- <h3 class="font-semibold text-gray-900 mb-2">Generate Flowchart</h3>
172
- <p class="text-sm text-gray-600">Create Mermaid diagram encoded as JSON structure</p>
 
 
 
 
 
 
 
173
  </div>
174
-
175
- <div class="bg-white rounded-lg p-6 text-center">
176
- <div class="text-4xl mb-3">4️⃣</div>
177
- <h3 class="font-semibold text-gray-900 mb-2">Visualize & Iterate</h3>
178
- <p class="text-sm text-gray-600">Interactive flowchart reveals insights and enables refinement</p>
 
 
 
 
179
  </div>
180
  </div>
 
 
181
 
182
- <div class="mt-8 bg-white rounded-lg p-6">
183
- <h4 class="font-semibold text-gray-900 mb-3">📊 Live Example:</h4>
184
- <div id="exampleDiagram" class="mermaid">
185
- graph TD
186
- A[Complex Process Input] --> B{LLM Analysis}
187
- B -->|Extract Logic| C[Identify Steps]
188
- B -->|Extract Decisions| D[Identify Branches]
189
- C --> E[Create Flowchart Nodes]
190
- D --> F[Create Decision Points]
191
- E --> G[Generate Mermaid Syntax]
192
- F --> G
193
- G --> H[Store as JSON]
194
- H --> I[Interactive Visualization]
195
- I --> J{Insights Gained?}
196
- J -->|No| K[Refine Analysis]
197
- J -->|Yes| L[Apply Knowledge]
198
- K --> B
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
199
  </div>
200
  </div>
201
  </div>
202
  </section>
203
 
204
- <!-- Core Principles -->
205
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
206
- <h2 class="text-3xl font-bold text-gray-900 mb-6">💡 Core Principles</h2>
207
-
208
- <div class="grid md:grid-cols-3 gap-6">
209
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
210
- <div class="text-4xl mb-3">🌍</div>
211
- <h3 class="text-xl font-semibold text-gray-900 mb-3">Domain Agnostic</h3>
212
- <p class="text-gray-600">
213
- Works across any field: biology, chemistry, software engineering, business processes,
214
- legal workflows, manufacturing, and beyond.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
215
  </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
216
  </div>
217
 
218
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
219
- <div class="text-4xl mb-3">🔄</div>
220
- <h3 class="text-xl font-semibold text-gray-900 mb-3">Iterative Refinement</h3>
221
- <p class="text-gray-600">
222
- Start with rough analysis, visualize, identify gaps, refine with LLM, repeat until
223
- the process logic is crystal clear.
224
  </p>
 
 
 
 
 
225
  </div>
 
 
226
 
227
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
228
- <div class="text-4xl mb-3">📦</div>
229
- <h3 class="text-xl font-semibold text-gray-900 mb-3">Structured Data</h3>
230
- <p class="text-gray-600">
231
- JSON storage enables programmatic access, version control, cross-referencing,
232
- and integration with other tools and databases.
 
 
233
  </p>
234
  </div>
235
  </div>
236
  </section>
 
 
237
 
238
- <!-- Applications -->
239
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
240
  <div class="bg-white rounded-xl shadow-lg p-8">
241
- <h2 class="text-3xl font-bold text-gray-900 mb-6">🚀 Applications</h2>
242
 
243
- <div class="grid md:grid-cols-2 gap-6">
244
- <div class="border-l-4 border-green-500 pl-4 py-3">
245
- <h3 class="font-semibold text-gray-900 mb-2">🧬 GLMP - Genome Logic Modeling</h3>
246
- <p class="text-gray-600 mb-2">
247
- First specialized application: visualizing biochemical processes like DNA replication,
248
- metabolic pathways, and cell signaling.
249
- </p>
250
- <a href="https://huggingface.co/spaces/garywelz/glmp" class="text-green-600 hover:text-green-700 text-sm font-semibold">
251
- Explore GLMP
252
- </a>
253
- </div>
254
-
255
- <div class="border-l-4 border-blue-500 pl-4 py-3">
256
- <h3 class="font-semibold text-gray-900 mb-2">💻 Software Architecture Analysis</h3>
257
- <p class="text-gray-600 mb-2">
258
- Map complex software systems, API flows, and data pipelines as flowcharts for
259
- better documentation and understanding.
260
- </p>
261
- <span class="text-blue-600 text-sm font-semibold">Coming Soon</span>
262
- </div>
263
-
264
- <div class="border-l-4 border-purple-500 pl-4 py-3">
265
- <h3 class="font-semibold text-gray-900 mb-2">🏢 Business Process Modeling</h3>
266
- <p class="text-gray-600 mb-2">
267
- Analyze organizational workflows, decision trees, and operational procedures
268
- for optimization and automation.
269
- </p>
270
- <span class="text-purple-600 text-sm font-semibold">In Development</span>
271
- </div>
272
-
273
- <div class="border-l-4 border-orange-500 pl-4 py-3">
274
- <h3 class="font-semibold text-gray-900 mb-2">⚛️ Physics & Engineering</h3>
275
- <p class="text-gray-600 mb-2">
276
- Model physical processes, engineering systems, and experimental protocols
277
- with step-by-step visual clarity.
278
- </p>
279
- <span class="text-orange-600 text-sm font-semibold">Planned</span>
280
  </div>
281
  </div>
 
 
 
 
 
 
 
 
 
 
282
  </div>
283
  </section>
284
 
285
- <!-- Interactive Demo -->
286
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
287
  <div class="bg-gradient-to-r from-blue-50 to-purple-50 rounded-xl p-8">
288
- <h2 class="text-3xl font-bold text-gray-900 mb-6">🎨 Try It Yourself</h2>
289
 
290
- <div class="bg-white rounded-lg p-6">
291
- <div class="mb-4">
292
- <label class="block text-sm font-medium text-gray-700 mb-2">Select a Sample Process to Analyze:</label>
293
- <select id="sampleSelector" class="w-full px-4 py-2 border border-gray-300 rounded-lg focus:ring-2 focus:ring-orange-500">
294
- <option value="">-- Choose a sample process --</option>
295
- <option value="scientific-method">Scientific Method</option>
296
- <option value="software-deployment">Software Deployment Pipeline</option>
297
- <option value="customer-support">Customer Support Workflow</option>
298
- <option value="research-publication">Research Paper Publication</option>
299
- </select>
 
 
 
 
 
 
 
 
 
 
 
 
 
300
  </div>
301
-
302
- <div id="demoContainer" class="mt-6 p-6 bg-gray-50 rounded-lg min-h-64">
303
- <div class="text-center text-gray-500 py-12">
304
- <div class="text-4xl mb-4">🛠️</div>
305
- <p>Select a sample process to see the Programming Framework in action</p>
306
- </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
307
  </div>
308
  </div>
309
  </div>
310
  </section>
311
 
312
- <!-- Technical Details -->
313
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
314
  <div class="bg-gray-900 text-white rounded-xl p-8">
315
- <h2 class="text-3xl font-bold mb-6">⚙️ Technical Architecture</h2>
 
316
 
317
- <div class="grid md:grid-cols-2 gap-8">
318
- <div>
319
- <h3 class="text-xl font-semibold text-orange-400 mb-4">🤖 LLM Integration</h3>
320
- <ul class="space-y-2 text-gray-300">
321
- <li>• Google Gemini 2.0 Flash for analysis</li>
322
- <li>• Vertex AI for enterprise deployment</li>
323
- <li>• Custom prompts for process extraction</li>
324
- <li>• Structured JSON output formatting</li>
325
- </ul>
326
- </div>
327
-
328
  <div>
329
- <h3 class="text-xl font-semibold text-orange-400 mb-4">📊 Visualization Stack</h3>
330
- <ul class="space-y-2 text-gray-300">
331
- <li>• Mermaid.js for flowchart rendering</li>
332
- <li>• JSON schema for data validation</li>
333
- <li>• Interactive SVG output</li>
334
- <li>• Export to PNG/PDF supported</li>
335
  </ul>
336
  </div>
337
-
338
  <div>
339
- <h3 class="text-xl font-semibold text-orange-400 mb-4">💾 Data Storage</h3>
340
- <ul class="space-y-2 text-gray-300">
341
- <li>• Google Cloud Storage for JSON files</li>
342
- <li>• Firestore for metadata indexing</li>
343
- <li>• Version control with Git</li>
344
- <li>• Cross-referencing with papers database</li>
345
  </ul>
346
  </div>
347
-
348
  <div>
349
- <h3 class="text-xl font-semibold text-orange-400 mb-4">🔗 Integration Points</h3>
350
- <ul class="space-y-2 text-gray-300">
351
- <li>• GLMP specialized collections</li>
352
- <li>• Copernicus AI knowledge graph</li>
353
- <li>• Research papers database</li>
354
- <li>• API endpoints for programmatic access</li>
355
  </ul>
356
  </div>
357
  </div>
358
  </div>
359
  </section>
360
 
361
- <!-- Related Projects -->
362
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
363
- <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">🔗 Related Projects</h2>
364
-
365
- <div class="grid md:grid-cols-2 gap-6">
366
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
367
- <h3 class="text-xl font-semibold text-gray-900 mb-3">🧬 GLMP - Genome Logic Modeling</h3>
368
- <p class="text-gray-600 mb-4">
369
- First specialized application of the Programming Framework to biochemical processes.
370
- 50+ biological pathways visualized as interactive flowcharts.
371
- </p>
372
- <a href="https://huggingface.co/spaces/garywelz/glmp"
373
- class="text-orange-600 hover:text-orange-700 font-semibold"
374
- target="_blank" rel="noopener noreferrer">
375
- Explore GLMP →
376
- </a>
377
- </div>
378
-
379
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
380
- <h3 class="text-xl font-semibold text-gray-900 mb-3">🔬 Copernicus AI</h3>
381
- <p class="text-gray-600 mb-4">
382
- Knowledge engine integrating the Programming Framework with AI podcasts, research papers,
383
- and knowledge graph for scientific discovery.
384
- </p>
385
- <a href="https://www.copernicusai.fyi"
386
- class="text-orange-600 hover:text-orange-700 font-semibold"
387
- target="_blank" rel="noopener noreferrer">
388
- Visit Copernicus AI →
389
- </a>
390
- </div>
391
- </div>
392
- </section>
393
-
394
- <!-- How to Cite This Work -->
395
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
396
- <div class="bg-white rounded-xl shadow-lg p-8">
397
- <h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
398
- <div class="bg-gray-50 rounded-lg p-6 mb-4">
399
- <p class="text-gray-800 font-mono text-lg leading-relaxed mb-4">
400
- Welz, G. (2024–2025). <em>The Programming Framework: A Universal Method for Process Analysis</em>.<br>
401
- Hugging Face Spaces. https://huggingface.co/spaces/garywelz/programming_framework
402
- </p>
403
- </div>
404
- <div class="bg-orange-50 rounded-lg p-4">
405
- <p class="text-gray-700 mb-2">
406
- This project serves as a foundational meta-tool for AI-assisted process analysis, enabling systematic extraction and visualization of complex logic from textual sources across diverse scientific and technical domains.
407
- </p>
408
- <p class="text-gray-700 font-semibold">
409
- The Programming Framework is designed as infrastructure for AI-assisted science, providing a universal methodology that can be specialized for domain-specific applications.
410
- </p>
411
- </div>
412
- </div>
413
- </section>
414
-
415
  <!-- Footer -->
416
  <footer class="gradient-bg text-white py-8 mt-12">
417
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
418
- <p class="text-lg font-semibold mb-2">The Programming Framework</p>
419
- <p class="text-sm opacity-75">A Universal Method for Process Analysis</p>
420
- <p class="text-xs opacity-50 mt-4">&copy; 2025 Gary Welz. All rights reserved.</p>
421
  </div>
422
  </footer>
423
-
424
- <script>
425
- mermaid.initialize({ startOnLoad: true, theme: 'default' });
426
-
427
- document.getElementById('sampleSelector').addEventListener('change', (e) => {
428
- const process = e.target.value;
429
- const container = document.getElementById('demoContainer');
430
-
431
- if (!process) {
432
- container.innerHTML = `
433
- <div class="text-center text-gray-500 py-12">
434
- <div class="text-4xl mb-4">🛠️</div>
435
- <p>Select a sample process to see the Programming Framework in action</p>
436
- </div>
437
- `;
438
- return;
439
- }
440
-
441
- const diagrams = {
442
- 'scientific-method': `
443
- graph TD
444
- A[Observation] --> B[Question]
445
- B --> C[Hypothesis]
446
- C --> D[Design Experiment]
447
- D --> E[Collect Data]
448
- E --> F{Analyze Results}
449
- F -->|Support Hypothesis| G[Report Findings]
450
- F -->|Reject Hypothesis| H[Revise Hypothesis]
451
- H --> D
452
- G --> I[Peer Review]
453
- I --> J{Accepted?}
454
- J -->|Yes| K[Publish]
455
- J -->|No| L[Address Feedback]
456
- L --> I
457
- `,
458
- 'software-deployment': `
459
- graph TD
460
- A[Code Commit] --> B[Run Tests]
461
- B --> C{Tests Pass?}
462
- C -->|No| D[Fix Issues]
463
- C -->|Yes| E[Build Artifact]
464
- D --> A
465
- E --> F[Deploy to Staging]
466
- F --> G[QA Testing]
467
- G --> H{QA Approved?}
468
- H -->|No| I[Bug Fixes]
469
- H -->|Yes| J[Deploy to Production]
470
- I --> A
471
- J --> K[Monitor]
472
- K --> L{Issues Detected?}
473
- L -->|Yes| M[Rollback]
474
- L -->|No| N[Success]
475
- `,
476
- 'customer-support': `
477
- graph TD
478
- A[Ticket Created] --> B[Assign to Agent]
479
- B --> C[Initial Assessment]
480
- C --> D{Issue Type?}
481
- D -->|Technical| E[Tech Team]
482
- D -->|Billing| F[Finance Team]
483
- D -->|General| G[Support Agent]
484
- E --> H[Investigate]
485
- F --> I[Review Account]
486
- G --> J[Provide Solution]
487
- H --> K{Resolved?}
488
- I --> K
489
- J --> K
490
- K -->|Yes| L[Close Ticket]
491
- K -->|No| M[Escalate]
492
- M --> N[Senior Team]
493
- N --> H
494
- `,
495
- 'research-publication': `
496
- graph TD
497
- A[Research Complete] --> B[Write Manuscript]
498
- B --> C[Internal Review]
499
- C --> D{Ready?}
500
- D -->|No| E[Revise]
501
- D -->|Yes| F[Submit to Journal]
502
- E --> B
503
- F --> G[Editor Review]
504
- G --> H{Desk Reject?}
505
- H -->|Yes| I[Choose Different Journal]
506
- H -->|No| J[Peer Review]
507
- I --> F
508
- J --> K{Reviewer Decision?}
509
- K -->|Reject| I
510
- K -->|Major Revisions| L[Address Comments]
511
- K -->|Minor Revisions| M[Quick Fixes]
512
- K -->|Accept| N[Publication]
513
- L --> F
514
- M --> F
515
- `
516
- };
517
-
518
- container.innerHTML = `
519
- <div class="mb-4">
520
- <h3 class="font-semibold text-gray-900 mb-2">${e.target.options[e.target.selectedIndex].text}</h3>
521
- <p class="text-sm text-gray-600">Visualized using the Programming Framework</p>
522
- </div>
523
- <div class="mermaid">${diagrams[process]}</div>
524
- <div class="mt-4 p-3 bg-orange-50 rounded text-sm text-gray-700">
525
- <strong>💡 Framework Applied:</strong> LLM extracted process steps, decision points, and flows.
526
- Result encoded as Mermaid flowchart in JSON for reusability.
527
- </div>
528
- `;
529
-
530
- mermaid.init(undefined, container.querySelectorAll('.mermaid'));
531
- });
532
- </script>
533
  </body>
534
  </html>
535
-
 
3
  <head>
4
  <meta charset="UTF-8">
5
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
6
+ <title>Copernicus AI - Research-Driven Podcast Generation Platform</title>
7
  <script src="https://cdn.tailwindcss.com"></script>
 
8
  <style>
9
  .gradient-bg {
10
+ background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
11
  }
12
  .card-hover {
13
  transition: transform 0.3s ease, box-shadow 0.3s ease;
 
16
  transform: translateY(-4px);
17
  box-shadow: 0 20px 40px rgba(0,0,0,0.15);
18
  }
19
+ .stat-number {
20
+ font-size: 2.5rem;
21
+ font-weight: bold;
22
+ background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
23
+ -webkit-background-clip: text;
24
+ -webkit-text-fill-color: transparent;
25
+ }
26
  </style>
27
  </head>
28
  <body class="bg-gray-50">
 
30
  <header class="gradient-bg text-white">
31
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
32
  <div class="text-center">
33
+ <div class="text-6xl mb-4">🔬</div>
34
+ <h1 class="text-5xl font-bold mb-4">Copernicus AI</h1>
35
+ <p class="text-xl opacity-90 mb-6">Knowledge Engine for Scientific Discovery</p>
36
+ <p class="text-lg opacity-75 max-w-4xl mx-auto">
37
+ A collaborative research platform that transforms cutting-edge scientific research into accessible,
38
+ multi-format tools for collective knowledge exploration. These are research instruments—like microscopes
39
+ for observing the collective knowledge of humanity—enabling hypothesis formation, testing, and discovery
40
+ across scientific disciplines.
41
  </p>
42
  </div>
43
  </div>
44
  </header>
45
 
46
+ <!-- Prior Work: CopernicusAI Research Interface -->
47
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
48
+ <div class="bg-gradient-to-r from-purple-50 to-blue-50 rounded-xl shadow-lg p-8 mb-8">
49
+ <h2 class="text-3xl font-bold text-gray-900 mb-4">Prior Work: CopernicusAI Research Interface</h2>
50
+ <p class="text-lg text-gray-700 leading-relaxed mb-4">
51
+ CopernicusAI is an active research prototype exploring AI-generated audio briefings as an interface for assisted scientific research.
52
+ </p>
53
+ <p class="text-gray-700 mb-4">
54
+ The system allows any user to generate, refine, and share AI-generated science podcasts based on structured prompts, enabling rapid orientation to a topic, iterative deepening, and personalized research briefings.
55
+ </p>
56
+ <p class="text-gray-700 mb-4">
57
+ Rather than functioning as a static content platform, CopernicusAI supports collectively generated and shared research artifacts, analogous to community-driven knowledge platforms (e.g., discussion forums), but grounded in scientific sources and metadata-aware workflows.
58
+ </p>
59
+ <div class="bg-white rounded-lg p-4 mt-4">
60
+ <h3 class="font-semibold text-gray-900 mb-2">This work demonstrates technical feasibility for:</h3>
61
+ <ul class="text-gray-700 space-y-1">
62
+ <li>• AI-assisted research briefing and orientation</li>
63
+ <li>• Iterative question refinement via conversational interfaces</li>
64
+ <li>• Integration of text, audio, and metadata in research workflows</li>
65
+ </ul>
66
  </div>
67
+ </div>
68
+ </section>
69
 
70
+ <!-- Mission & Vision -->
71
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
72
+ <div class="bg-white rounded-xl shadow-lg p-8 mb-8">
73
+ <h2 class="text-3xl font-bold text-gray-900 mb-4">🎯 Mission & Vision</h2>
74
+ <p class="text-lg text-gray-700 leading-relaxed mb-4">
75
+ Inspired by Nicolaus Copernicus who challenged accepted knowledge with evidence and rigorous analysis,
76
+ <strong>Copernicus AI</strong> creates collaborative research tools that enable collective participation in
77
+ scientific discovery. These platforms are instruments for exploring humanity's collective knowledge—tools for
78
+ hypothesis formation, testing, and collaborative research, not just educational content.
79
+ </p>
80
+ <p class="text-gray-600">
81
+ Just as a microscope enables observation of the microscopic world, Copernicus AI tools enable observation and
82
+ exploration of humanity's collective knowledge. Subscribers collaborate to prompt, generate, and refine research
83
+ content—sharing discoveries publicly or keeping them private. As large language models (LLMs) and AI systems
84
+ gain unprecedented knowledge, Copernicus AI provides the infrastructure for human-AI collaborative knowledge
85
+ exploration, with evidence-based truth-seeking as our guiding principle.
86
+ </p>
87
+ </div>
88
+ </section>
89
+
90
+ <!-- Knowledge Engine Ecosystem -->
91
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
92
+ <div class="bg-gradient-to-r from-purple-50 to-indigo-50 rounded-xl shadow-lg p-8 mb-8">
93
+ <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">🧩 CopernicusAI Knowledge Engine</h2>
94
+ <p class="text-lg text-gray-700 mb-6 text-center max-w-4xl mx-auto">
95
+ An integrated ecosystem of research and collaboration tools designed to assist scientists in their workflow,
96
+ from research discovery through knowledge synthesis to multi-format content generation.
97
+ </p>
98
+
99
+ <div class="grid md:grid-cols-2 lg:grid-cols-3 gap-6 mb-6">
100
+ <div class="bg-white rounded-lg p-6 card-hover">
101
+ <div class="text-3xl mb-3">🔬</div>
102
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Copernicus AI</h3>
103
+ <p class="text-sm text-gray-600 mb-3">Core synthesis & distribution platform for AI-powered research and podcast generation</p>
104
+ <p class="text-xs text-gray-500 italic">This platform</p>
105
  </div>
106
 
107
+ <div class="bg-white rounded-lg p-6 card-hover">
108
+ <div class="text-3xl mb-3">🛠️</div>
109
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Programming Framework</h3>
110
+ <p class="text-sm text-gray-600 mb-3">Foundational meta-tool for universal process analysis across disciplines</p>
111
+ <a href="https://huggingface.co/spaces/garywelz/programming_framework" target="_blank" class="text-xs text-blue-600 hover:underline">Explore →</a>
 
 
 
112
  </div>
113
+
114
+ <div class="bg-white rounded-lg p-6 card-hover">
115
+ <div class="text-3xl mb-3">🧬</div>
116
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">GLMP</h3>
117
+ <p class="text-sm text-gray-600 mb-3">Biological process visualization - 50+ processes mapped</p>
118
+ <a href="https://huggingface.co/spaces/garywelz/glmp" target="_blank" class="text-xs text-blue-600 hover:underline">Explore →</a>
119
+ </div>
120
+
121
+ <div class="bg-white rounded-lg p-6 card-hover">
122
+ <div class="text-3xl mb-3">📚</div>
123
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Metadata Database</h3>
124
+ <p class="text-sm text-gray-600 mb-3">Core data infrastructure for research paper metadata and citation networks</p>
125
+ <a href="https://huggingface.co/spaces/garywelz/metadata_database" target="_blank" class="text-xs text-blue-600 hover:underline">Explore →</a>
126
+ </div>
127
+
128
+ <div class="bg-white rounded-lg p-6 card-hover">
129
+ <div class="text-3xl mb-3">🎬</div>
130
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Video Database</h3>
131
+ <p class="text-sm text-gray-600 mb-3">Multi-modal content with transcript-based search for scientific videos</p>
132
+ <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" target="_blank" class="text-xs text-blue-600 hover:underline">Explore →</a>
133
+ </div>
134
+
135
+ <div class="bg-white rounded-lg p-6 card-hover border-2 border-dashed border-gray-300">
136
+ <div class="text-3xl mb-3 opacity-50">➕</div>
137
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Future Components</h3>
138
+ <p class="text-sm text-gray-600 mb-3">Additional tools, databases, and collaboration features will be added as the project develops</p>
139
  </div>
 
 
 
140
  </div>
141
  </div>
142
  </section>
143
 
144
+ <!-- Key Statistics -->
145
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
146
+ <div class="grid md:grid-cols-2 gap-6 mb-12">
147
+ <div class="bg-white rounded-lg shadow-md p-6 text-center">
148
+ <div class="stat-number mb-2">250+</div>
149
+ <div class="text-gray-600 font-semibold">Million Papers</div>
150
+ <div class="text-sm text-gray-500 mt-1">Accessible via APIs</div>
 
 
 
 
 
 
 
151
  </div>
152
+ <div class="bg-white rounded-lg shadow-md p-6 text-center">
153
+ <div class="stat-number mb-2">8+</div>
154
+ <div class="text-gray-600 font-semibold">Academic Databases</div>
155
+ <div class="text-sm text-gray-500 mt-1">Integrated research sources</div>
156
  </div>
157
  </div>
158
  </section>
159
 
160
+ <!-- Core Platform Capabilities -->
161
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
162
+ <h2 class="text-3xl font-bold text-gray-900 mb-8 text-center">🌟 Core Platform Capabilities</h2>
163
+
164
+ <div class="space-y-8">
165
+ <!-- AI Podcast Generation -->
166
+ <div class="bg-white rounded-xl shadow-lg p-8">
167
+ <div class="flex items-start mb-4">
168
+ <span class="text-4xl mr-4">🎙️</span>
169
+ <div class="flex-1">
170
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">AI-Powered Podcast Generation</h3>
171
+ <p class="text-gray-600 mb-4">
172
+ Collaborative research platform where subscribers prompt and generate multi-voice AI podcasts
173
+ (5-10 minutes) synthesizing research from multiple academic sources. Subscribers can share their
174
+ podcasts publicly or keep them private. Evidence-based content generation requiring minimum 3
175
+ research sources per episode.
176
+ </p>
177
+ <div class="grid md:grid-cols-2 gap-4 mt-4">
178
+ <div>
179
+ <h4 class="font-semibold text-gray-800 mb-2">Key Features:</h4>
180
+ <ul class="text-sm text-gray-600 space-y-1">
181
+ <li>✓ Comprehensive research integration (8+ databases)</li>
182
+ <li>✓ Professional multi-speaker dialogue</li>
183
+ <li>✓ AI-generated scientific visualizations</li>
184
+ <li>✓ RSS feed distribution</li>
185
+ <li>✓ Quality scoring & relevance ranking</li>
186
+ <li>✓ Paradigm shift identification</li>
187
+ </ul>
188
+ </div>
189
+ <div>
190
+ <h4 class="font-semibold text-gray-800 mb-2">Research Integration:</h4>
191
+ <ul class="text-sm text-gray-600 space-y-1">
192
+ <li>✓ Real-time discovery from 8+ APIs</li>
193
+ <li>✓ Parallel search across databases</li>
194
+ <li>✓ Automatic citation extraction</li>
195
+ <li>✓ Source validation & verification</li>
196
+ <li>✓ Interdisciplinary connection analysis</li>
197
+ </ul>
198
+ </div>
199
+ </div>
200
+ </div>
201
+ </div>
202
+ </div>
203
 
204
+ <!-- LLM Integration -->
205
+ <div class="bg-white rounded-xl shadow-lg p-8">
206
+ <div class="flex items-start mb-4">
207
+ <span class="text-4xl mr-4">🤖</span>
208
+ <div class="flex-1">
209
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Advanced LLM Integration</h3>
210
+ <p class="text-gray-600 mb-4">Multi-model architecture with intelligent model selection:</p>
211
+ <div class="grid md:grid-cols-2 gap-4">
212
+ <div>
213
+ <h4 class="font-semibold text-gray-800 mb-2">Primary Models:</h4>
214
+ <ul class="text-sm text-gray-600 space-y-1">
215
+ <li>• <strong>Google Gemini 3</strong> - Latest research analysis and content generation</li>
216
+ <li>• <strong>OpenAI GPT-4/GPT-3.5</strong> - Content synthesis and quality validation</li>
217
+ <li>• <strong>Anthropic Claude 3</strong> (Sonnet, Haiku) - Alternative reasoning paths</li>
218
+ <li>• <strong>ElevenLabs TTS</strong> - Multi-voice text-to-speech synthesis</li>
219
+ </ul>
220
+ </div>
221
+ <div>
222
+ <h4 class="font-semibold text-gray-800 mb-2">Capabilities:</h4>
223
+ <ul class="text-sm text-gray-600 space-y-1">
224
+ <li>• Multi-paper analysis & synthesis</li>
225
+ <li>• Paradigm shift detection</li>
226
+ <li>• Entity extraction (genes, proteins, compounds)</li>
227
+ <li>• Citation tracking & cross-references</li>
228
+ <li>• Content quality scoring</li>
229
+ </ul>
230
+ </div>
231
+ </div>
232
+ </div>
233
+ </div>
234
+ </div>
235
+
236
+ <!-- Research Resources -->
237
+ <div class="bg-white rounded-xl shadow-lg p-8">
238
+ <div class="flex items-start mb-4">
239
+ <span class="text-4xl mr-4">📊</span>
240
+ <div class="flex-1">
241
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Research Resource Access</h3>
242
+ <p class="text-gray-600 mb-4">
243
+ Comprehensive academic database coverage with <strong>250+ million research papers</strong> accessible
244
+ through integrated APIs.
245
  </p>
246
+ <div class="grid md:grid-cols-2 gap-4">
247
+ <div>
248
+ <h4 class="font-semibold text-gray-800 mb-2">Academic Databases:</h4>
249
+ <ul class="text-sm text-gray-600 space-y-1">
250
+ <li>• PubMed/NCBI (~30+ million papers)</li>
251
+ <li>• arXiv (~2+ million preprints)</li>
252
+ <li>• NASA ADS (~15+ million papers)</li>
253
+ <li>• Zenodo (100K+ datasets)</li>
254
+ <li>• bioRxiv/medRxiv (preprints)</li>
255
+ <li>• CORE (~200+ million papers)</li>
256
+ <li>• Google Scholar (comprehensive)</li>
257
+ <li>• News API (current events)</li>
258
+ <li>• YouTube Data API (academic videos)</li>
259
+ </ul>
260
+ </div>
261
+ </div>
262
  </div>
263
+ </div>
264
+ </div>
265
+
266
+ <!-- Audio and Video Podcast Production -->
267
+ <div class="bg-white rounded-xl shadow-lg p-8">
268
+ <div class="flex items-start mb-4">
269
+ <span class="text-4xl mr-4">🎙️</span>
270
+ <div class="flex-1">
271
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Audio and Video Podcast Production</h3>
272
+ <p class="text-gray-600 mb-4">
273
+ <strong>Operating Audio Podcast System:</strong> Full production and distribution platform for subscriber-generated
274
+ podcasts. Users can prompt, generate, publish, and distribute audio podcasts with RSS feed support for
275
+ Spotify, Apple Podcasts, and Google Podcasts.
276
+ </p>
277
+ <div class="bg-green-50 rounded-lg p-4 mb-4">
278
+ <h4 class="font-semibold text-gray-800 mb-2">Current Audio Capabilities (Operational):</h4>
279
+ <ul class="text-sm text-gray-700 space-y-1">
280
+ <li>✓ Multi-voice AI podcast generation</li>
281
+ <li>✓ Research-driven content creation</li>
282
+ <li>✓ RSS feed distribution</li>
283
+ <li>✓ Public and private podcast options</li>
284
+ <li>✓ Professional audio quality</li>
285
+ </ul>
286
+ </div>
287
+ <div class="bg-blue-50 rounded-lg p-4 mt-4">
288
+ <h4 class="font-semibold text-gray-800 mb-2">Video Production (Future - Phase 2+):</h4>
289
+ <p class="text-sm text-gray-700 mb-2">Advanced video features planned for future development:</p>
290
+ <ul class="text-sm text-gray-700 space-y-2">
291
+ <li>• <strong>Visual Content Integration:</strong> Automated extraction from papers, web scraping, JSON database integration</li>
292
+ <li>• <strong>Dynamic Visualizations:</strong> Scientific animations, real-time charts, LaTeX rendering</li>
293
+ <li>• <strong>External Video Quoting:</strong> YouTube segment extraction with attribution & fair use compliance</li>
294
+ <li>• <strong>Advanced Composition:</strong> Multi-layer video, auto subtitles, text overlays, professional transitions</li>
295
+ </ul>
296
+ <p class="text-xs text-gray-600 mt-2">
297
+ See: <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" class="text-blue-600 hover:underline">Science Video Database</a> - Companion project for research video content management.
298
+ </p>
299
+ </div>
300
+ </div>
301
+ </div>
302
+ </div>
303
 
304
+ <!-- Research Papers Metadata Database -->
305
+ <div class="bg-white rounded-xl shadow-lg p-8">
306
+ <div class="flex items-start mb-4">
307
+ <span class="text-4xl mr-4">📚</span>
308
+ <div class="flex-1">
309
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Research Papers Metadata Database (Phase 2)</h3>
310
+ <p class="text-gray-600 mb-4">
311
+ A centralized <strong>metadata repository</strong> (not a file archive) providing structured JSON objects
312
+ with AI-powered preprocessing.
313
  </p>
314
+ <div class="grid md:grid-cols-2 gap-4">
315
+ <div>
316
+ <h4 class="font-semibold text-gray-800 mb-2">Structured JSON Objects:</h4>
317
+ <ul class="text-sm text-gray-600 space-y-1">
318
+ <li>• DOI, arXiv ID, publication info</li>
319
+ <li>• Abstracts & key findings</li>
320
+ <li>• Extracted entities (genes, proteins, compounds, equations)</li>
321
+ <li>• Citation networks & cross-references</li>
322
+ <li>• Paradigm shift indicators</li>
323
+ <li>• Quality scores & relevance metrics</li>
324
+ </ul>
325
+ </div>
326
+ <div>
327
+ <h4 class="font-semibold text-gray-800 mb-2">AI-Powered Preprocessing:</h4>
328
+ <ul class="text-sm text-gray-600 space-y-1">
329
+ <li>• LLM-based entity extraction</li>
330
+ <li>• Automatic categorization</li>
331
+ <li>• Keyword extraction & semantic tagging</li>
332
+ <li>• Citation tracking & mapping</li>
333
+ <li>• Quality assessment</li>
334
+ <li>• RESTful API access</li>
335
+ </ul>
336
+ </div>
337
+ </div>
338
  </div>
339
  </div>
340
  </div>
341
  </div>
342
  </section>
343
 
344
+ <!-- Technology Stack -->
345
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
346
+ <div class="bg-white rounded-xl shadow-lg p-8">
347
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">⚙️ Technology Stack</h2>
348
 
349
+ <div class="grid md:grid-cols-3 gap-6 mb-6">
350
+ <div>
351
+ <h3 class="text-lg font-semibold text-gray-800 mb-3">AI & Machine Learning</h3>
352
+ <ul class="text-sm text-gray-600 space-y-1">
353
+ <li>• Google Gemini 3</li>
354
+ <li>• Google Vertex AI (model orchestration)</li>
355
+ <li>• OpenAI GPT-4/GPT-3.5</li>
356
+ <li>• Anthropic Claude 3</li>
357
+ <li>• ElevenLabs TTS</li>
358
+ <li>• DALL-E 3</li>
359
+ <li>• Cloud Vision API</li>
360
+ <li>• Video Intelligence API</li>
361
+ </ul>
362
  </div>
363
+
364
+ <div>
365
+ <h3 class="text-lg font-semibold text-gray-800 mb-3">Backend Infrastructure</h3>
366
+ <ul class="text-sm text-gray-600 space-y-1">
367
+ <li>• FastAPI (Python)</li>
368
+ <li>• Google Cloud Run</li>
369
+ <li>• Firestore (NoSQL)</li>
370
+ <li>• Cloud Storage</li>
371
+ <li>• Cloud Functions</li>
372
+ <li>• Cloud Tasks</li>
373
+ <li>• Secret Manager</li>
374
+ </ul>
375
  </div>
376
+
377
+ <div>
378
+ <h3 class="text-lg font-semibold text-gray-800 mb-3">Frontend</h3>
379
+ <ul class="text-sm text-gray-600 space-y-1">
380
+ <li>• Next.js 15.5.7</li>
381
+ <li>• Alpine.js</li>
382
+ <li>• Tailwind CSS</li>
383
+ <li>• Vercel</li>
384
+ </ul>
385
  </div>
386
  </div>
387
+ </div>
388
+ </section>
389
 
390
+ <!-- Research & Collaborative Tools -->
391
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
392
+ <div class="bg-gradient-to-r from-green-50 to-blue-50 rounded-xl p-8">
393
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">🔬 Collaborative Research Tools</h2>
394
+
395
+ <div class="grid md:grid-cols-2 gap-6 mb-6">
396
+ <div>
397
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Collaborative Research Tools</h3>
398
+ <p class="text-gray-700 mb-3">
399
+ These platforms enable collective participation and collaboration across diverse user communities:
400
+ </p>
401
+ <ul class="text-gray-700 space-y-2">
402
+ <li>• <strong>Researchers</strong> - Tools for hypothesis formation and testing, cross-disciplinary synthesis</li>
403
+ <li>• <strong>Collaborators</strong> - Collective knowledge exploration and refinement</li>
404
+ <li>• <strong>Subscribers</strong> - Prompt, generate, and share podcasts (public or private)</li>
405
+ <li>• <strong>Community</strong> - User suggestions, comments, and collaborative flowchart improvement (GLMP)</li>
406
+ </ul>
407
+ <p class="text-gray-600 mt-4 italic">
408
+ Like a microscope enables observation of the microscopic world, these tools enable observation and
409
+ exploration of humanity's collective knowledge.
410
+ </p>
411
+ </div>
412
+
413
+ <div>
414
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Key Innovations</h3>
415
+ <ul class="text-gray-700 space-y-2">
416
+ <li>• Multi-source validation (min 3 sources)</li>
417
+ <li>• Evidence-based generation</li>
418
+ <li>• Paradigm shift detection</li>
419
+ <li>• Interdisciplinary connections</li>
420
+ <li>• Multiple expertise levels</li>
421
+ <li>• Full citation tracking</li>
422
+ </ul>
423
  </div>
424
  </div>
425
  </div>
426
  </section>
427
 
428
+ <!-- Prior Work & Research Contributions -->
429
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
430
+ <div class="bg-gradient-to-r from-purple-50 to-blue-50 rounded-xl shadow-lg p-8">
431
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">📚 Prior Work & Research Contributions</h2>
432
+
433
+ <div class="bg-white rounded-lg p-6 mb-6">
434
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Overview</h3>
435
+ <p class="text-gray-700 mb-4">
436
+ This platform represents <strong>prior work</strong> that demonstrates foundational research and development
437
+ achievements in AI-powered scientific knowledge synthesis, collaborative research tools, and multi-modal content
438
+ generation. These contributions establish the technical foundation and proof-of-concept for the broader
439
+ <strong>CopernicusAI Knowledge Engine</strong> initiative.
440
+ </p>
441
+ </div>
442
+
443
+ <div class="grid md:grid-cols-2 gap-6 mb-6">
444
+ <div class="bg-white rounded-lg p-6">
445
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">🔬 Research Contributions</h3>
446
+ <ul class="text-sm text-gray-700 space-y-2">
447
+ <li>• <strong>AI-Powered Research Synthesis:</strong> Production system for multi-source research synthesis using LLMs</li>
448
+ <li>• <strong>Multi-Model Architecture:</strong> Intelligent model selection with Gemini 3, GPT-4, Claude 3</li>
449
+ <li>• <strong>Collaborative Platform:</strong> Subscriber-driven content generation with public/private sharing</li>
450
+ <li>• <strong>Knowledge Engine Integration:</strong> Architecture for Research Papers DB, Video DB, GLMP, Framework</li>
451
+ </ul>
452
+ </div>
453
+
454
+ <div class="bg-white rounded-lg p-6">
455
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">⚙️ Technical Achievements</h3>
456
+ <ul class="text-sm text-gray-700 space-y-2">
457
+ <li>• <strong>250+ Million Papers:</strong> Accessible via 8+ integrated academic databases</li>
458
+ <li>• <strong>64+ Episodes:</strong> Generated across 5 scientific disciplines</li>
459
+ <li>• <strong>Production Deployment:</strong> Live platform with operational API and RSS distribution</li>
460
+ <li>• <strong>Scalable Architecture:</strong> Serverless microservices on Google Cloud</li>
461
+ </ul>
462
+ </div>
463
+ </div>
464
+
465
+ <div class="bg-white rounded-lg p-6 mb-6">
466
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">🎯 Position Within CopernicusAI Knowledge Engine</h3>
467
+ <p class="text-gray-700 mb-3">
468
+ This platform serves as the <strong>core synthesis and distribution component</strong> of the CopernicusAI Knowledge Engine.
469
+ The Knowledge Engine is an integrated ecosystem of research and collaboration tools that work together to assist scientists
470
+ in their workflow, from research discovery through knowledge synthesis to multi-format content generation.
471
  </p>
472
+ <div class="bg-blue-50 rounded-lg p-4 mb-3">
473
+ <h4 class="font-semibold text-gray-900 mb-2">Current Components:</h4>
474
+ <div class="grid md:grid-cols-2 gap-4 text-sm">
475
+ <ul class="text-gray-700 space-y-1">
476
+ <li>1. <strong>Copernicus AI</strong> (This platform) - Core synthesis & distribution</li>
477
+ <li>2. <strong>Programming Framework</strong> - Foundational meta-tool</li>
478
+ <li>3. <strong>GLMP</strong> - Biological process visualization</li>
479
+ </ul>
480
+ <ul class="text-gray-700 space-y-1">
481
+ <li>4. <strong>Research Paper Metadata Database</strong> - Data infrastructure</li>
482
+ <li>5. <strong>Science Video Database</strong> - Multi-modal content</li>
483
+ </ul>
484
+ </div>
485
+ </div>
486
+ <div class="bg-purple-50 rounded-lg p-4">
487
+ <h4 class="font-semibold text-gray-900 mb-2">Future Development:</h4>
488
+ <p class="text-gray-700 text-sm">
489
+ The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components
490
+ will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
491
+ </p>
492
+ </div>
493
  </div>
494
 
495
+ <div class="bg-blue-50 rounded-lg p-6">
496
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">📖 Citation Information</h3>
497
+ <p class="text-sm text-gray-700 mb-3">
498
+ <strong>For Grant Proposals (NSF/DOE):</strong>
 
 
499
  </p>
500
+ <div class="bg-white rounded p-4 font-mono text-sm text-gray-800">
501
+ <p class="mb-2">Welz, G. (2025). Copernicus AI: Knowledge Engine for Scientific Discovery.</p>
502
+ <p class="mb-2">Hugging Face Space. https://huggingface.co/spaces/garywelz/copernicusai</p>
503
+ <p>Live Platform: https://www.copernicusai.fyi</p>
504
+ </div>
505
  </div>
506
+ </div>
507
+ </section>
508
 
509
+ <!-- How to Cite This Work -->
510
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
511
+ <div class="bg-white rounded-xl shadow-lg p-8">
512
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
513
+ <div class="bg-gray-50 rounded-lg p-6">
514
+ <p class="text-gray-800 font-mono text-lg leading-relaxed">
515
+ Welz, G. (2024–2025). <em>CopernicusAI: AI-Generated Audio Briefings as a Research Interface</em>.<br>
516
+ Hugging Face Spaces. https://huggingface.co/spaces/garywelz/copernicusai
517
  </p>
518
  </div>
519
  </div>
520
  </section>
521
+ </div>
522
+ </section>
523
 
524
+ <!-- Grant Support & Collaboration -->
525
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
526
  <div class="bg-white rounded-xl shadow-lg p-8">
527
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">🌐 Grant Support & Collaboration</h2>
528
 
529
+ <div class="mb-6">
530
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Grant Applications Supported</h3>
531
+ <p class="text-gray-700 mb-4">
532
+ This platform is designed to support grant applications to:
533
+ </p>
534
+ <div class="grid md:grid-cols-3 gap-4">
535
+ <div class="bg-blue-50 rounded-lg p-4">
536
+ <h4 class="font-semibold text-gray-800 mb-2">NSF</h4>
537
+ <p class="text-sm text-gray-600">National Science Foundation - Science education and research infrastructure</p>
538
+ </div>
539
+ <div class="bg-green-50 rounded-lg p-4">
540
+ <h4 class="font-semibold text-gray-800 mb-2">DOE</h4>
541
+ <p class="text-sm text-gray-600">Department of Energy - Scientific computing and data science</p>
542
+ </div>
543
+ <div class="bg-purple-50 rounded-lg p-4">
544
+ <h4 class="font-semibold text-gray-800 mb-2">SAIR Foundation</h4>
545
+ <p class="text-sm text-gray-600">AI research and development initiatives</p>
546
+ </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
547
  </div>
548
  </div>
549
+
550
+ <div>
551
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Collaboration Opportunities</h3>
552
+ <ul class="text-gray-700 space-y-2">
553
+ <li>• Integration with academic institutions</li>
554
+ <li>• Partnership with research organizations</li>
555
+ <li>• Open data initiatives</li>
556
+ <li>• Educational program development</li>
557
+ </ul>
558
+ </div>
559
  </div>
560
  </section>
561
 
562
+ <!-- Links & Resources -->
563
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
564
  <div class="bg-gradient-to-r from-blue-50 to-purple-50 rounded-xl p-8">
565
+ <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">🔗 Live Platform & Resources</h2>
566
 
567
+ <div class="grid md:grid-cols-2 gap-6">
568
+ <div class="bg-white rounded-lg p-6">
569
+ <h3 class="text-xl font-semibold text-gray-800 mb-4">🌐 Production Deployment</h3>
570
+ <ul class="space-y-2">
571
+ <li>
572
+ <a href="https://www.copernicusai.fyi" target="_blank"
573
+ class="text-blue-600 hover:text-blue-800 font-medium">
574
+ 🏠 Homepage - Browse Podcasts
575
+ </a>
576
+ </li>
577
+ <li>
578
+ <a href="https://www.copernicusai.fyi/subscriber-dashboard.html" target="_blank"
579
+ class="text-blue-600 hover:text-blue-800 font-medium">
580
+ 📊 Creator Dashboard
581
+ </a>
582
+ </li>
583
+ <li>
584
+ <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml" target="_blank"
585
+ class="text-blue-600 hover:text-blue-800 font-medium">
586
+ 📡 RSS Feed
587
+ </a>
588
+ </li>
589
+ </ul>
590
  </div>
591
+
592
+ <div class="bg-white rounded-lg p-6">
593
+ <h3 class="text-xl font-semibold text-gray-800 mb-4">🧩 Knowledge Engine Components</h3>
594
+ <p class="text-sm text-gray-600 mb-4">
595
+ The CopernicusAI Knowledge Engine is an integrated ecosystem of research and collaboration tools.
596
+ Additional components will be added as the project develops.
597
+ </p>
598
+ <ul class="space-y-3">
599
+ <li>
600
+ <a href="https://huggingface.co/spaces/garywelz/programming_framework" target="_blank"
601
+ class="text-blue-600 hover:text-blue-800 font-medium">
602
+ 🛠️ Programming Framework
603
+ </a>
604
+ <p class="text-sm text-gray-600 mt-1 ml-6">
605
+ Foundational meta-tool for universal process analysis across any discipline
606
+ </p>
607
+ </li>
608
+ <li>
609
+ <a href="https://huggingface.co/spaces/garywelz/glmp" target="_blank"
610
+ class="text-blue-600 hover:text-blue-800 font-medium">
611
+ 🧬 GLMP - Genome Logic Modeling Project
612
+ </a>
613
+ <p class="text-sm text-gray-600 mt-1 ml-6">
614
+ First application of Programming Framework to biology - 50+ biological processes visualized
615
+ </p>
616
+ </li>
617
+ <li>
618
+ <a href="https://huggingface.co/spaces/garywelz/metadata_database" target="_blank"
619
+ class="text-blue-600 hover:text-blue-800 font-medium">
620
+ 📚 Research Paper Metadata Database
621
+ </a>
622
+ <p class="text-sm text-gray-600 mt-1 ml-6">
623
+ Core data infrastructure for structured research paper metadata and citation networks
624
+ </p>
625
+ </li>
626
+ <li>
627
+ <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" target="_blank"
628
+ class="text-blue-600 hover:text-blue-800 font-medium">
629
+ 🎬 Science Video Database
630
+ </a>
631
+ <p class="text-sm text-gray-600 mt-1 ml-6">
632
+ Multi-modal content component with transcript-based search for scientific videos
633
+ </p>
634
+ </li>
635
+ </ul>
636
  </div>
637
  </div>
638
  </div>
639
  </section>
640
 
641
+ <!-- API Endpoints -->
642
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
643
  <div class="bg-gray-900 text-white rounded-xl p-8">
644
+ <h2 class="text-3xl font-bold mb-6">🔌 API Documentation</h2>
645
+ <p class="text-gray-300 mb-6">Base URL: <code class="bg-gray-800 px-2 py-1 rounded">https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app</code></p>
646
 
647
+ <div class="grid md:grid-cols-3 gap-4 text-sm">
 
 
 
 
 
 
 
 
 
 
648
  <div>
649
+ <h4 class="font-semibold text-blue-300 mb-2">Podcast Generation</h4>
650
+ <ul class="space-y-1 text-gray-400">
651
+ <li>POST /generate-podcast-with-subscriber</li>
652
+ <li>GET /api/subscribers/podcasts/{id}</li>
653
+ <li>POST /api/subscribers/podcasts/submit-to-rss</li>
 
654
  </ul>
655
  </div>
656
+
657
  <div>
658
+ <h4 class="font-semibold text-blue-300 mb-2">Research Endpoints</h4>
659
+ <ul class="space-y-1 text-gray-400">
660
+ <li>POST /api/papers/upload</li>
661
+ <li>GET /api/papers/{paper_id}</li>
662
+ <li>POST /api/papers/query</li>
663
+ <li>POST /api/papers/{id}/link-podcast/{id}</li>
664
  </ul>
665
  </div>
666
+
667
  <div>
668
+ <h4 class="font-semibold text-blue-300 mb-2">Admin Endpoints</h4>
669
+ <ul class="space-y-1 text-gray-400">
670
+ <li>GET /api/admin/subscribers</li>
671
+ <li>POST /api/admin/podcasts/fix-missing-titles</li>
672
+ <li>GET /api/admin/podcasts/catalog</li>
 
673
  </ul>
674
  </div>
675
  </div>
676
  </div>
677
  </section>
678
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
679
  <!-- Footer -->
680
  <footer class="gradient-bg text-white py-8 mt-12">
681
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
682
+ <p class="text-lg font-semibold mb-2">Copernicus AI - Advancing Scientific Knowledge</p>
683
+ <p class="text-sm opacity-75">Built with Google Cloud, Gemini AI, OpenAI, Anthropic Claude, and ElevenLabs</p>
684
+ <p class="text-xs opacity-50 mt-4">&copy; 2025 Copernicus AI. All rights reserved.</p>
685
  </div>
686
  </footer>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
687
  </body>
688
  </html>