SemantaAI organization landing page
README.md
ADDED
@@ -0,0 +1,97 @@
---
title: SemantaAI
emoji: 🌐
colorFrom: teal
colorTo: orange
sdk: static
pinned: true
license: mit
---

<p align="center">
  <h1 align="center">🌐 SemantaAI</h1>
  <p align="center"><strong>Hybrid Sovereign World Intelligence OS</strong></p>
  <p align="center">
    <a href="https://semanta.xyz">semanta.xyz</a> ·
    <a href="https://github.com/ivs-123/semantaai">GitHub</a> ·
    <a href="https://api-staging.semanta.xyz">API Staging</a> ·
    <a href="https://studio-staging.semanta.xyz">Studio</a>
  </p>
</p>

---

## What is Semanta?

Semanta treats data as **worlds** (structured, evolving domains) and provides an end-to-end pipeline that runs from raw data ingestion through labeling, synthetic generation, world building, and model training to continuous observation.

**Canon:** v3.7 Final · **First Wedge:** Finance · **Mode:** Results-First

## Core Capabilities

| Layer | Modules | Description |
|-------|---------|-------------|
| 🔬 Statistical | KS, Wasserstein, MMD, Energy, KL, Anderson-Darling | Full statistical validation battery |
| 📊 Retrospective | Regime detection, structural breaks, tail analysis | Auto-extract data characteristics |
| 🔮 Continuation | Seamless branch from endpoint, N scenarios | Time-series continuation engine |
| 🧬 Generative | VAE, mixture models, Student-t, SDE, GBM, Heston | Advanced synthetic data |
| 🔗 Coherence | Cross-domain FX+credit+equities+insurance | Multi-asset scenario generation |
| 📈 Econometrics | Granger causality, ARIMA, GARCH, Hurst, t-copula | Full econometric toolkit |
| 🧠 ML/DL | Random Forest, XGBoost, LightGBM, LSTM, Transformer | Real ML model wrappers |
| ⚡ Options | Black-Scholes, Greeks, Binomial, Asian, Barrier | Options pricing engine |
| 📉 Risk | VaR, CVaR, Stress Testing, Kelly, CPPI, Risk Parity | Enterprise risk management |
| 🔁 Backtesting | Walk-forward, Monte Carlo, Regime robustness | Strategy validation |
| 🏗️ Infrastructure | Logging, Profiling, CI/CD, Docker, K8s, Notifications | Production-ready |

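To make the Statistical layer concrete, here is a minimal, standard-library sketch of the first test in the table, the two-sample Kolmogorov-Smirnov (KS) statistic. It is an illustration only: `ks_statistic` is a hypothetical name, and nothing here is taken from Semanta's own implementation.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: the largest gap between the two empirical CDFs.

    Hypothetical helper for illustration; not part of the Semanta API.
    """
    if not sample_a or not sample_b:
        raise ValueError("both samples must be non-empty")
    a = sorted(sample_a)
    b = sorted(sample_b)
    largest_gap = 0.0
    # Both empirical CDFs are step functions that only change at observed
    # values, so it is enough to compare them at every observed point.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)  # share of a that is <= x
        cdf_b = bisect_right(b, x) / len(b)  # share of b that is <= x
        largest_gap = max(largest_gap, abs(cdf_a - cdf_b))
    return largest_gap
```

A value near 0 means the real and synthetic samples have similar distributions; a value near 1 means they barely overlap. Turning the statistic into a pass/fail decision requires a critical value or p-value, which this sketch leaves out.
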
## Datasets

| Dataset | Type | Status |
|---------|------|--------|
| [semantaai-crypto_assets](https://huggingface.co/datasets/SemantaAI/semantaai-crypto_assets) | Crypto OHLCV + labels | Published |
| [semantaai-fx-majors](https://huggingface.co/datasets/SemantaAI/semantaai-fx-majors) | FX majors (EUR/USD, GBP/USD, etc.) | Published |
| [semantaai-fx-other](https://huggingface.co/datasets/SemantaAI/semantaai-fx-other) | FX crosses | Published |
| [semantaai-test-dataset](https://huggingface.co/datasets/SemantaAI/semantaai-test-dataset) | Test/sample data | Published |

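One plausible way to pull a published dataset is the Hugging Face `datasets` library. The sketch below assumes the repositories load with their default configuration; the README does not state split or column names, so inspect the returned object before relying on any. `DATASET_REPOS` and `load_semanta_dataset` are hypothetical names; only the repository IDs come from the table.

```python
# Repository IDs copied from the Datasets table; the short keys are made up
# here for convenience.
DATASET_REPOS = {
    "crypto": "SemantaAI/semantaai-crypto_assets",
    "fx-majors": "SemantaAI/semantaai-fx-majors",
    "fx-other": "SemantaAI/semantaai-fx-other",
    "test": "SemantaAI/semantaai-test-dataset",
}


def load_semanta_dataset(key):
    """Download one of the listed datasets from the Hugging Face Hub.

    Hypothetical helper. Requires `pip install datasets` and network access.
    """
    # Imported lazily so the mapping above is usable without the dependency.
    from datasets import load_dataset

    return load_dataset(DATASET_REPOS[key])


# Example (needs network access):
# ds = load_semanta_dataset("fx-majors")
# print(ds)  # shows the available splits and columns
```
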
## Quick Start

```python
from core.synthetic import retrospective_analysis, continue_from_endpoint
from core.synthetic.statistical import run_statistical_battery

# Load your data (series truncated here; replace `...` with the full history)
prices = [1.08, 1.081, 1.083, ...]  # EUR/USD prices

# Analyze historical patterns
retro = retrospective_analysis(prices)
print(f"Regimes detected: {retro['regimes']['count']}")
print(f"Annualized vol: {retro['overall']['annualized_vol']}")

# Generate future scenarios
cont = continue_from_endpoint(prices, n_scenarios=4, n_steps=30)
for s in cont['scenarios']:
    print(f"  {s['label']}: total_return={s['total_return']:.2%}")

# Run statistical battery
# Real sample: the last 100 prices (fewer if the series is shorter)
real = [{"v": p} for p in prices[-100:]]
# Synthetic sample: the final price of each generated scenario
syn = [{"v": s['path'][-1]['price']} for s in cont['scenarios']]
battery = run_statistical_battery(real, syn, key='v')
print(f"Quality: {battery['conclusion']} ({battery['battery_passed']}/{battery['battery_total']})")
```

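For intuition about the quantities the Quick Start prints, the sketch below computes an annualized volatility and one continuation path with the standard library only. It rests on loud assumptions: 252 periods per year, and i.i.d. normal log returns (a discrete geometric Brownian motion). The function names are hypothetical, and this is not a description of how `retrospective_analysis` or `continue_from_endpoint` work internally.

```python
import math
import random
import statistics


def log_returns(prices):
    """Log return between each pair of consecutive prices."""
    return [math.log(curr / prev) for prev, curr in zip(prices, prices[1:])]


def annualized_vol(prices, periods_per_year=252):
    """Sample std of log returns, scaled by sqrt(periods per year).

    The 252-period convention is an assumption; needs at least 3 prices.
    """
    return statistics.stdev(log_returns(prices)) * math.sqrt(periods_per_year)


def gbm_continuation(prices, n_steps=30, seed=None):
    """One simulated path that starts exactly at the last observed price.

    Each step draws a normal log return with the historical mean and std.
    """
    rng = random.Random(seed)
    rets = log_returns(prices)
    mu = statistics.fmean(rets)
    sigma = statistics.stdev(rets)
    path = [prices[-1]]
    for _ in range(n_steps):
        path.append(path[-1] * math.exp(rng.gauss(mu, sigma)))
    return path
```

Calling `gbm_continuation(prices, n_steps=30, seed=1)` with different seeds gives a crude analogue of multiple scenarios. Because the path starts at `prices[-1]`, there is no jump at the join, which is the property the Continuation layer describes as a seamless branch from the endpoint.
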
## Stats

```
Tests: 834 (all passing)
Modules: 135 synthetic + 20 infrastructure
Functions: 907
LOC: 26,904
Production status: PRODUCTION READY
Stubs in own code: 0
```

---

<p align="center">
  <strong>Semanta v3.7 Final · semanta.xyz</strong><br>
  Built with Codex + OpenCode · May 2026
</p>